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Starting from the ordinary definition of the pair distribution function, the following theorem is proved 
without the help of the cluster integral expansion and “‘toron”’ diagrams: the pair distribution function in the 
grand canonical ensemble can be expressed in terms of the two-body propagator corresponding to the scat 
tering in position-reciprocal temperature space. A simple expression for the internal energy of a nonideal gas 
with pair interactions is obtained in terms of the pair distribution function. 


INTRODUCTION 


URING the past few years many important con- 
tributions have been made to the many-body 
problem of quantum statistics. In this connection it 
was pointed out independently by Matsubara’ and 
Montroll and Ward? that the evaluation of the grand 
canonical partition function could be carried out in a 
manner similar to that of vacuum fluctuation in the 
quantum field theory. Fujita, Isihara, and Montroll* 
showed the similar analogy between the pair distribution 
function in the grand canonical ensemble and the two- 
body propagator, using the cluster integral expansion 
formula‘ and ‘“‘toron” diagrams.? However, the theorem 
that the pair distribution function in the grand canon- 
ical ensemble can be expressed in terms of the two-body 
propagator corresponding to the scattering in position- 
reciprocal temperature space, can also be proved in a 
simple and straightforward manner using only familiar 
concepts of second quantization. This makes unneces- 
sary the use of theorems pertaining to the ‘“‘torons” as 
well as theorems pertaining to advanced field theory. 
Starting from the ordinary definition of the pair 
distribution function x® and working with number 


* Work supported by the Office of Naval Research. 

1T, Matsubara, Progr. Theoret. Phys. (Kyoto) 14, 351 (1955). 

2 E. W. Montroll and J. C. Ward, Phys. Fluids 1, 55 (1958). 

3 Ph.D. thesis by Fujita to be submitted to Graduate School of 
University of Maryland. Its abstract was presented at the 1959 
Annual Meeting at New York [ Fujita, Isihara, and Montroll, Bull. 
Am. Phys. Soc. Ser. II, 4, 14 (1959). 
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representation, we first derive a compact expression for 
n®) in terms of the usual quantized field operators y 
and y': 

Tr [yb ( rep (rie F4 ty (1) Wy (12) ] 


n)(fPo) = 
Tr [e pH+aN’| 


Here H, \, e*, and 8 are the Hamiltonian, the total 
number (operator), the fugacity, and the reciprocal 
temperature, respectively, of the system. Transcription 
of the above by means of interaction picture operators 
leads to the proof of the above-mentioned theorem. 

The connection between the pair distribution function 
and the correlation function introduced by Martin and 
Schwinger® and Klein and Prange® will be discussed in 
Appendix IT. 

If the Hamiltonian consists of a kinetic energy part 
and the sum of the pair interactions with coupling 
strength g, i.e., 


N V 
H=L p?+ L gvr(ri—ry), 
j=l i>j=1 


the internal energy / in the grand canonical ensemble 
can be expressed by the following simple expression in 
terms of the pair distribution function: 


0 4) 
£(8)= Bian +4 f ag’ f fare or r’) B 
0 0B 


Xn (rr’; Bg’). 


5P. C. Martin and J. Schwinger, Bull. Am. Phys. Soc. Ser. IT, 
3, 202 (1958). 
6 A. Klein and R. Prange, Phys. Rev. 112, 994 (1958). 
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Here Lideai is the kinetic energy of the ideal gas with 
the same statistics. This is shown in Appendix IV. 

The following units are taken throughout in the 
note: = 2m=1, where h is 2x times Planck’s constant 
and m is the mass of a particle. 


PROOF 


Let us consider a system of N identical particles with 
the Hamiltonian 


N 


N } 
Hy=>d p2+ Dd 24;=Hot+M, (1) 
i=l i<j 


where p; is the momentum of the 7th particle and 2,; 
the interaction potential between the ith and jth 
particle. The position space representation of the 
density matrix is usually defined by 

p(r’® 2%) =(0'% | e-PAN | p) 

=D V(r’ )e PPh, *(r%), (2) 


where the functions {¥,(r¥)} are properly symmetrized 
eigenfunctions of the Hamiltonian Hy, E; being the 
kth eigenvalue; the asterisk denotes complex conjugate 
and @ is the reciprocal temperature (k7)~', k being the 
Boltzmann constant. 

If we choose the density matrix operator written in 
terms of quantized field operators, its coordinate repre- 
sentation is easily shown’ to be 


p(r’’r¥) = (@y| V(r’ Je Binyt (fp) |@o), (3) 


where W(r*) is a normalized wave field (operator) 
defined by 


¥(r*)=(N TI v(x); (4) 


j=l 


|@o) stands for the vacuum state. The field y will be 
expanded in the usual way: 


v(r)=V-4 De’? *a(p), (5) 


where a(p) is the annihilation operator [a'(p) the 
creation operator] and V is the volume of normaliza- 
tion. The proper symmetrization (statistics) may be 
taken care of by imposing the commutation relations 
between creation and annihilation operators : 


[a(p),at(p’) } =5p.p’, 
La(p),a(p’) }+=[La'(p),a'(p’) > =0, (6) 


(upper sign for bosons, lower sign for fermions) 


where 65,, is the three-dimensional Kronecker’s delta- 
function. H is now the Hamiltonian in second quan- 


7A proof is given in Appendix I. 


FUJITA 


tization defined by 
H= - fermvwats f era 


XV (DY (r’)o({r—r' | )y(rY(r) (7) 
=) p’a'(p)a(p)+2 DL a'(pitq) 
P 


Pl, p2.q 
Xa'(po—q)u(q)a(ps)a(p:) (8) 
= Ho+H,, (9) 


where «(q) is the Fourier transform of the interaction 
potential 


u(q)=V-4 J dr e'**0(r). (10) 


The /-body distribution function in the canonical 
ensemble is defined by 


al 


Gries s@ ry p(rXrX), (11) 


ny (2!) =——__ — 


(V—2)!Zy 


where Zy is the partition function: 


aye f dna: --@ry p(r%,r). (12) 


In particular, the pair distribution function is 


N! 
nN (2) ( rT2) _ —om dr3: + ary p(r’r%). 


(N—2)!Zy 


(13) 


Since the following discussion is equally applicable 
to the general /-body distribution function with appro- 
priate change in the definition, we confine our dis- 
cussion to the pair distribution function hereafter. 

Substituting (3) into (13), we have 


N(N-1) 1 

ny) (42) =————— fern dP ry— 

ZN N! 
X (Pol|¥ (tw): ‘ W(1s){P(12)p(11) 

Xe FAY (ri )Yt (12) }W (43) ‘> -W' (ry) |@o). (14) 


Noting that [(N—2)!]-!y'(r3)- --¥* (rw) |&o) is just the 
properly symmetrized position ket-vector of (V—2) 
particles, i.e.,° 
L(N—2)!-"(15)-- -¥' (tw) | £0) 

= | r3° a rvy)= | rX-2) 


(15) 


we rewrite the expression (14) as 


ny) (ty8e) =Zy 


for Bry (rh | (12) (11) 


- Ke PAY (1 )y" (12) |r). 
§ See Appendix I for the proof. 


(16) 
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The integration above is nothing but the trace of the 
quantity (ro)y(r)eF4y" (r:)¥' (re) with the restriction 
that the number of particles is fixed equal to N—2. 
Hence we obtain for the pair distribution function in 
the canonical ensemble, 


ny® (rr2)=Zy Tr’ [v(re)W (rie F*yt (r yy" (12) J, (17) 
where the prime means the stated restriction. In the 
occupation number representation it will be written as 
)=Zyt Din’ (n| (12) (11) 

Ke FAY! (1 )p' (12) | n) 


NN ©) (rire 
(18) 
with the restriction 

Da Na=N—2. 


Here |) is the abbreviation for the occupation number 
state: 


(19) 


(20) 


N)= | MiNe-**Na-**), 


and mq denotes the number of particles occupying the 
momentum eigenstate a. 

Now the pair distribution function in the grand 
canonical ensemble is defined by 


t . ‘ 
i a = ny (1yr2)Zy2%, (21) 
Zq N=2 


-G * 


n) (rye) 

where Zg is the grand partition function defined by 
(22) 
and 2z is the fugacity. Substituting my from (18) into 


(21), we have 


1 L 
n®) (12) =— DY 2% O'(n|y(r2)o(1) 


7g N=2 
Ke PAY" (ry)! (2) | 2) 


W(ri)e~ BH+a Nyt (rr, yWr(r 


(23) 
2) | m) 

(24) 
o~ fiy [yi ro)p( re FA + aNyt ( ry" ( r2) |, 


1 
> (n\ p(12) 


where we have taken 
s=e*, (25) 


and NV is now an operator describing the total number, 


N= fa ¥' (r)y(r) 


=>, a'(p)a(p). 
The quantity 7¢ defined in (22) can also be expressed as 


Zo=Tr [e pH+aN ) 


(27) 


(28) 


Hence for the pair distribution function in the grand 
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canonical ensemble, we obtain a compact expression: 


Tr LY (re) (ile PU reNYt (rp ( (12) J 


n) (442) =———— 
(rit) Tr [e- pH+aN} 





This can be written in a slightly different manner. If 
we define S(8) such that 


e-BH — ¢-BH0S (QR). (30) 


it is well-known from operator algebra that S(8) can 
be expressed — by an infinite series: 


~f- fas dB, 


X PLH1(81)-: 


8 
= P| exn( = J u(3'as') | 
0 


where 


-H1(Bn) } 
(31) 


H,(B') = 8’ 40H e-8' Ho, (32) 


5(8) is an analog to the S-matrix in field theory, where 


the latter is defined by 


S= | exr( - if Hy( ‘nat ) | 


4) 


(33) 


with 


Hyt')=e'' oH ye #0, (34) 


P is Dyson’s chronological operator, which has the 
proy erty that operating on a product of time- (reciprocal 
temperature)-labeled operators, it rearranges them in 
the same order as the time sequence of their label, the 
latest one in time occurring first in the product: 


PC A1(t)+ ++ Hr (tn) J= Air (ti) Ar (tj): +-Hi(te) (35) 


4>1j>++ >t 


Substituting (30) into (29), we have 


Tr [¥(12)¥ (rie Phot aN S (BY (i) ( (73) ] 
Tr [e-bllotaN 5(8y] 


n®) (rr) = 

(36) 

where we have used the fact that the Hamiltonian H 

defined in (7) as well as Ho in (9) commute with the 

number operator JV. 

Defining the interaction picture field by a similarity 

transformation: 
¥(1,8)= OH oy (r)e-#M0 
=V-!>*, a(p) exp(ip- r—8p") 


(37) 
(38) 
and 


yp" (r,8)=e8" ol (re pHo 
= | 4+>°, a'(p) exp(—ip-r—Bp"), 


(37a) 
(38a) 
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the expression (36) is again rewritten as 


n®) (rr) =— 
If we now define a quantity by® 


K (2 (24X33 XX») =— 


where T is Wick’s chronological operator defined by 
T=5pP, (41) 


dp being the sign plus or minus according to whether 
the permutation of fermion factors is even or odd, and 
x;= (r;,8:), (42) 

then n®) (rire) will be written as 
n® (ryre)=e4*K © (ro8 118; 1,0 120). (43) 


It is apparent that the quantity defined in (40) has 
a close similarity to the two-body propagation kernel 
appearing in the quantum field theory, where it is 
defined by 
K ¢ (x4,%33 X1,%2) 
(By! Tel xaWel xs) Swe! (aye (x2) || Po 
7 (Do S| Bo) 


, (44) 


where ¥.(x)=y.(r,t) is the field in the interaction 
picture and S is the scattering matrix defined already 
in (33). In fact the only differences between the two 
propagators (40) and (44) are that (1) in the quantum 
statistical case the scattering takes place in the interval 
zero to 8, while in scattering theory it is supposed to 
occur in the infinitely long time interval (—*, +), 
and (2) the expectation value is taken over the free- 
particle grand canonical ensemble in the former instead 
of between the noninteracting vacuum state, and 
finally (3) it was replaced by 8 everywhere in the for- 
malism of the quantum statistical case. 

In order to see the properties of A? (44,735 ¥1,%2), it 
is suggested that .$(8) be expanded as in (31) aut that 
the Feynman diagrams be drawn corresponding to each 
term of its expansion. However, since the analysis of 
K,®@ will be found in the standard textbooks,” it will 
be omitted here except for mentioning that a compact 
readable analysis was made by Salpeter and Bethe" in 
the case of quantum field theory. 

An explicit analysis along this line is in progress in 
the hope of calculating the pair distribution function of 


® A similar quantity to this was defined and referred to as the 
correlation function without proof by Matsubara.! Martin and 
Schwinger,’ and Klein and Prange® also discussed similar quan 
tities using Heisenberg field operators. See Appendix II. 

1 See, e.g., Schweber, Bethe, and de Hoffmann, Mesons and 
Fields (Row Peterson Company, Evanston, 1955), Vol. 1, Chap. 
8, especially Chap. 25, Sec. 25f. 

il, E, Salpeter and H. A. Bethe, Phys. Rev. 84, 1232 (1951). 


Tr [eP# ete N42) (ro BW (11,8)S (B)Y"* (11,0) ¥" (12,0) J 
Tr [e bHotaN §(8) | 


Tr Le BHotaN Ty xa) rs) S(BW (iW -~ 
Tr [e-BHotaN $(g) ] | 
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: (39) 


(40) 





various quantum gases (electron gas, hard-sphere 
quantum gas, etc.). 

The application of (37) to ideal gases is briefly dis- 
cussed in Appendix III. 
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APPENDIX I. DENSITY MATRIX IN 
SECOND QUANTIZATION 


A simple proof” of the expression (3) is given by 
means of Dirac’s transformation theory. We wish to 
show that 


p(n’ ® e%) = (0% | eF4 | r) 
= (By! V(r!" eF4 T(r) | bo). (ATA) 


Since the form of the Hamiltonian in terms of creation 
and annihilation operators is well known [see expression 
(8) ], it will be sufficient to expand the properly sym- 
metrized position ket-vector |r’) in terms of the 
number ket-vector and show the relation: 


irY)=Wt(r¥) |p). (A.1.2) 


This is easily carried out in the following way. For 
definiteness, we consider first the case of bosons. Let us 
expand the symmetrized position ket-vector |r‘) in 
terms of eigen-ket-vectors |H ’) of the free-particle 
Hamiltonian Hp, i.e., 

|r¥)=>> | Ho')(Ho' |r). 


Ho’ 


(A.1.3) 


The eigenvalue Ho may be specified by the set {kj} of 
N single momentum eigenvalues. (H’|r%) is the sym- 
metric wave function usually found in the standard 
textbooks, where it is given by 


(Ho | 2%)=({k,;} |r) 
" 
=—N!V9 TT (ne!) FX Pr )IT 
a r 7=1 


Xexp(—ik;-1r;),  (A.1.4) 


2 An alternative, simple, and elementary proof was shown ro 
the author by A. J. F. Siegert. This proof proceeds directly from 
the equation of motion for the field operators in the Heisenberg 
picture to the Bloch equation for p. 
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where #q is the number of particles with momentum a 
in the set {k;} and is subject to the condition 


Data=N; (A.1.5) 


the permutation P({r;}) operates on the suffixes {7} 
of position coordinates {r;}. 

Substituting (A.I.4) into (A.I.3) and commuting the 
order of summations with respect to P({r;}) and {k;}, 
we have 


Jry)=d P(r) a CNV !V% J] (ma!) }-3 


N 
XI] exp(—ik;-r;) | {k;}). 
7=1 


If we note the equality 
-({r,})T] exp( —ik;- ro) {k;} ) 
] 


= P-({k,})]] exp(—ik;-r,)|{kj}),  (A.1.7) 
d 

where P-'({k;}) is the inverse operator of P but acts 

on the suffixes of {kj}, then (A.1.6) is again rewritten as 


)=> P7({k;}) > CNIVY I (2!) + 
P ky +» «ky a 


“ 


XI] exp(—ik;-r;)| {k;}) 


vt ,; 
- = AG |) Mert ik;-1,) | {k;}). 
V* TIa( 


(A.1.8) 
Projecting this into the number ket- ‘vector 


| {na}), we 
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obtain 
« rT} i 
ingle 
V¥ Tl a(ta!) 


a(Ma! 
TT exp —ik;- w(— — "y {na}) (A.1.9) 


7=1 


=z 


ki ---KN 


N 
V/2 T] exp(—ik;-1;) | {7a}). 
j=l 


If we now recall the well-known rule of construction 
of the number ket-vector: 


| {a} = (N)~ [T {at (a)}"*| 0) 
“ (A.1.10) 


N 
=(N!)- JJ at(kj) | ®o), 


j=1 


the ket-vector |r’) takes the form 


rv) on 


(k;) exp(—ik;-rj) | Po) 
(A.1.11) 


N!)- MILv A yujat 


. 
= (NV!) TT Wt (r,) |b0) =" (r¥) | 0). 


7]=1 


This completes the proof of (A.I.2). 
The similar argument holds for fermions, in which 
case the wave function (A.I.4) should be replaced by 


(Hy |r%)=(NIVY)ADY P({r;})bp 
P 
N 
<> exp(—ik;-1rj), (A.1.12) 
j=1 


and the final expression (A.I.11) ends up with a product 
of anticommuting fields. 


APPENDIX II 


In the text we have employed the interaction picture to show the analogy between the pair distribution function 
and two-body propagator. It is of course possible to use the Heisenberg picture for this end. In reality it is simpler 


to do so. 
Defining Heisenberg fields by 
vi (1,8) =e*y (re P4, 


the expression (29) can be written 


n) (rr) = 


Wi" (r,8)=e8 4 (r)eF4, 


Tr Le P4 te N42) 1 (roB) Wn (1i8)Wu" (11,0)Wn" (12,0) ] 
Tr [eB# +N) 


(A.II.1) 


(A.IT.2) 


’ 


which is equal to e?*K® (123,1,8; 1,0,r.0). More generally, it is apparent that the two-body propagator can also 


be written as 
K@ ”) ( X4X3 5 


XX) = 


which is again an analog to the known result: 


Tr Le PXtoN Thi (xsbu(xsWa’ (awa (x2) }] 


(A.1T.3) 


’ 


Tr [e BH+ all | 


K ,° (44X33 1X2) = (Py’ Tl Wirelxs Wael(x } Wu t' (X1 Wire! (v2) Py’ P 


where |’) is the vacuum of the interacting field. 
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Martin and Schwinger and Klein and Prange have used the same form (A.II.3). 


APPENDIX III. FREE-PARTICLE PAIR DISTRIBUTION FUNCTION 


As an example, we consider noninteracting quantum gases: 
H,=0; (A.ITI.1) 


hence 
5(6)=1. 


The pair distribution function is now given by 


Tr [e PA ot aN tay (98) (118)Y" (110)Y* (120) ] 
—_—_———., (A.IIT.2) 


n™ (1yFe) = amg 
Tr [e“BiteNn | 


Upon using the expansion (35 and 35a), this is reduced to 


n) (ryt) =V-2 DY (Le !P-P2) «(1-*2)) exp(— p38+a) exp(— ps’8+a) 
Pip 
Tr [e~*#ot eX {0 1+-n( pi) [1+ (pe) ]+6pi pol 1+ (ps) ]} J \.111.3) 
acai ‘ ——__—__—___— ——, (A. oc 


Tr [e -BH o+ aN] 


After careful calculation of the trace, we arrive at a simple expression: 


exp(—8p’+a) ) 
‘ inicio —e'P (rire) 


1 _ exp( —Bp’+a) - 1 .. 
n'*)(ryPo) ( ee ) ( > —— 
V p 1¥Fexp(—Sp?+a) V p 1¥Fexp(—Sp’+a) 


1 exp( —pBp-+a) ' 
“'—( Sy . e'P (rire ) z (A.IIT.4) 
J 


p 1Fexp(—6p’+a) 
where # is the number density. This is the well-known London-Placzek formula.” 


APPENDIX IV. INTERNAL ENERGY IN TERMS OF THE PAIR DISTRIBUTION FUNCTION 


Consider a system with the Hamiltonian of the form 


H=Ho+glr- — fervanvv(ntie f arty V'(n)y'(r')o(r— ry (ro (0), (A.IV.1) 


where g is the coupling strength. 
From the basic assumption of statistical mechanics, the internal energy / of the system is given by the grand 


canonical average of the Hamiltonian: 
E(g)=Tr [ He-P# +4" )/Tr [e868 +eN }. (A.IV.2) 


Differentiating with respect to the coupling strength g, we obtain 
a Tite? "|] hrigac*™) Trac) irae") 
= —8 8 a 
Og Tr [e BH+aN ] Tr [e BH+aN | {Tr{ ¢-P8+e" }}? 


Tr [ Hye ten | 
+8 : ey i 
Tr [e BH+aN] Tr [e BH+aN | 


(0/08) Tr [Hye #4 +2 | Tr [Hye-8# +2" ](0/08) Tr [eb4+2% | ; 
—B SSR PEESEP eo : (A.IV.3) 
tar [ePHt+aN }\2 
Here we have used the differentiation formula: 


re) 
eB Motolt1 | B Tr (f(D) MyeP Horo tan), (A.1V.4) 


B re sa 
Og 


8 F, London, J. Chem. Phys. 11, 203 (1943); G. Placzek, Proceedings of Second Berkeley Symposium on Mathematical Statistics 


and Probability, p. 581 (1950). 
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where f(H) is an arbitrary function of H. The formula may be proved in the following way. 


Bd de -Batan 
L=Tr| f()— gu 

i Qri Og s—(Ho+gH)) 
ae a 1 
f(D— g ON basa lit ect | 
L 2ri i> (Ho+gH1) s= (Ho+gHr) 


Z 1 1 
f( H)H,— gis e BstaN___ mt | 
: 2mi (s—H)? 
= —6 Tr[f(H)Hye-F# +4" 7, 





Now, rearrangement of the terms in (A.IV.3) gives 


OE Tr[Hre#t+eN)] = 9. Tr [Hye~F#+4N | 


0g : Tr [e~SH+ eN’) 0p Tr [e-bu+aN] ; 
a e [Hye-PH +N] ) 


98 \ Tr[e-u+aNy] 


Substituting H; from (A.IV.1) into the quantity appearing in the brackets, we have 
Tr [Hye satan} Tr [yt (rly (rw (r'Y (De pH+aN) 
=f Trleourer] 


: f Tr [Y(r')y(r)eP#+eNyt (r)yt(r’) ] 


a d'rd*r' u(r—r')— 
Tr [e-B# tan) 


I< of pm erserntmereneeom, (A.IV.6) 
Tr [e BH+aN | 


The last factor of the right side is just the pair distribution function n®)(r-r’) in the grand canonical ensemble, 
give in the text in Eq. (29). Hence substitution of (A.IV.6) into (A.IV.5) gives 

0k 0 

—= if array o(r—r’)—Bn (r,r’; Bg). 

Og 0g 


Integrating over the coupling strength g, we have 


g 0 
k(g)= Bo+3f ag’ f era? v(r—r’)—Bn®)(rr’; B,g’), 
0 0B 
where E(0) is the kinetic energy of the ideal gas which is given by 
‘ 2 ' 
E(0)=—— f 4rp'dp p’{(p), (A.IV.9) 


(2m)3 J, 


f(p) being the Bose or Fermi distribution function according to the statistics of the system. 
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This is the first of a series of papers dealing with many-particle 
systems from a unified, nonperturbative point of view. It contains 
derivations and discussions of various field-theoretical techniques 
which will be applied in subsequent papers. In a short introduction 
the general method of approach is summarized, and its relationship 
to other field-theoretic problems indicated. In the second section 
the macroscopic properties of the spectra of many-particle systems 
are described. Asymptotic evaluations are performed which 
characterize these macroscopic features in terms of intensive 
parameters, and the relationship of these parameters to thermo 
dynamics is discussed. The special characteristics of the ground 
state are shown to follow as a limiting case of the asymptotic 
evaluations. The third section is devoted to the time-dependent 
field correlation functions, or Green’s functions, which describe 
the microscopic behavior of a multiparticle system. These func 
tions are defined, and related to intensive macroscopic variables 
when the energy and number of particles are large. Spectral repre 
sentations and other properties of various one-particle Green’s 
functions are derived. In the fourth section the treatment of non- 
equilibrium processes is considered. As a particular example, the 
electromagnetic properties of a system are expressed in terms of 


I. INTRODUCTION 
N any physical system the properties actually subject 
to measurement are few in number. These attri- 

butes, energy, momentum, angular momentum, number 
of particles, and the like, are distinguished by the 
feature that each is related to an invariance property 
under a simple group of transformations. In the case 
of energy-momentum it is the group of space-time 
translations, while for angular momentum, it is the 
rotation group. Associated with each of these groups 
there is a continuous unitary representation which is 
characterized by a set of Hermitian generators. In 
quantum field theory, these Hermitian operators are 
constructed as space integrals of products of field 
operators, ¥(r/), y'(ri). At a given time, the field 
operators have simple commutation properties charac- 
teristic of the particle statistics. 

A many-particle system, in the context of quantum 
field theory, is one for which the eigenvalue of the 
number operator is large. In such a system, physically 
recognizable changes in energy are so huge compared 
to the energy intervals between neighboring states that 
the energy levels may be assumed to vary continuously. 
When a system is this large, the quantities of interest 
naturally fall into two categories. The first concerns the 
behavior of extensive quantities such as energy and 
number for which only macroscopic changes are meas- 
urable; the second refers to microscopic features in- 


* A preliminary resume of this work was reported in Bull. Am. 
Phys. Soc. 3, 202 (1958). 

t We wish to acknowledge the hospitality of the University of 
Wisconsin where part of the paper was written. 


the special two-particle Green’s function which describes current 
correlation. The discussion yields specifically a fluctuation-dis- 
sipation theorem, a sum rule for conductivity, and certain disper- 
sion relations. The fifth section deals with the differential equa- 
tions which determine the Green's functions. The boundary 
conditions that characterize the Green's function equations are 
exhibited without reference to adiabatic decoupling. A method 
for solving the equations approximately, by treating the corre- 
lations among successively larger numbers of particles, is con- 
sidered. The first approximation in this sequence is shown to 
yield a generalized Hartree-like equation. A related, but rigorous, 
identity for the single-particle Green’s function is then derived. 
A second approximation, which takes certain two-particle corre- 
lations into account, is shown to produce various additional 
effects: The interaction between particles is altered in a manner 
characterized by the intensive macroscopic parameters, and the 
modification and spread of the energy-momentum relation come 
into play. In the final section compact formal expressions for the 
Green’s functions and other physical quantities are derived. 
Alternative equations and systematic approximations for the 
Green’s functions are obtained. 


volving changes in energy and number that are neg- 
ligible on the macroscopic scale. 

The purpose of this paper is to develop general 
methods for treating multiparticle systems from the 
quantum field-theoretical viewpoint.’ In this discussion 
it will prove useful to employ and extend the mathe- 
matical techniques which were devised for application 
to relativistic quantum field theory. These same tech- 


!The many-body problem has been studied with the aid of 
perturbation theory by many authors. This paper will not draw 
on any results of these works but has various points of contact 
with them. We mention the work of T. Matsubara, Progr. 
Theoret. Phys. (Kyoto) 14, 351 (1955); K. M. Watson, Phys. Rev. 
103, 489 (1956); W. Riesenfeld and K. M. Watson, Phys. Rev. 
104, 492 (1956), 108, 518 (1957), and other articles; K. Brueckner, 
Phys. Rev. 100, 36 (1955), and other articles; K. Brueckner and 
S. L. Gammel, Phys. Rev. 109, 1038 (1958); J. Goldstone, Proc. 
Roy. Soc. (London) A239, 267 (1957); L. Van Hove, Physica 22, 
343 (1956), and other articles; N. M. Hugenholz, Physica 23, 481 
(1957), and other articles; Huang, Lee, and Yang, Lecture at 
Stevens Institute Conference on Many-Body Problems, 1957 (to 
be published); Lee, Huang, and Yang, Phys. Rev. 106, 1135 
(1957); T. D. Lee and C. N. Yang, Phys. Rev. 112, 1419 (1958), 
and 113, 1406 (1959); Schafroth, Butler, and Blatt, Helv. Phys. 
Acta 30, 93 (1957); E. Montroll and J. Ward, Phys. Fluids 1, 55 
(1958); J. Hubbard, Proc. Roy. Soc. (London) A240, 539 (1957), 
and other articles; J. Lindhardt, Kgl. Danske Videnskab. Selskab, 
Mat.-fys. Medd. 28, No. 8 (1954); P. Noziéres and D. Pines, 
Nuovo cimento 9, 470 (1958); A. Klein and R. Prange, Phys. Rev. 
112, 994, 1008 (1958); C. DeDominicis and C. Bloch, Nuclear 
Phys. 7, 459 (1958), and other articles; R. H. Kraichnan, Phys. 
Rev. 112, 1054 (1958); S. T. Beliaev, J. Exptl. Theoret. Phys. 
(U.S.S.R.) 34, 417 (1958) [translation: Soviet Phys. JETP 7, 289 
(1958) ]. A spirit similar to ours occurs in the work of V. M. 
Galitskii and A. B. Migdal, J. Exptl. Theoret. Phys. (U.S.S.R.) 
34, 139 (1958) [translation: Soviet Phys. JETP 7, 96 (1958) ]; 
the note of L. Landau, J. Exptl. Theoret. Phys. (U.S.S.R.) 34, 
262 (1958) [translation: Soviet Phys. JETP 7, 182 (1958) ]; and 
in E. S. Fradkin, J. Exptl. Theoret. Phys. (U.S.S.R.) 36, 951, 1286 
(1959). 
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niques, it will appear, achieve in a natural manner the 
above-mentioned separation of the macroscopic and 
microscopic domains. In the former domain they lend 
themselves to the discussion of the spectral densities of 
the extensive constants of motion; in the latter domain 
they provide a characterization of the local properties 
of the system. It is only the second class of problem 
which is usually considered in relativistic quantum field 
theory where the vacuum appears as the natural 
reference state. 

The two aspects of the many-particle problem are 
intimately related, of course. Indeed, the most feasible 
method for determining the macroscopic properties of 
the system is by integrating the differential equations 
which relate them to the microscopic behavior. The 
quantities that fully describe the local behavior, and 
which thereby serve to characterize both the macro- 
scopic and microscopic aspects of the situation, are 
time-dependent field correlation functions, or, in the 
language of field theory, Green’s functions. With the 
aid of these functions it is possible to present in a unified 
manner all aspects of large systems, both static and 
kinetic. The information thus provided is purely 
dynamical, but it occurs in a form which is immediately 
applicable to the statistical mechanical treatment of 
equilibrium phenomena. 


II. MACROSCOPIC PROPERTIES. DENSITY OF STATES 


The spectral measure of the commuting Hermitian 
operators which describe the constants of motion is 
most conveniently found in terms of the unitary 
operator representation that originally determined these 
Hermitian operators. More specifically, Fourier trans- 
formation of the traces of the group of unitary trans- 
formations yields the desired measure. For example, if 
a system has a Hamiltonian, 1, and a number operator, 
V, the formal function 


(2.1) 


generates the spectral density of the commuting quan- 
tities, energy and number. The expression (2.1) may 
be written in terms of Py, the projection operator for 
states of given energy and number, or, more explicitly, 
as 


eV =>” e~N-iEt Ty Py p => eiM-iEt Tryp 1, (2.2) 
NE NE 


with the summation extending over energy eigenvalues, 
#, and number eigenvalues,? .V. In terms of a formal 
spectral density, p(.VE), Eq. (2.2) becomes 


(2.3) 


eV (id. ir) =>" ¢€ fac e—*¥t9(NE). 
N 


* The use of the same symbol for the number operator and its 
eigenvalue causes no difficulty in context. 
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From it the spectral density is determined by Fourier 
transformation, ; 


™dry f* dr 
pve)= f J eftiiet wie, (24) 
2a x 2a 


However, the summation (2.1) of an infinite number 
of increasingly rapidly varying exponentials is only 
formal and requires a summability procedure to give it 
meaning. The most useful way of giving a rigorous 
significance to expressions like (2.4) is based upon the 
interpretation of (2.1) as the boundary value of an 
analytic function in the lower half \ and 7 complex 
planes. Since the spectra of energy and number are 
bounded from below, and if, as is presumably true in 
all physical systems, the density of states grows at most 
algebraically with increasing energy and number, the 
exponential decrease introduced by arbitrarily small 
negative imaginary parts of 7 and X is sufficient to 
guarantee absolute convergence. If, furthermore, the 
energy per particle increases without bound for sufh- 
ciently large particle densities, as it does in a fermion 
system with no attractive forces, or in any system which 
cannot be indefinitely compressed, the exponential 
factor in the energy produced by a negative imaginary 
part of + assures absolute convergence whatever the 
imaginary part of A. 

For a large class of systems it is possible to evaluate 
(2.4) asymptotically by taking advantage of the ana- 
lytic behavior outlined above. This is most clearly 
accomplished by first defining the result of the \ inte- 
gration, which projects out the trace with a given 
number of particles, 


Hr=' 


expW y(tr)= EF Pye = Iry é iHr 


(2.5) 


and then evaluating asymptotically the 7 integral, 


(2.6) 


® dr 
p(.VE) -f explLiEr+W y (iz) J. 
) 


~ on 


As a function of 7, expWy is analytic throughout the 
lower half-plane. Furthermore, along the negative 
imaginary 7 axis, the second derivative of Wy with 
respect to iz is real and positive, since it has the form 
of the dispersion of the energy with respect to a positive 
weight function. Consequently —@Wy/dir, which 
approaches infinity as 7 — 0, decreases monotonically as 
7—» —ix to the limit given by 2o(.V), the minimum 
energy of the system for .V particles. It follows that on 
the negative imaginary axis there is one and only one 
solution, iry>=8>90, to the equation 


E=—dWy/(8)/08, E> E,(N). (2.7) 


The expression (2.6) may then be integrated by the 
method of steepest descents, by deforming the contour 
of integration into the lower half plane so that it passes 
through 8 in the direction, parallel to the real axis, 
along which the integrand decreases most rapidly from 
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its maximum. The contribution to the integral from the 
neighborhood of this saddle point is evaluated in terms 


of the positive-definite second derivative 


LW x/ dB = — (0E/dB)y 20, (2.8) 


with the result 


p(NE)=exp[BE+W y(8) ]L27PW y(B)/d8}'. (2.9) 
A more detailed justification of the asymptotic domi- 
nance of this neighborhood will not be given here. 

The function, Wx(8), is now written in terms of 
W (id,8), by making explicit the steps which led to (2.6) 
from (2.4) 


dd 
expL W (8) | [ eiNAt+W (id, B). 


J_,24 


(2.10) 


The integral (2.10) can also be evaluated asymptoti- 
cally. Under the assumption of a maximum particle 
density, exp[W(iA,8)] is analytic throughout the A 
plane. An argument similar to that employed for 
OW n/dir shows that as A decreases along the imaginary 
(8), the 
to the 


axis —OW/did decreases monotonicaly from N 
maximum value consistent with the given 8, 
value zero. Consequently there exists a unique point, 
\o= —la axis, which satisfies 


, lying on the imaginary 


V=-oWw N(g)>N>0. (2.11) 


Ja, 


By deforming the contour to pass through this unique 
point of steepest descent along the imaginary A axis, an 


| - (2:02) 


. (2.13) 


asymptotic value for (2.10) is obtained, 


CW 
N+W(a,8 dr 
0a? 


when inserted into (2.9) yields 


E+W “| (2) 


For the systems of physical interest, the exponential 
factors in (2.12) and (2.13) are the sig- 
2.12 


nificant parts and consequently (2.12) may be written 


W »(B) 


expLW y(g) ]=e* 


which, 


CW Wy 
p(.VE)=e 
0p? 


0a? 


numerically 


-aNV+W (a,p). (2.14) 


This asymptotic approximation is consistent with the 
indicated dependence of the functions since 


ow 
=ad N +-—(a,8)dp, 


0B 


dW w(B)- (2.15) 


in virtue of (2.11). Equation (2.7) can therefore be 


replaced by 


k= — dW (a,8)/ 0p. 2.16) 
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Furthermore, the second derivative satisfies 
_# -W 


OW /da 
yee, 
~ 9p? dadB OBT 
—@ WwW -(— ) (- W ) 
ag? 0adB 
so that (2.13) may also be written in the form 
cal 
dadB 


Again, in physical systems the numerically significant 
parts of this expression are the exponentials, so that 


Inp( VE)=aN+BE-+ W (a,8), (2.19) 


FW w 


ag a 


p( NE) = e®Nt88+W (a8) 


(2.18) 


1 a ew 
a? 


0a? 


which is consistent with the differential relation 


d |Inp(VE)=adN+ dF. (2.20) 


The result (2.18) has been obtained in two stages. With 
the assurance that the dominant contribution arises 
from the vicinity of the one point a, 8, the two-dimen- 
sional integration can be performed directly. Using this 
procedure, we are led to the quadratic form 
Ou 
- (6a) (68) +— 
op? 


; Ww 
(6a)?+2 
daaB 


ew 
ew = (68). (2.21) 


0a? 


This expression is shown to be positive definite by ex- 
hibiting it as a dispersion 


OW =((Nba+ H6B)*)**? — (( Néa+ H6B)**)?, 


with respect to the averaging process 


X)*8= (Tr e284 X) 


Tr e~aN-BH 
= Tr e-@’—8H-WlaB)V (2.23) 


The positive definiteness is fully characterized by the 
three inequalities 


ow ow 


da? 20), 


0p? 20, 
OW 


(— ) (—) 
dad8 O08 Jy Oa? 


OV nH 
-(- ) ——— 20. (2.24) 
da J » Of? 


By deforming the contours to pass through the unique 
point at which 


N=—ow, 


Ww vw 


0g? 


Oa 


da=(N)*%, (2.25) 


E=—dW/dB=(H)*, 


and integrating the quadratic form in the neighborhood 
of this point, we obtain the expressions (2.18) and (2.20) 
directly. 
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It is necessary at this point to describe an intensive 
property characteristic of any spatially distributed 
system—the pressure. This quantity is occasionally 
introduced by referring to an external parameter, the 
container volume, which is regarded as occurring in the 
Hamiltonian. The negative rate of change of the energy 
with respect to this volume is then used to define the 
pressure. It is clearly preferable, however, to introduce 
the notion of pressure in the more fundamental way 
which relates it to the local transport of momentum. 
Needless to say, the container walls are the agency 
ultimately responsible for producing the local stresses 
in the absence of other external forces. 

We shall restrict ourselves to systems containing one 
type of particle described by a field, y, for which the 
momentum density operator is (#=1) 

G (rt) = (1/27) [Yt (9) Vy (1d) — (ot (rf) (r,t) ]. (2.26) 


We shall also suppose, for definiteness, that the Hamil- 
tonian has the form 


1 
H=— fa Vv (rl)- Vy (rl) 


2m 
+4 f dear’ V(r yee F WODH(H), (2:27 


which is characteristic of instantaneous two-particle 
interactions. From the equation of motion 


0 1 
i—p (rf) = ——V*p (rl) 
al 2m 


+ fav’ yier'dv(e r’)W(r')W(rt), (2.28) 


and the adjoint equation, we then derive 


0G 

—=-—- 97 -T 

al 

- fas’ y\(edyr(ep vole eM FOV), (2.29) 


where the first term is the negative divergence of the 
stress tensor for a noninteracting system, 


T;; = (i 2m) LV atv t+ ViViVe— 2ViV (WY) ], 
(2.30) 


while the second term of (2.29), because of the nonlocal 
nature of the interaction, is not a divergence. Never- 
theless, for short-range forces, the second term may be 
replaced effectively by a divergence since the contri- 
bution to its integral over any volume comes entirely 
from a region in the neighborhood of the boundary of 
that volume whose dimension is comparable in size with 
the range of forces. This follows from the fact that the 
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double integral over identical regions for r and fr’ 
vanishes since the integrand is antisymmetrical, a 
property which reflects the equality of action and reac- 
tion. The effective interaction stress tensor, T, is 
constructed by integrating the term identified with its 
divergence 


v-TO= fade W(nVi(e) vor KW DH(H), (2.31) 


over a region, V, large compared to the range of forces. 
In terms of n, a unit vector normal to the surface of V, 
we may write 


fasin-t0(n)= far felons 


ri2 0 

x—- alrdvr wed} (2.32) 
ri2 Ori. 

where ry2="1—Fe and ry2= |r|. The contributions to 

(2.32) come entirely from a region in which ry is inside 

V, re is outside V, and ry is smaller than the effective 

range of forces. 

We now introduce a hypothesis of local uniformity, 
asserting that within a physically small region the sig- 
nificant expectation values of field operator products 
ire dependent only upon relative position vectors. The 
integral (2.32) may then be written in the form 


fas n-T(r,)= fer fare ds 


where, for fixed rj2, the domain accessible to re is a 
shell of thickness —n-rj2 so that drz > —n-fPyodS». As 
a result of the restriction that rz be outside and r, 
inside V, only half the ryz space is covered, and therefore 


TO (nr)=- 1 f draniens Yt (r+3rie)yt(r—3ri2) 


1 oa 
x 0(rio)W(t— dri)W(r+ ri). 


rio Orie 


(2.33) 


The pressure, p(r), is identified as the average diagonal 
element of the total stress tensor, } >>; T;,, and thus 
the appropriate operator becomes 


1 
p(n) | wi wy—3 fdr: Wi(rtory)W!(r—fryp) 


m 


Ov(ri2) : 
Xrie—— V(t 20h (r+ dri) |. (2.34) 
Ory» 
The assertion of local uniformity has again been invoked 
to justify the omission of the term containing ¥?(y'y). 
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We can now make contact with the viewpoint that 
relates the pressure to the total volume occupied by 
the system. For this purpose we consider the trace 
definition of the function W(a,8,V), and perform the 
coordinate scale transformation 


r=6r', (2.35) 
together with the operator canonical transformation, 
¥(r)=0-'(r’). (2.36) 
In view of the invariance of the trace under a canonical 
transformation we find that with the new scale of 
coordinates 


explW (a,8,6V) | 
1 
=Tr exp| a \V—p6 | —¥y'- vy 
2m 
-38 fwoionya] (2.37) 
Differentiation with respect to @ at the point 6=1 now 


yields 
ow - 
V =0f far pi)) =PpV, 
OV 


and the numerical pressure so defined is given by 


(2.38) 


p=(1/8)(aW/aV). (2.39) 


Furthermore, for the system being considered, which 
involves no external forces, the operator p(r) describes 
a local property independent of the total volume, so 
that the pressure is an intensive variable, p(a,8). Hence 
W (a,8,V) must have the form 


W =VBp(a,8), 


and likewise the energy and particle densities are given 
by the intensive expressions 


(2.40) 


N 0 ¢ 0 
_ (Bp), =— 
V Oa V 0g 


(8p). (2.41) 


For the purpose of introducing a conventional nota- 
tion we note that the relation 


dW = — Nda— EdB+BpdV (2.42) 
may be written as 
d(W+aN+8E)=adN+pdE+B8pdV, 
or as the following extension of Eq. (2.20): 
(1/8)d Inp= (a/8)dN+dE-+ pdV. 


We also note that (2.40) and (2.19) lead directly to 


(1/8) Inp= (a/8)N+E+ pV. (2.44) 


(2.43) 
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It is clear that thermodynamic systems may be dis- 
cussed by interpreting the symbol 1/8 as the absolute 
temperature (multiplied by k), Inp as the entropy, S 
(divided by k), and —a/@ as the chemical potential, yu. 
We shall for convenience use this thermodynamic 
language without, in any way, confining ourselves to 
domain of equilibrium processes to which thermo- 
dynamics applies. The inequality (2.24) may be con- 
veniently reexpressed in terms of the thermodynamic 
variable u. In particular since the chemical potential is 
given by 

u=(0E/ON)sv, 


its derivative satisfies the relation 


OE Ou 1/0a a /0B 
(2), 30.50) 

oN? SV ON SV B ON SV 3? OV SV 
(2.46) 
(2.47) 


B(dN)? 


1 
:--——#°W >0, 
B (dN)? 


[ (da) (dN) + (dp) (dE) | 


(2.48) 


which may also be written in the form 


0 
( (2-44)) <0. 
OV SV 


(2.49) 


In a homogeneous system, for which the energy density 
is an intensive variable and p= — (0E/0V) sy, the rela- 
tion (2.48) may be written in the form 


6 & & 27 OE 
OV7 sy OV?7 sn ) ca. 


It also follows from (2.24) that 
( Ou/ON) gy 2 0, 


which in a homogeneous system reduces to 


Op V\?7 Op 
OV an V ONT 3) 


One basic problem to which the techniques developed 
in the remainder of this paper shall be applied in the 
determination of the ground-state properties of an 
interacting system. Such a state does not strictly fall 
within the realm of the preceding asymptotic evalua- 
tions. Yet the properties of this particular state are 
obtained by taking the zero-temperature limit of the 
expressions derived above. In particular we note first 
that in the limit as 8 — ~, Inp, which satisfies 


0 Inp OF 
(12), A) 
op VI OB vv 
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approaches its minimum value, zero, and / approaches 
its minimum, Eo(N), both of which correctly charac- 
terize the ground state. In this limit Eq. (2.44) reduces 
to 

uN=pV+E,(N). 


The conditions (2.49) and (2.51), which are equivalent 
for the ground state, may be applied to (2.54) where 
they require that 


ap 17a 
=) ---(- (Es(.)-14)) 20. (2.55) 
av/y Van y 


Furthermore since p=0 when V=0 and ? is continuous, 

p=—(1/V)(Eo(V)—uN) 20. (2.56) 

A particularly simple situation occurs when the 

ground state of the system is spatially localized by its 

own forces and requires no container. For such a state 
the pressure satisfies 


(2.54) 


p= — (dE)/dV)v=9, (2.57) 


so that the asymptotic equation (2.54) reduces to 
Eo =yV. (2.58) 


Turning from the asymptotic evaluation to the 
direct discussion of the ground state we note that these 
same results are obtained. In this case the essential 


point is that, in a large system interacting by short- 
range forces, the energy per particle in the ground 
state is a function of the density only. From the 
stationary property of the ground-state energy with 
respect to the total occupied volume, which requires 
that 


(dk dV) v=0, (2.59) 


it then follows, in agreement with (2.58), that 


V 0Fy Oy Eo 

—— — ) =_— — -+ ==(), 

NNXOV 7 wn ON/Jy WN 
Similar considerations may be applied to unbound 


systems by including the work done by the container 
walls. 


(2.60) 


III. MICROSCOPIC PROPERTIES. 
THE GREEN’S FUNCTIONS 


The microscopic properties of a system which are of 
physical interest are those which are essentially identical 
in a large fraction of the states with prescribed values 
for the extensive constants of the motion. For this 
reason, it must be possible to determine the properties 
by averaging over all states with the prescribed con- 
stants. In the present section we shall be concerned 
with a class of average values in terms of which all such 
quantities may be expressed. It is the set of average 
disturbances induced by the introduction of external 
particle sources localized in space and time. 
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In the previous section it was demonstrated that the 
density of states as a function of energy and number 
could always be determined from the function W (id,ir). 
Indeed, in a many-particle system, the macroscopic 
properties at each level of energy and number were 
shown to depend upon W(a,8) for unique real a and 8 
determined by EF and N. As a result the system was 
characterized by intensive parameters denoted as tem- 
perature and chemical potential, from which, con- 
versely, the energy and number might be found. 

In the determination of the class of averages which 
describes the microscopic properties, procedures anal- 
ogous to those of the previous section will be followed. 
First a rigorous method for obtaining the averages, 
independent of asymptotic considerations, will be indi- 
cated. Then an asymptotic evaluation of these quan- 
tities will be carried out. In this asymptotic evaluation 
we shall find that the relevant macroscopic variables 
are the intensive quantities, a and 8. 

The class of Green’s functions associated with pre- 
scribed macroscopic properties V and £ are the set of 
average diagonal matrix elements 
GaN FE (tities Sata; Bilas ++ Ente’) 

= (—i)"— NE| (nits): + -W(ral,) 
Xi (rn’tn’) Wn’h’)),|NE], (3.1) 


where 


[NE|X|NE)=Tr(PweX)/Tr Pye 


== TevsX/Trvst. (32) 


The structure of these Green’s functions is related to 
the dynamical description in terms of external dis- 
turbances; consequently the operators y and wy! occur 
from right to left in the order of increasing time coor- 
dinates. [This ordering is denoted by the (_), bracket. ] 
The factor € in (3.1) is identically +1 for boson fields 
while for fermions it is the antisymmetrical function of 
the time coordinates, which equals +1 when the 
operators occur in the order exhibited in (3.1). The 
subscript ” refers to the number of field operators y or 
¥' which occur in the function. Since the field operators 
create and destroy particles, the number ” represents 
the maximum difference in the number of particles 
between various intermediate states and the original 
state with NV particles. We shall refer to (3.1) as the 
n-particle Green’s function although, as just indicated, 
the name actually refers to the number of particles 
superimposed on the background. Finally, the coor- 
dinates r should be understood to contain implicitly 
the internal coordinates, like spins, which may also be 
present. 

As with the density of states, it is convenient to deter- 
mine the function (3.1) by introducing a generating 
Green’s function characterized by 7 and A, which in a 
notation generalized from (2.23), 


(Y h.ir =[Trie 


iNA iar X) | Tre iN iH 


(3.3) 
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has the form 


G,™ T= (—i)"e((W (ryt) - " W(Pnln) 

Xo (tn'tn’) + -Wt(ri'h’))4)% (3.4) 
These functions, G”’, are defined as quotients of 
traces so that the dominant rapid variation, asso- 
ciated with the exponentials of the energy and number 
of the entire system, is removed. From them, the 
functions (3.1) are determined by the relation 


dx dr 
pivEnGr®= ff — f — etNAtiBrtW(O,IGa.ir (3,5) 
2r nr 


While W and the Green’s functions have been inde- 
pendently defined it should be noted that the deriva- 
tives of W are simply related to the Green’s functions. 
The A derivative of W satisfies 
aw | 

; (3.6) 


=(N)* it 


0(id) 


+i farG (rt; rt*), (3.7) 


where / is any time, and ¢* indicates the ordering of the 
field operators at equal times. The alternative signs, 
+ and —, refer, respectively, to bosons (B.E.) and 
fermions (F.D.). Similarly, the + derivative of W 
satisfies 
ow 
———=(H)"*, 
d(ir) 


(3.8) 


so that when the Hamiltonian has the form (2.27), we 
may write 


9 


ow : Vv" 
_ =+i fdr lim ——G,*>**(rt; r’t+) 
O(ir) rr = 2m 


=3 fara. v(r1;—12)G» . 'T(rilrol; ryftrolt). (3.9) 


With the aid of the field equation (2.28), 


0 = 
(: + w r/) 
Ol 2m 


- far v(r’— rt (ry (r')y (rt) =0, (3.10) 


the relation (3.9) may be written in an alternative form 
involving only G, namely 


ow 
—- =i far lim 
Oir late Sl 


9 


"i fa 
, ——— Guanes), 
“3 ol 2m 


(3.11) 
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Considerable information about the structure of the 
functions G"'" is obtained from their definition as 
traces and from the commutation relations for the field 
operators. For example, the one-particle function G,”:” 
for ‘>?’ is equal to the function 
Gy” (rt; £'t’) =e Va.) 


X Trlew (1 /i(rdyt(r't’)], (3.12) 


and for /<?’, it agrees with 
Ge th, it (yf: r’t’)=+te-W (a. #) 


x Trem (1/ayi(r'’)p(rl)]. (3.13) 


From the cyclic properties of the trace, it follows that 
(3.13) is equivalent to 
ic. id it( rf: n't!) =Ae-W (a, ir) 


X Trl (1/i)p (rept (r’t’) (3.14) 


The evolution of the field operators in time is governed 
by the relation 


W (rl) = C1! O-Dy (F1y)e~ tH (ata) (3.15) 


and the annihilation properties of the field operators 
imply that 


¥f(N)=f(N+1)y. (3.16) 


In virtue of these relations, Eq. (3.14) may be rewritten 
as 
Ge® (et; PP) =Ae-™ Tre Me 8-1/5) 
Xv(r, i+ r)yt(r’t’) | 
=+e—°Gs(ri+7; r’/). 


(3.17) 
(3.18) 


If Fourier transforms of these functions which depend 
only on (—?’ are introduced, 


© dy 
GC a iA, if — +i aad D> a 
» 2m 


then it follows from (3.18) that 


it(rr'w), (3.19) 


gc (rr’w) = te 8 Tg,:( rr’). (3.20) 


Further information may be obtained from the com- 
mutation relations, which require that 
i[G(rt; r’t-)—G(rt; r’t+) ]=6(r—r’). (3.21) 
Since the values of these Green’s functions are given by 
dw 


—g>(rr'w) 
wi 


dw 
Ge(ri; r't) -f gc(rr'w), 
2ri 


Get: T)= 
and 


(3.22) 
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the functions g, and gz satisfy 


* dw ® dy 
f —L[g>-g<]= —[1Fe-- }g,. =5(r—r’). 
—@® 2x —@ 2r 
(3.23) 


These formulas may be written in a more symmetrical 
form by introducing the function A according to 


A® it (rr'w) 
g>™ *(rr’w) = ——___——__-; 
1-e-?- rt 

(3.24) 
: A®.it(rr'w) 
<9 “(s¢'a) = -—————— 
§ 1-eAtiver 


This function A satisfies 


* dw 
f —A(rr’w) =5(r—1’), (3.25) 


eee 
and in terms of it, the Green’s function may be written 


dw A i‘, t(rr'w) 
—e~ iw(t—t’ 


1 
G® (rt; rt’) = i> 


i 2n 
1 du 
= : f = g 
i! 27 


To obtain a single expression for the Green’s function, 
we write 


1 Fe- id- ior 
(3.26) 
A®(rr'w) 


iw (t—t") a 


pi t<l’. 
e' iwr 


(3.27) 


G(rt; r't')=n,(t-U)Gs (tt; r'l’) 
+y_(t-l')Ge(rt; r't'), (3.28) 
where 
n(t—t)=1, t>t 
=Q, t<?: 
n_(t—t')=0, > 
= ae oe fs 


and employ the integral representation 


x dw e7 iw(t-t’) 
n(i—t)=4if ——_, 
_» 27 wtle 


(e> 0+) (3.30) 


together with the evaluation 
1 1 
——=P-F¥7i6(w). 
wtie w 
The result is 


* dw ~ doy’ 1 
G(tt; r= f ete f —|[p— 
—o LF _» 2rl w-—o’ 


t-te er 
— 15 (w—w’)—— 
1-e-?- 


|: a. telne'ea’). (3.32) 
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So far in this section no asymptotic considerations 
have been employed. With the aid of such considera- 
tions we shall now show that G’” reduces asymptoti- 
cally to G*:*" with 7d and ir replaced by the values of 
a and @ appropriate to the given V and E£. As was 
remarked after (3.4), the function G:*” is the quotient 
of two terms possessing the same rapid variation, but 
in which the numerator also contains functions that 
vary only to the extent of exponential factors involving 
the energy and number of excitations. These factors 
are of negligible importance compared to those involv- 
ing E and N and so the asymptotic evaluation of (3.5) 
is essentially identical with that of (2.4). The only 
difference lies in the fact that at the saddle point a, 8, 
the integrand is now multiplied by the slowly varying 
function evaluated at that point, 


G%=G.1, iA=a, ir=f. (3.33) 


Equation (3.5) therefore yields the asymptotic identity 
GNESG*8, (3.34) 
We note here the forms assumed by the derivatives 


of W(a,8), as derived from the previously obtained 
relations (3.7), (3.9), and (3.11): 


ow 
— —=(N)et= V4: fdr G,°" (rt; rift), 
da 


(3.35) 


ow Vv? 
-—=(H)t= b= +i far lim [cee r)| 


wails 2m 


op 


_ jf arar o(r—r’)G2*(rir'l; rttr't*), (3.36) 


=+1 4dr lim 
v—tt,r’ 


It is instructive to analyze the function G, rigor- 
ously and to demonstrate how it reduces asymptotically 
to the Green’s function G,**. We begin with the 
definition (3.1) which relates the Green’s function 
G,¥¥ (rt; r’t’) for >?’ to the function 


) 


1 0 VY? 
(: -— Jee r/; | 
L2N OL 2m 
(3.37) 


NE 


GSNF (rt; rt’) = (A/T NE| Pp (rb (r't’) | NE]. 


(3.38) 


As a matrix in the coordinate indices rt and r’t’, the 
function iG” is Hermitian, 


iG>N* (rt; r't’)=[1Go"*(r'l; vl) }*, (3.39) 


and positive definite. We shall use the notation 


IGN (rt; rt’) 20 (3.40) 
to indicate this matrix property. No confusion will 
result from this notation since, when it does refer to 


numbers, these will be the elements of a diagonal 
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matrix. Equation (3.38) may be written in the form 
iG>** (rt; r'l’) 


= ¥ [NE|W(1l) Py ept(r'’)| NE]. 


N’‘E’ 


(3.41) 


If we take into account the creation and annihilation 
properties of the field operators and the operator de- 
scription of their time dependence, we may reduce 
(3.41) to 


iGS** (rt; ’’)=[Trvel}? ¥ Trl Pyey(n) 
E’ 


X Pua eWi(r’) Je 2'-2-#),- (3,42) 


In this equation, the arbitrary common time of the 
field operators has been omitted. 
The Fourier transform of this expression is defined as 


* dw 
Gy" ¥(rt; r= f 4 
» 21 
where 
g>%¥ (rr'w) = 2ap(N+1, E+w) 


X Trl Py eW(r) Pw 1, F+W" (r’) | 4 
(Tryel)(Trn41, e401). 


iw(t—t) 9 NE(pr'w), (3.43) 


(3.44) 


The transform, g5‘", has properties as a matrix in r 
and r’ which correspond to the conditions (3.39) and 
(3.40), 

g>%¥ (rr’'w) = g>%*¥ (r'tw)* 20. (3.45) 
Furthermore, its form indicates the special properties 
which apply to the ground state of the system with V 
particles, for which E= E)(N). The energy E’= Ey(N) 
+w must be a possible energy for the system with 
N-+1 particles, so that 


w= E(N+1)—E,(N) 2 Eo(N+1)—Eo(N) 


~“dIE\(N)/ON (3.46) 


and only frequencies greater than the chemical poten- 
tial, w= 0Eo(N)/AN, appear. 

Similarly, the Green’s function for ¢<¢’ satisfies the 
relations 


(3.47) 
(3.48) 


IGN® (rt; f)=[iG*(r'l’; rt) }*, 
+iG-N¥ (rt; r't’) 20. 
If its Fourier transform is defined by 


dw 
GNF (rt; rt)= f e~ wt) g NE(er'w), 


2mi 


(3.49) 


the Hermitian amplitude 


+g-%¥(rr'w)=2np(N—1, E—w) 
X Trl Pwe' (r’) Py-s, eo (r) |/ 
(Try e1)(Try-1, wl) 


AND 


J. SCHWINGER 


is also positive definite. For the ground state we infer 
a frequency condition converse to (3.46) ; E’= Eo(N)—w 
is an energy for V—1 particles and consequently 


w= Ey(N)— E(N—1)< Fo(.N)— Eo(N—1) 
~0Ey(N)/ON=p. (3.51) 

As Eqs. (3.46) and (3.51) suggest, a certain simplicity 
can frequently be achieved by adding a term, —poN, 
to the Hamiltonian, which displaces the zero of energy 
for the individual particles. Then conditions (3.46) and 
(3.51) become restrictions to positive and negative 
frequencies for positive and negative time differences, 
respectively. 

Thus far the function G,¥” has been analyzed 
exactly, with the exception of the replacement 


E\(N+1)—Eo(N)2X0Ey(N)/ON. 


To proceed further, however, it is useful to involve the 
asymptotic properties in a more fundamental way to 
derive a parallel to the exact relation (3.20). We note 
that when many particles are present, the averaged 
matrix elements will be fractionally insensitive to a 
change of one in the total number, or to a change in the 
total energy by an amount associated with a single 
excitation. Equation (3.50) may then be replaced by 


+g -N¥(rr'w)=2rp(NE) 
x Trl Pw; 1 E+" (0) Pred (4) |, 
(Try41, 2401)(Try el) 


gE (rr'w)[p(NE)/p(N+1, E+w)]. 
(3.53) 


(3.52) 


The ratio of the density of states at infinitesimally 
different energies, for systems differing by one in the 
number of particles, is given by Eq. (2.43), 


d Inp=adN+fdE — at Bu. (3.54) 


Consequently Eq. (3.53) coincides with the form of 
(3.20) obtained by the replacement iA — a, ir > B: 


g<(rr’'w) = +e~* Fg, (rr'w). (3.55) 


We note that the conditions (3.46) and (3.51), rigor- 
ously valid for the ground state, are reproduced by 
these asymptotic representations as B—« with 
—a/8=y held fixed. The subsequent analysis proceeds 
in exact parallel with (3.21) through (3.32) where the 
function A**, defined by 
A*(rr'w) 


g>(rr’w) =- ae 
1 e-o-bo 


® dw 
{ —A*(rr'w) =5(r—r’), 


oe 


(rr) A*(rr’w) 
§<(IT w)= — =e 
1 Fectbeo 


(3.56) 


satisfies 


(3:57) 
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and occurs in the Green’s functions in the form 


GNF (rt; r’t!) SG (rt; r’t’) 


1 dw 
= foceee 
1 2r 


dw A (rr’w) 
—e~ iw (t—t’) 


2r 1=Fettbe ’ 


AB (rr’w) 
{>t 


“9 
1F-e abu 


t<i’ 


23 doo | 1 
—j}P— — rib(w—w’) 
a | w—w! 


X {tanh[48(w—p) ]}* } 1°8(rr’w’). (3.60) 


The positive-definite matrix conditions gs¥"20 and 


+g-¥20 imply that 
B.E., 
FD. 


A*(rr’w) coth[48(w—pz) ]20, 


A*8(re’'w) 20, (3.61) 


For a large class of systems it is possible to simplify 
the spatial dependence of the function G,(rt; r’t’) 
=G,(rr'’; {—?’') by using the fact that in the interior of 
a homogeneous medium the field correlation functions 
depend only upon coordinate differences, 


G,(rr’; (—0')=Gi(r—r’; t-1’). (3.62) 


It is convenient to transform to momentum space by 
introducing the Fourier representation 


288 6 (9; i—F). 
2m)? 


(3.63) 


The momentum-space Green’s function is then given 
by [for definiteness we use the asymptotic forms 
although identical formulas obtain for the functions 


Gi™"(p; 1-0) ] 


(3.64) 


» <F 
F eathu 


where the amplitude 4°°(pw), which is still a matrix in 
the internal variables, obeys the additional relations 


dw 
f A%8( pw) =1, 
Qn 


A*8(pw) cothl} (a+fw) ]>0 


A*8 (pwr) >0 
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Equation (3.35) then reduces to 


dp dw tr A%*( pw) 
\ = fac f fz ‘ 

(Qr)' J Qn extbo=F] 

or in terms of the occupied volume, 
=f f= tr A2F( pu) 
(27) J Qn eatbu=e | 

In these expressions summation of diagonal elements of 
the internal coordinate matrices has been explicitly 
denoted by the symbol tr. We may evidently identify 


the frequency integral with the momentum distribution 
per unit volume of phase space, 


dw tr z AN (Pa) 
n(p)= i; 


In a similar manner, Eq. (3.37) becomes 


E dp 1 p’ \ tr A2F( pw) 
Flere 
V (27)* 2 2m/ extbe==] 
The equality of the second derivatives, 
OK ew _ ON 


— —, (3.70) 
Oa ~ aaaB 0g 


(3.66) 


(3.67) 


(3.68) 


(3.69) 


imposes an additional condition on the numbers 


tr A*°(pw), namely 
f. dp fz: (: = ‘ldlidatal & ae A“ (poo) 
: pail ma 
(2r)> J 24 2\2m (extbo = 1)? 
dp 1 Fr @ 
(2m) 2m ewt poz] 0p 


1/p 0 
5% +0) tr A**(pw)]. (3.71) 
2\2m da 
For noninteracting particles of either statistics the 


1*8(pw) has the particularly simple form’ 


(3.72) 


function . 


A? (pw) = 22b(w— p?/2m). 


With this function, Eqs. (3.67) 
familiar statements, and the integrability condition 
(3.71) is explicitly satisfied. We also infer from (3.65) 
the special restriction associated with noninteracting 
bosons, a2 0 or nO. 


to (3.69) reduce to 


IV. TWO-PARTICLE GREEN’S FUNCTIONS. 
ELECTROMAGNETIC PROPERTIES 


While it is possible to discuss some macroscopic 
properties and the behavior of a certain class of “‘single- 


3 We defer the proof of this expected result to the later more 
general discussion. 
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particle” excitations in terms of G,(rt; rt’), a treatment 
of the transport properties of the system is most easily 
carried out in terms of multiparticle Green’s functions. 
Typically, the transport of the quantity of interest 
results from the application of an external field to a 
system in which there is no flux initially. The flux 
induced by the disturbing field may then be computed 
in two distinct ways. The first involves exact solution 
of the field equations in the presence of the externa! 
agency; the second treats the disturbance as a per- 
turbation of the system which is still described by the 
eigenstate basis appropriate to the unperturbed Hamil- 
tonian. The quantities whose transport properties are 
of interest (energy, momentum, matter, charge, and 
the like), are all characterized by multilinear forms of 
field operators and their derivatives. The flux of these 
quantities is consequently described perturbatively by 
means of expectation values of products of these multi- 
linear operator expressions at different space-time 
points. Such expectation values comprise a special 
subset of Green’s functions in which some of the coor- 
dinate arguments are set equal. In the present section 
we shall illustrate this perturbation treatment of 
transport phenomena by a detailed consideration of 
one simple example, the flow of electric charge. Since the 
electromagnetic field is coupled with the charge and 
current densities, consideration of this problem leads in 
particular to the discussion of the special two-particle 
Green’s function, 


Fy’ (rt; ft )=(NE| GeCrOp(r'’)),| NE], (4.1) 
where j is the current density operator in the absence 
of the applied field. The corresponding correlation 
function for the charge density does not require explicit 
discussion since its coupling can be removed by working 
in the gauge in which the scalar potential vanishes. The 
gauge invariance which permits this choice is of course 
intimately tied to the current conservation relation, by 
means of which, conversely, the behavior of the charge 
may be determined from that of the current. 

In the presence of an external field, described in the 
gauge in which the scalar potential vanishes, the kinetic 
energy term of the Hamiltonian is altered from that of 
(2.27) to 


1 1 eA 1 eA 
u-— far- y— )rin( 7~- Join, (4.2) 
2m 1 "; i C 


(In the case of fermions there are also effects resulting 
from the spin magnetic moment which we shall omit.) 
The current density is the coefficient of the first vari- 
ation in —(1/c)A and is therefore equal to 


1 2eA 
(ey et+y'-vy— yw ]=s° +J". (4.3) 
1 1 c 


AND 
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In this expression the operators y are solutions of the 
field equations in the presence of the electromagnetic 
field. The addition to the Hamiltonian, H“’, resulting 
from the appearance of the vector potential‘ in (4.2) is 


1 1 
qA=—- faso- - fase. 
Cc 2c 


To determine the linear induced electromagnetic prop- 
erties of the medium it is necessary to evaluate, to first 
order in the external field, the average current 


[VE|J®|NEJ+[NE|J®| NE] 


generated in those states which had energy F and 
number .V before the field was applied. The current 
[VE\|J@)|NE] contains this field linearly and _ is 
therefore known to the required order. The current 
[VE|J®|NE] in the presence of the field may be 
determined to the desired accuracy by inserting the 
first approximation to the unitary transformation that 
gives the accumulated effect of the external field for 
the interval between /) and ¢, during which the electro- 
magnetic field acts, 


[VE|J(rt)| NE] 


| t 
=[ve (+if H' Hat’ Via) 
| to 
t 
x(1-if Ele wat ) we] (4.6) 
to | 


Accordingly, this part of the induced current is 


(4.4) 


(4.5) 


[VE|I(rt)| NE] 


| + ¢' 
=|.ve Lic, f at far jcre)-acer)| vel (4.7) 
Cc to | 


Since the field can be represented by a vector potential 
A(rt) which is identically zero for 1</, we may write 


s yp 
[VE| J (rt) vE}=- f at far 
CV _x 


XLVE|Ei(1), (0) NE]-A(r'’), (4.8) 


and then combine the two terms of (4.5) into 
, 1 
[NE\J(rt)| vE}= f dr'dt’ x(rt; r'’)--A(r'l’), (4.9) 
fess Cc 


where the polarization tensor, x, is expressed in terms 
of the states and operators in the absence of the external 

‘In this discussion it is the external potential that is designated 
by A. The electromagnetic interactions among particles are under- 
stood to be included in the unperturbed Hamiltonian. 
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field, 
ker(rt; r’t’) 
=n(t—l’)iLNE|(je(rt),j:(r'l’) || NE] 
—5,:6(r—r’)5(t—t’) (e2/m) 
X[NE| pt (rtp (rt)| NE]. 
Here n(i—?’) is the function 
n(t—U) =n (t-L) (4.11) 


defined in (3.29). The asymptotic considerations which 
led to (3.33) apply equally well to the commutator and 
permit the replacement 


kn (t—U')i(( pe (rt), jr’) |) — 6,6 (4 — r’) 
X5(t—1') (2/m) yt (rtp (11). 


Although the current correlation functions occurring 
in (4.1), (4.10), and (4.12) are determined from two- 
particle Green’s functions, their space-time dependence 
is that of a single-particle function of a ‘current field,” 
which in addition possesses symmetries consequent to 
the Hermitian character of this field. Rather than 
follow the example of Sec. III and first describe the 
current correlations with the aid of the functions 
Fy," (rt; rt!) and Fy," (rt; rt’), we turn this time 
directly to the function Fy. (rt; rt’). Similar manipu- 
lations would appear in a direct analysis of G,*(rt; r’t’). 

When ¢>1?’, the function Fy,%(ri; r’t’) is equal to 


(4.10) 


(4.12) 


Fyay 8 (tt; r’t’) 


=Trle aN—BH Wa By (rl)gi(r'l’) |, (4.13) 


while for ‘</’, it is represented by 


Fy pe ( rt: r't’) 


= Tr[e~24—8H-W (a.B)5,(r't’)§, (rl) ]. (4.14) 


As matrices in the indices k,r,f and /,r’,/’, these functions 
are each Hermitian, 


Fyiy, <8 (rt; r't’) = Furs, <9 (r't'; rt)*, (4.15) 


and positive definite 
Fy (rl; r'l’) 20. (4.16) 

Since the two functions Fy and Fe (unlike Gs and Ge) 
are really the same function evaluated at different argu- 
ments, we have the connection 

Fy 4 (rt; rt’) =Fy>4(r''; rl), 
which in virtue of (4.15) becomes 

Fy (rt; rt!) = Fas (rt; r't’)*. 


(4.17) 


(4.18) 


The relation exhibiting the trace structure, which is 
analogous to (3.18), can be written formally as 
Fy? (rr; (—t') = Fy (rr; (—U'—ig). (4.19) 


The Fourier transforms are introduced here as 


* dw 
Fri>, <8 (rt; rt)= f —fit>, <?(rr’w)er ie —"), 


» 26 
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In consequence of the properties (4.15) to (4.19), we 
have the corresponding statements ; 

(4.21) 
(4.22) 
(4.23) 
(4.24) 


fet>,<? (tra) = fur, <8 (r'tw)*, 
fei>, <8 (rr’w) 20, 
fit<® (rr’w) = fur? (r’r—w) 
= fer>® (rr —w)*, 
and 
Sit<(rr’w) =e fas (rr’w). (4.25) 


These relations are most conveniently presented by 
introducing the real positive-definite combination, the 
symmetrical product 


Ser (rt; 0!) =({ ja (rl),ja(e’t’)} 8 
=F yy (rts + F 4 (rt; rt’). 


(4.26) 


Its Fourier transform, defined by 


* dw 


Sir(rr’: -1)= f —e~iw(t—t) 5. (rr’w), (4.28) 
0 oF 


satisfies the condition 
sq(rr’'w) = si(r’'r—w) 20, 


because S is symmetric and positive definite, and the 
condition 


Sxi(tr'w) = 5.1(rr’—w)*, (4.30) 


because S is real. From (4.20), (4.25), (4.27), and (4.28) 
we see that 


Sui(re’'w) = fxrs(tr'w)+ fire (rr’w) 
= (1+¢6-“8) firs (rr’w) 


= (1+) fx<(rr’w). 


(4.31) 


Hence the requirements on fs,< are satisfied in terms 
of those for s and conversely. We also observe that 


(Che (rt) jal’) ])* 


® dw 
f : 
» 26 


which, in view of (4.31), becomes 


wt fa (rr’w)— firc(te’w) j, (4.32) 


dw 
= j —e~*#( tT tanh (Bw/2) |ser(rr’w). (4.33) 


Qn 


We shall restrict ourselves in the remainder of this 
discussion to current correlation functions appropriate 
to a homogeneous, isotropic medium. It is then possible, 
as in Sec. III, to transform to momentum space by 
introducing 


dk 
Sei(tr'w) = sult—r',0)= f e'k (05. (kw), (4.34) 
(27)? 





1354 Y.*€; MARION 


where s,4:(kK,w) satisfies the analogs of (4.29) and (4.30), 
su(—k—w) (4.35) 
su(kw)* 20. (4.36) 


Sit( kw) 


According to our assumption, the tensor character of 
s1(Kw) must be expressed by 


Syt( kw) = 5,181 (kw?) + (k,.k,— k5,7)52(k’w?). (4.37) 


In writing (4.37), we have also observed that s,:, as an 
even function of k, symmetric in the indices, k and /, 
must be an even function of w. Furthermore s;.(kw) 
must be real and positive definite so that 


Sy 20, 5,—k’s2 20. (4.38) 


An additional condition on s,, imposed by the com- 
mutation relations at equal times, will be found sub- 
sequently. 

We can now evaluate the polarization tensor by sub- 
stituting the expression (4.33) for the commutator 
which occurs on the right-hand side of (4.12). With the 
form (4.37) appropriate to a homogeneous isotropic 
medium, this commutator becomes 


(Cel r!) ju r'l’) ] ) 


0 7” dw 
= 6,11 . f € 
Ol J_, 2r 


dk tanh(}8w) 
xf eth -(e—0") __ 51(k°w") 
(2m)3 * 


* dw 
Houv—we) f "aia 


~~ 25 


dk : 
xf e‘* "9 tanh (38) |so(k’w?). (4.39) 
(27)? 


Equation (4.39) may be written in the alternative form 
kyi(rt; r’l’) 


L 


0 dw 
saaee jut) f = 
Ol » 25 


dk tanh ($6w) 
xf ek —— siks)| 
(27)? Ww 


® dw 
+ (uv) f - 
~ 26 
dk 
xf e* (rT tanh ($8w) |so(k’w?) 
(2r)8 
dk 
+a((—iu f “Jegaecaial 


(27)8 


* dw tanh(48w) ne 
x(f -- 51(k’w?) —— ), (4.40) 
» 26 w m 
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where » is the density of particles per unit volume. The 
last term in this expression vanishes in virtue of the 
following identities. From electric current conservation 
we obtain 


dp(rl) 
viene =—(|= ic). (4.41) 
t 


Equation (4.39) permits the replacement of the left- 
hand side of (4.41) by 


0 dw tanh(38w) 
i vf "oat: tial —_——§s,(k*w*), 
dl 2r w 
(4.42) 
and we thereby infer that 
(Lo(1,1),y(0'0’) })** 
dw dk 
--iv f —eiw(t—t’) —__—gik- 
2a (27) 
tanh(}6w) 
x 


———-5,(k*w”). 


Ww 


(4.43) 


When /=/’, the commutation properties of charge and 
current density permit the evaluation 


e dw dk 
—iv—é(r—r’)n= -ivf f ik (rr 
m ww (2n)3 


tanh(38w) 
x Sy ( kw ) 9 


w 


(4.44) 


and, finally we arrive at the relation referred to after 
(4.39), and required for eliminating the last term of 


(4.40), 
* dw tanh(48w) ne 
f - — 51 (k’w?) =— ° 


» 27 w m 


(4.45) 


We may now integrate (4.9) by parts with the ex- 
pression (4.40) substituted for x. In the gauge we have 
utilized, the fields satisfy 


10A 
_ c at 
(4.46) 
vxH=vxX(vxXA)=[VV-V"]-A. 


We thereby obtain 
(J(r/) = far far 2n(t—t’)[o(r—r’, (—0’) E(r’t’) 


+¢(d/dl)y(r—r’, (—U’)¥'XH(r'’)], (4.47) 
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where o(k’w) and y(k’w*) are the functions 


1 tanh($6w) 


o(k*w?) = —s1(k%w"), (4.48) 


2 w 


9 9 9 


1 tanh(36w) 
cy (kw? a eaten 


—so(k*w*), (4.49) 


WwW 
and 


dw 
o(r—r’, (—-/’)= Jz f=  Seoioiied 


Keik -(r- q’ a( *w"), 


x eik (8) (k2w?), 


These functions satisfy the positive-definiteness con- 
ditions (4.38) which assert that 


o(k’w*) >0, o(k’w*) > (kc)*y (k’w?) (4.51) 


The function o also obeys the sum rule (4.45), 


2, ff ne” 
dw a (k*u”) =—, 
Ty m 


which in turn implies the inequality 


2 -_ ne* 1 
f dw y (k?w?) < — —-. 
1 Y¥ mc k* 


With the definitions 
(4.54) 
(4.55) 


o(r—r’, (—t')=2n((-—U’)o(r—r'’, t—1’), 
x(r—r’, (—t') =2n(t—’) (0/dl)y(r—1’, (—’), 


Eq. (4.47) may alternatively be written in the form 


Wa)= favarta(e—v, t—t')E(r’t’) 
+x(r—1', (—t’)cv'’XH(r'’’) ]. 


(4.56) 


In view of the integral representation (3.30) for n(t—(’) 
the Fourier transform of the right side of (4.54) may 
be separated into its resistive part, the conductivity,* 
and reactive part, proportional to the polarizability, 


1 7” o(k*w"*) 
(ks) =~ f du'—— 


rv_, w- eyes 


o (k*w*) —twa(k*w*), (4.57) 


5 The connection between conductivity and_pair-correlation 
function have been discussed in work of R. Kubo, Can. J. Phys. 
34, 1274 (1956), and J. Phys. Soc. Japan 12, 570 (1957); W. Kohn 
and J. M. Luttinger, Phys. Rev. 108, 590 (1957); H. Nakano, 
Progr. Theoret. Phys. (Kyoto) 15, 77 (1956); M. Lax, Phys. Rev. 
109, 1921 (1958); S. F. Edwards, Phil. Mag. 33, 1020 (1958). 
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which are related by 


a(k*w*) =- 


From the sum rule (4.52) we conclude that in the 
high-frequency limit the polarizability reduces to its 
value for a system of free charges 


lim wa( kw) = — (4.59) 


wn 


Similarly, from (4.55) and (3.30) we obtain the ex- 
pression 


=x (k’w*) — iwy ( 


k’w*) (4.60) 
page 
—» FT wW-w+ile 
for the generalized magnetic susceptibility, from which, 
the relation 
w’*y (k*w"), 


Z dw’ 
x(k” on rf a 
wr Jo wo 


between its resistive and reactive parts, follows. From 
(4.61) and (4.53) we also infer that the zero frequency 
susceptibility is limited in its degree of diamagnetism, 


(4.61) 


ne | 
x (k?0) > ———- — 


; (4.62) 
me? k? 


which limit becomes arbitrarily large as the wave 
number approaches zero. 

We may now construct an integral representation for 
the current correlation function Fy,;“(rt; rl’) in a 
homogeneous medium in terms of the quantities of 


direct physical significance, ¢ and y: 


F,°8 (rt; rt!) =9, F 5+9_F. 


x 


dw dk 
-f e iw(t of 
nT (27)? 


Ketter") f 0 


® doy! 1 
fir?® (kw) -{ -- [P —ri coth( 1i)6(0~«')| 
»2rl w—w’ 


(4.64) 


(kw), (4.63) 


X 2iw’[ a ( -4))5 + (KK, — by ik? )e 2 (kw 2), 


which is similar in structure to the single-particle 
boson Green’s function (3.60). Equation (4.64) may 
also be written in the form 


fir®? (Kw) 
= [wo (k2w?)coth (48w) — iw? (a (k%w®) + (ne?/mw*)) Jbx1 
+[y (k%w")w coth (46w) +i (kw?) Je? (Ki.kty— 5:ck?). 
(4.65) 
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The real part of this expression may, through (4.26), 
be interpreted as the fluctuation-dissipation theorem,° 


({ je(rl), jl’) 


* dw dk 
f e~ivit of eik-(r To (k%w*)b,1 
» 25 (27)’ 


1 : é 
+ (k,.k,— k°6, ey het) of +— | (4.66) 
2 


efo—] 


The method for calculating / and thus determining 
the physical properties associated with its weight func- 
tion are based upon the connection of F with a two- 
particle function. This relation follows directly from the 
definitions (4.3), (4.1), and (3.1) for the current density, 
the current function, and the Green’s 
function: 


correlation 


F,°? (ryt; rot’) ; 


¢ 2 
-( ) (ove tvs—we), 
Ts 2m 


KW ry OW () (12't'+ Wb (tet’)),,) 


lim 
ri’ ri, re’ 


(4.67) 
e 


“p= lim 


(2m)? 11’ on, 12’ 42 


(Wi- Va (V2 Ve) 


XGo°7 (rylrol!; r,/ttre't+’). (4.68) 
As we have noted, the condition of gauge invariance 
and current conservation permit a determination of the 
charge density correlation function from the current 
correlation function: 
00 
0'9: G(ri)j(r'l’) (p(rt)p(r't’)), (4.69) 
at at’ 
00 
0'9:(G(r)j(r'’)),) = = ((o(ti)p( rt’), 


ot at 


0 
~a—00¢| oie, nie) 
4 al ‘ 


(4.70) 


From these relations the structure of the density corre- 
lation function may be inferred. Indeed, manipulations 
of this character have been employed in obtaining 
(4.43) from which we may with one further step derive 
a representation for the charge density commutator’: 


(Lp r/),p( r't’) ] 


dw dk 2k’ 
f ge iw(t-t eik (r—r’)_ —o (kw). 
2r J (2r)* w 


6 Fluctuation-dissipation theorems are historically associated 
with Nyquist and have been recently discussed by H. B. Callen 
and T. R. Welton, Phys. Rev. 83, 34 (1951) ; J. Weber, Phys. Rev. 
101, 1620 (1956). 


7 The division by w is not valid if « contains a delta function of w. 


(4.71) 
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By applying (4.70) to (4.65) we obtain the relation 


dw 
f e w(t of 
2r 


ek KT wa coth(58w) — iw'a | 


00 
=— —((p(r/)p(r't’)),). 
at dt’ 


(4.72) 


By integration, we then obtain the analog of (4.71) for 
the time-ordered product of charge densities, 


((p(rl)p(r't’)),)— (ne)? 


dw dk 
-if ew iw(t of. = etk(r ¥* 
on (2m)8 


coth (46w 
4.73) 


i 
Ww 


x kf + 


V. DETERMINATION OF THE GREEN’S FUNCTIONS 

In previous sections we have indicated how physical 
properties can be expressed in terms of Green’s func- 
tions. We turn now to the problem of constructing the 
functions themselves. Of the several Green’s functions 
previously defined, the ones most conveniently deter- 
mined are the generating Green’s functions, G,”:'7. In 
this section we will derive a set of equations for them, 
and discuss one method of approximate solution. The 
set we shall derive consists of integro-differential equa- 
tions coupling the various multiparticle functions. For 
a variety of systems at low densities or with weak or 
slowly varying interactions, solutions to the first few 
equations yield Green’s functions which satisfactorily 
describe various static and kinetic properties. 

The set of differential equations for the Green’s 
functions is generated directly from the equations of 
motion and commutation properties of the field 
operators. Thus, the equation of motion 


i(dp(rt)/d1)= — (9? mdi + fdr v(r—r’) 


XV (Dwr Dwr), (5.1) 


implies that® 


[7(0/dt)+(9 2/2m) |(—DeQ(nt)Y (n't), 
+i far. o(ry— ree (W" (rots) W (tol Writ) (11't’)), 


=n nn’) FY (nt) 16 (4 — hh’) 
=6(r—11/)6(41—t1’). (5.2) 
On multiplying this equation by the operator 
exp(—iNA—1H7), 
5 In the remainder of this paper we continue the omission of spin 


and other internal degrees of freedom which we instituted in Sec. 


II] 
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and taking the trace, we obtain, with the definition (3.4) 


[i(d ‘dh)+(V2 2m) |G, **( rh; r,t,’) 


¥i far v(f— r2)G2" "Try Pols; ry/ty'Tolit) 


=6(r1—1r,')5(t,;—ty’). (5.3) 


Similar equations are derived by applying the differ- 
ential operator of (5.2) to the multiparticle Green’s 


functions, 


[i(0/d4)+(¥ 2 2m) |G,” (nh > Palas ry'l;'- ° 


Fi f dtag0( tyros) 


XGnroi (nil: ** Fails; rity’: : 


c w. td} 


‘Tryilit) 
n 

=> 6(ry—1,/)6(t)—0;’) (41)! ; 
inl 
XGr-1 "(Poles + Pata by + ey l iy 


Kripa tiga’ s+ tn't’). (5.4) 
Corresponding differential equations describe the de- 
pendence of these functions on the space-time coor- 
dinates of the y' field. 

Since the same equations are satisfied by any appro- 
priately normalized matrix element of the time-ordered 
field operators, it is necessary to adjoin boundary con- 
ditions to characterize the desired solution. It is con- 
venient, for this purpose, to discuss the function for a 
restricted domain of its time arguments and then deter- 
mine its behavior elsewhere. 

The conditions on the function in this domain are 
most simply expressed in terms of a slightly modified 
set of Green’s functions, 


, 


CPRSLLAG ot ES oF Ds oe oe 
=eif(ti--+th ty’ +. tn’) 


MG "(Gh hh nh tt), CH 


where 


These functions satisfy (5.3) and (5.4) when the dif- 
ferential operator is altered from 


1(0/dt)+(¥12/2m) to i(0/d4)+¢6+(92/2m). (5.7) 


As this replacement suggests, the substitution of (5.5) 
is equivalent to a change in the origin of energy for an 
N-particle system by —¢.V. This equivalence follows 
directly from the observation that a change in energy 
origin induces an additional phase transformation in the 
expression for the time dependence of the field operator, 


ei —fN)(t—e! v( rl’ Je 


( 
ei (H- [N (Ot (rte i(H—-EN)(t-t’) = p- (t-’ yp (rt). 


i(H—{N)(t—t' =e Cl (rf), 
5.8) 


This phase transformation reproduces in the Green’s 
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functions the factor introduced in the definition (5.5). 
Equation (5.8) indicates that G,5:'*(rt; r’t’) may, for 
t>t’, be identified with 
Gish (rt; r't’) 

=—j Trle~ i -{N)(r—t+t! y(r)e -i(H—-{N) (t- yt (r’)] 
XCTre re, =((5.9) 


while for (<t’, it becomes 


Gy: *(rt; r't’) 
— Fi Trle i(H—€N) (1—-t’+t V(r’ )e—i4 oN) C(t’ Oy(r) | 
[Tre HEYF HA, (5.10) 


In these expressions, H and ¢N occur only in the 
operator combination H—¢.V. Similarly, the multi- 
particle Green’s functions G, depend only on the 
operator H—¢.YV. 

The boundary condition for the Green’s functions 
can be motivated and illustrated with the Green’s 
function, G;. From Eqs. (5.9) and (5.10) we see that 


Gis(er’ 5 t—t'+7)=+0i<(rr' ; t-1), 


or equivalently 


Gis (t—)=+G,-(t-'—7). (5.11) 
In order that these statements about the two functions 
(rs and (< serve to impose a boundary condition on the 
single function Gj, it is necessary that ‘—/’<0 imply 
t—t'+7>0 and (—?/>0 imply (—t/—7<0. More con- 
cisely, the variable /—/’ must satisfy 

r2 \t—F]. (5.12) 
This observation suggests the utility of first construct- 
ing the function within such a limited domain. The 
limitation is conveniently enforced by restricting ¢ and 
t’ to the interval [0,7]. In this interval, the conditions 
(5.11), applied to Gi, become the periodicity properties 


Gi(rt)=+G,(0) and Gy(t7)=+4,(1,0). (5.13) 


The relations (5.11) which imply the boundary con- 
dition on G, implicitly require that GS and Gz, and 
therefore G, be defined for suitable complex values of 
the time difference. The necessity for this complex 
extension of the Green’s functions has its origin in the 
small imaginary part of 7. The domain of possible ex- 
tension may be ascertained by examining Eqs. (5.9) 
and (5.10). The former indicates that Gs is defined 
whenever Im 7 <Im(t—(’) <0; the latter that @e is 
defined for Im 7 <Im(t/—/) <0. These conditions are 
summarized in the statement 


Im 7 <Im(/y—1/<) <0, (5.14) 


where the complex numbers /, and ¢< are labelled by 
the order of their real parts. The restriction (5.14) is 
analogous to the condition (5.12) we imposed on the 
real time interval. Both of these conditions are fulfilled 
by making all times which occur in G complex by the 
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transformation /—> /(1—ie) where —ier is the imagi- 
nary part of r. 

Similar considerations apply to the many-particle 
Green’s functions. Thus, the boundary condition on the 
equation for G, is also determined by the connection 
between its values for (;=0 and t;=7. When (;=0, the 
operator y(rjf;) is the field variable with the earliest 
time. With the time dependence of (5.8) explicitly in- 
troduced, the Green’s function takes the form 


Gi (t;=0) = (—i)Me Trem F294) 


Xseq,e* 


KX pe i(H-{N) (tay tia °) 
XX can—1y~ AEN) tem (7,0) ]. (5.15) 


In this equation Xq), --*, Xn—-1, represent the field 
operators ¥(r), j#¥i, j=1, -, n and y'(r/0), 
j=1, «++, , labelled from (1) to (2n—1) in the order 
(from latest to earliest) of the time coordinates, t; or (;’. 
Similarly, when (;=7, (rif;) is the field operator with 
the latest time, and the Green’s function is given by 
G,.(t,= 7) = (—i)"e TrLe-Y (re 48) 


Kel [N) tnx ne i(H—-{N)(tai)—te 


yen HEN) tae], (5.16) 
In view of the cyclic property of the trace, and the 
oddness of the permutation that relates the two values 
of G, Eqs. (5.15) and (5.16) imply the boundary con- 
dition 

(5.17) 


G,(t;=7)=+G,(t;=9), 


with all other time variables arbitrarily fixed. As with 
the one-particle function, the required complex time 
extension is achieved by replacing each time variable, 
including the variable 7, by ¢—> ¢(1—7e). With this 
substitution and the condition (5.17) the solutions to 
Eqs. (5.4) are completely specified. 

The boundary conditions may be incorporated di- 
rectly in the equations by converting the equations to 
integral form or by restricting the equations to func- 
tions which automatically satisfy the boundary con- 
ditions. Since the imposed condition is one of periodicity 
(or antiperiodicity) the second technique, making use 
of Fourier series, is particularly convenient. All func- 
tions with the required periodicity (antiperiodicity) can 
be constructed from the following complete set of 
functions labelled by the integer v, which ranges from 
— 2 (90 ©: 

(vy even (B.E.) et 
exp(—i1vl/r). (5.18) 


(F.D.) 


- odd 


Thus the function G, may be expressed as a multiple 
Fourier series, 


Ga(titas + talnj Mili’: -Pn'tn’) 
= ¥ expla (vii te: tenla—oi'h!: + —p'ty’)/7] 


{v} 
X7r-"G, (nv "Fan; rv’: he tare’), (5.19) 
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which associates a set of 2” frequency indices, {v}, with 
its 2m time coordinates. The Fourier coefficients are 
determined by the formula 


ys . vu. 9 es 
G(twie Enns Tr En Vn) 


: - | dt;: . -dl,' exp[ir(>- vi—>~ v't’) t | 


XG, (nt "Kala; ryt,': ag fa ta). (5.20) 


The time translational invariance of the functions 
G'({t}) implies that G({v}) vanishes whenever )) v¥ Do’. 
The Fourier transform, Gj(rv; r’v’), satisfies the 


equation 


[(av/7)+¢+(92 2m) JG, (11; r)'v;') 


° 
oO ” , 
vi, V2 — 2 


Fir fdr e(r.—n) 3 


vi’ vove’ 
XK Go TV)" Tove; r'vy' Tove’) exp( ive'O*t) 


=6,.'5(r1—"'), (9.21) 


and similarly, G@,({v}) obeys 


C(avr/ tr) AEH (9 2/2m) JG (tiv: = tay H1'vy! = * tn'Y,’) 


Fir free v(fi— Engi) 


ae = 


” , 
VL PnslYn4l 


Bis vi, nat m—vne’ exp(iv, + 0+) 


y ” . 9 ’ / 
XGagi(tr + Paging 1M + a4 Vnyi) 


=¥ {6nw5(t1—2/)(1) 


iva 1( Tove" Tan; ry’ yy ++; Via 


/ 


Tiga Vigan * ‘T.ha hb 


(5.22) 


To illustrate the inversion of these transformed func- 
tions and to provide a tool for further manipulations we 
determine the space-time form of the function 


G=[(av/7r) +64 (92/2m) }'8(r—- 1’). 
This space-time function, which could be used to 
convert Eqs. (5.4) to integral equations incorporating 
the boundary conditions, satisfies the equation 


(i(0/dt)—h+¢]G\(rt; rt’) =6(r—r')6(t—’), (5.23) 


and the conditions 


Gi(rr; FU) =+G)(00; r'’), (5.24) 
where —¥?/2m has been replaced by 4. The function 
(7;° is, therefore, the single-particle Green’s function for 
a system with no interaction. It is first obtained, in a 
form which holds for all times, by solution of Eqs. (5.23) 
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and (5.24). In particular, we note that 
le i(h—{) (t—t’) 

G(r; rt’) = 6(r—r’), 
be ihr 


I> 


1 em ith (0) 
= —-——____4(r-r’), t<t’ (5.27) 
i 1Feilh-h)r 
where the denominator is well defined in virtue of the 
imaginary part of 7. The two expressions (5.26) and 
(5.27), may be combined as 
G9(rt; rt’) =4{ F [cot ((h—$)7/2) ]!—ie(t-1)} 
Ke FO“) 0G§(g—v’), (5.28) 

where 

e((—l') =n. (t—U')—n_(t-/’). (5.29) 
Since these expressions represent the unique solution 
to (5.23) and (5.24), they must be equivalent to the 
lourier series representation, 


“ 1 1 
G(r; rt’) =- >> — — 
t » (rv/7r)+¢—h 


e€ imv(t—t’)/r 


Xé(r—r’), (5.30) 
for ¢ and ?¢’ within [0,7]. This equivalence may be 
verified directly by exhibiting an integral whose evalua- 
tion by two methods yields the pair of expressions above. 
The desired integrand must have poles at the points 
mv/r, with residues equal to the terms of the series 
(5.30). When ¢>/', a suitable integrand is 


—[2n(wt+¢—h) (Ie 7) He 6(r—r’). (5.31) 
The integrand vanishes exponentially at % whenever 
t>t’ and t—t'<+r. By choosing a contour which passes 
between the singularities of (1-Fe~'@")“'and (w+ f—h)", 
and closing the contour in the lower half plane, we 
obtain, for t—?t/>0, the desired summation (5.30). On 
the other hand, by closing the contour so as to encircle 
the pole of (w+¢—/)~", we obtain the expression (5.26) 
which is therefore equivalent to (5.30) for 0</—t/<r. 
A similar procedure reduces the summation (5.30) to 
(5.27) when r>?/—f>0. 
When there is no interaction we infer that 


G,8(r—r'; (—1), 


obtained by multiplying G by e“~" and replacing ¢ 


by uw and ir by 8, is given by 
en th(t—t’) 


G,°4(rt; r’t') =- ————4(r-r’), 
i 1 e-2—Bh 


{>t 


1 € ih(t—t’) 
te ——§6(r—r’), t<’. 

i ert Bh] 

Once the functions G, have been determined for all 
values of the time variables in the interval [0,7], they 
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may be extended to larger values of the time differences. 
This possibility exists because G, for real ¢ and 7 is the 
boundary value of a function which is analytic for 
small negative imaginary values of each positive time 
difference and small positive imaginary values of each 
negative time difference. From its form in the interval 
[0,7], it may therefore be continued to larger time dif- 
ferences by extension on the appropriate side of each 
real time-difference axis within the strip 


0<Im(i,—te)<Imr 


of analyticity. This continuation determines G,, and 
therefore G,,”:'’, for complex 7 and arbitrary real times. 
From G,, the function G,® is then obtained by letting 
7 become pure imaginary. 

The nature of the continuation may be discussed in 
more detail when the function, like G,, depends only 
on a single time difference. For such a function we may 
compare the Fourier expansion (which depends on a 
single frequency in view of time translational invari- 
ance), 

Gi(rr’ 3 (—U/)=7 DY, et (-OnG (rr; v), (5.33) 


with the generally applicable integral expression (3.32), 


= dw doy’ 1 
Gu(ee'st—1)= f e w(t of P 
Qn 2a w-w’ 


F r(cot}wr)+ a(o-w) | (rr’w’), (5.34) 


A(rr'w) = A~*?7(rr’ wtf). (5.35) 
The w integration of (5.34) may be performed as in 


(3.30) and G, written in the form 


v4 dw 
Glee —1)= f LT  (cot}wr)+!—ie(t—’) | 
Ma 


LT 


Xe (MOA (re’w). (5.36) 


We now make use of the equivalence of (5.28) and 
(5.30) to write 
imv(t—t')/r 


A(rr’w), 
wr OFT 


dw . e 
ie 


2r » 


G,(rr’;(—)= 


and thus identify the Fourier coefficient with 


_ * dw TL 
G,(rr’; v) -f A(rr’w). 


2 28 TV—WT 


(5.38) 


By comparison of the form of solution obtained for the 
differential equation (5.21) with (5.38), the function A 
may be inferred and the continuation achieved. 

With the exception of the trivial example (5.28), the 
exact solution of Eqs. (5.4) or (5.22) is presumably not 
possible. There are, however, a number of approxi- 
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mation methods which may be employed. In the 
remainder of this section we shall confine ourselves to 
particularly simple ones which involve the solution of 
a subset of Eqs. (5.4) or (5.22) in which the correlation 
of more than n particles is neglected. The simplest such 
approximation, and one we would expect to hold when 
the forces have long range and are slowly varying, is 
to neglect all dynamical correlations of the particles. 
In this approximation, which may be called a generalized 
Hartree approximation, we replace the two-particle 
correlation function by its form at large space-time 
separations, a product of single-particle functions. 
Since the two-particle function must be symmetric (or 
antisymmetric), it is natural to make this replacement 
in such a way that the symmetry is preserved. We 
therefore set 


Go(rylytole; ty'ty’t2'te! EG (eyls; ty'ty’)Gy (rele; re'te’) 
+ Gi(nits; ro'te’)G (Tele; r,'l;'), (5.39) 


which is, incidentally, always correct for noninteracting 
particles. We are then led to the equation 


[7(0/d4)+(92 2m) +¢ 1G, (1h; r'1,') 


- fi (ry—fs)dro(n(r2))” 


—i f o(r.— rar, Gi(nits: rol +)Gy (rel; ry't,’) 


iG (ryt; > r,'t,') 


=6(r,—1)/)6(4;—11'), (5.40) 


where ((r)) is a local particle density, 
£1G,:'* (rt; r+) =+1G, (rt; rf*). 


The interaction terms in this expression may be viewed 
as an effective potential which is nonlocal but time 
independent. The corresponding equation for the 
Fourier coefficient Green’s function (5.21) may be 
reduced to 


C (wv /r) + (9 2/2m) +6 1G (rv; 1'r) 
- f(r. r)dto(n(r2))® "Gy (ry; 1y'r) 
—irt> fru ro)dr. G(r’; rev’) 


» 


x Gi (rev; ry/v)ei”’ 6(r;—1,’'). (5.41) 


The equation is of the form (5.23), where / is the 
matrix 


(rihir’ 


|- (vy? 2m)+ fo(r—r")(nie iy vie" bre) 


+i >, ro o(r—')G, (rv; r'v)e**, (5.42) 


AND J. 
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The Green’s function therefore satisfies Eqs. (5.26) and 
(5.27) which, in this case, define an equation for 
> Gi(rv; r’v)e’* from which G, is determined. 

In a uniform system, with 


Gi(rr’: (—t')=G,(r—r'; t-1’), 


the approximation (5.40) is conveniently discussed in 
momentum space, where 


“ dp . , 
Gi(e—r1-0)= f et 9Gi(p;t—t’). (5.43) 
(2r)' 


The matrix (5.42) is then diagonal and its elements 


satisfy 


Pp’ 
h(p)=—+ | o(r’’)dr’’(n) 


2m 
/ 


dp 
+ fo(p—p)(n(p* (5.44) 
(2 


2m)? 


The function v(p) occurring in (5.44) is defined by 


v(p)= fe ip -ty(r)dr, 


and (n(p))':'* by 


(5.45) 


(n(p))F*=[eiO-O re (5.46) 


Equations (5.26) and (5.27) then define an integral 
equation determining /(p) and »(p) which may be cast 
in the perspicuous form 


p° 
(n(p))7#= exp] a( +09 =0 


2m 


dp’ ee 
+ f(p—p)(n(w?) )+e]r1 + . = (5.47) 
(2r)’ 


by transforming to ir=$, (=—a/6 and setting the 
particle density equal to m. Equation (5.47) is a special 
case of (3.68) wherein the function 


A (pw) = 275(w—h(p)) 


associates a definite frequency with each momentum. 
The last nonlocal term in the matrix / is an exchange 
term which produces a velocity dependence in the 
“effective single-particle potential.” For a long-range 
force this exchange term is generally negligible com- 
pared with the preceding direct interaction term; for a 
short-range force they are approximately equal. 
Although this association of a definite frequency with 
each momentum is specific to the Hartree approxima- 
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tion, the integral statement 


dw 
f —wA (pw) 
2r 


dw p 


2r 2m. 


dw dp’ 1 
+f — ——+(p—p’)A(p'o)——__ (5.48) 
2m (29)' extBu =} 


dw dp’ 1 
(ps) +00) f = ——A (p’w)——— 
2a (2m)* ert Bo = J 


is more generally valid. The derivation of this result 
exemplifies a class of relations based upon the high- 
frequency behavior of the Green’s function equations. 
For any potential which is bounded, and whose effect 
therefore vanishes at high frequencies, we may conclude 
from Eq. (5.22) that G,({v}) behaves as v as any 
argument v approaches infinity. Indeed the expression 


lim rvG,(rv; rv) =6(r—r’)r (5.49) 


vO 


may be used with (5.38) to rederive 


dw _ 
f A(rr’'w) =6(r—r’). 


2r 


(5.50) 


Additional relations are obtained by keeping sub- 
sequent terms in the high-frequency expansion. Thus 
we determine a second identity by operating on (5.21) 
from the right with [av— (Ay’—¢)r ]: 


{(avi— (hi—f)7 Gy TiV13 r/v1)— 76(rj— r:')} 


X[Larv.— (y’—f)7 J 


Tov2)d( St r’) 


=+i>} 7 f(r —r)aeslGi (ear; 


+, ( P1V2; T2v2)6(r2— r’) | 


r,— r2)dre 


~ de 


¥2,¥3 


K G3 ( TV 1 Pevel3v3;5 11 V1 F2V203v3)dr30( Sees r') (5.51) 


and, as in (5.26), letting k= —V¥?/2m. 

For a bounded potential, this expression explicitly 
exhibits the »' and vy? terms in the asymptotic ex- 
pansion of the Green’s function. Comparison of (5.51) 
with (5.38) then yields the relation 


dw 
J we 
2r 
dw 


v(r—r’)dr’ A(r’'r A 
2r at Bw FE | 


dw 1 
+f v(r—r’)A(rr’w) 2 
2dr ext bu = | 


—h)A(rr'w) 


6(r—r’) 
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which, in a homogeneous medium, reduces to the form 
(5.48). 

A second approximation, in the chain in which Eq. 
(5.40) is first, is obtained by considering only two- 
particle correlations. In addition to improving the 
treatment of those systems for which the generalized 
Hartree approximation is a satisfactory starting point, 
this extension should apply to systems in which strong 
correlations do occur but at distances small compared 
to the mean interparticle spacing, where two-particle 
correlations should predominate. Since the inhomo- 
geneous term and boundary conditions of the differential 
equation for the two-particle correlation function are 
the same as those of the symmetrized product of single- 
particle correlation functions, we write the equation 
determining the measure of correlation as 


[i(0/dty) —M+¢]{G2(12; 1/2’) 


—[Gi(1; 1’)G,(2; 2)+G1(1; 2')G,(2; 1’) ]} 


Fi f 0(1,3)(G3(128; 1/2'3) —[G2(13; 1'3)G,(2; 2’) 
+ Go(13; 2'3)G1(2; 1)]} =0. 


In this equation space-time indices have been sup- 
pressed, v(1,3) represents v(m—r3)6(4:—¢s), and inte- 
gration extends to all additional space-time variables 
under the sign of integration. Equation (5.53) admits 
an approximation one step beyond (5.39). We recognize 
that the right-hand side involves the difference between 
a three-particle Green’s function in the space-time 
region where two particles interact, and the product of 
Green’s functions which take into account the corre- 
lation between this pair neglecting their correlation 
with a third. For long-ranged slowly varying forces and 
for short-ranged forces at low densities we would expect 
that G; might be replaced by a sum of functions of the 
form G2G,, which agrees with G; at large separations 
and also approximately within the range of the inter- 
The function with this desired behavior 


(5.53) 


acting pair. 


2’) + G2(13; 32’)G,(2; 1’) 
+G.(13; 2/1')G,(2; 3), (5.5 


'3)G,(2: 


leads to an approximate form for (5.53), 
[i(0/t) —M+¢]{G2(12 
ae ene : 1 


~i foa,s)Gas; 2 NG,(2: 3)= 


In the remainder of the sec a we shall make the addi- 
tional replacement of G,(2 a} by G9(2; 3) in (5.55). 
This replacement will be a ei first approxima- 
tion in the determination of the combination 2(1,2) 
XG.(12; 1/2’) that occurs in the one-particle equation 


NG, (2; 2'\+G,(1; 2'/)G,(2; 1’) ]}} 
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whenever the interaction of a pair of particles with 

others may be neglected while their mutual interaction 

is producing correlations. The error entailed in the pair 

of approximations is given by the right-hand side of 

the rigorous equation, 

[i(d Ot;) —h+¢)[i(a Oly) —ht+e]{ Gol 12 ) 1’2’) 
—[Gi(1; 1)G,(2; 2) 4G,(1; 2')G,(2; 1')}} 
—iv(1,2)G2(12; 1/2’) 


= - fo 1,3)0(2,4){G4(1234; 1/2/34) 


—[G(13; 1'3)Go(24; 2'4) 


+G.2(13; 2'3)G.(24; 1'4)]}, (5.56) 


obtained by applying [1(0/0t2)—he+¢ ] to (5.53). Much 
of the subsequent analysis of the simplified equation 
obtained by neglecting this remainder carries over to 
more accurate equations derived in Sec. VI. 

It is convenient to introduce a matrix notation in 
which multiplication involves integrating the coor- 
dinate matrix indices over all space and over the time 
interval [0,7 ], the potential matrix is given by 

(12! 2y2| 1’2’)=0(1,2)6(1,1')6(2,2’), (5.57) 
and the appropriately symmetrized unit matrix is repre- 
sented by 


1 yo=6(1,1)6(2,2’)+6(1,2’)6(2,1’). (5.58) 


The approximation to (5.53) or (5.56) is then succinctly 
expressed as 


Gre = GiGel wt iG Go (5.59) 


with the subscripts on the Green’s functions denoting 
particle coordinates. A convenient auxiliary for dis- 
cussing this equation is the matrix 2). which satisfies 
Qhe= Lye t1G G02). 

= 1)2+[ (G,")-7(G") 1— ivy | “viel 19, 


Gy2= QyeG1 Ge. 


(5.60) 


By employing the relation, 

GG =[(GY+GLTL(G))+ (4)7p, 
we may rewrite (5.60) in the form 
Qyo— [LG Y+-G 1 (G)A+ (G2) F9420)2= 1h. 


The matrix 2 occurs in the G; equations (5.3) and (5.21) 
in the form 22, where v is an instantaneous interaction. 
Consequently, for determining G, it is sufficient to find 
the submatrix in which the time variables of the two 
left indices of Q are set equal. Since the second term 
contains, as we read from right to left, 2 with equal left 
time indices, the operator (Gy)"+(G..)- which 
translates the total time but does not alter relative 
times, and the combination G,°+G.", which, on the 
one hand, changes either of the time arguments but not 


(5.61) 


(5.62) 
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both, and, on the other hand, leaves the pair of time 
arguments equal, we may conclude that it is only the 
combination G,°+G.°, at vanishing time difference, 
which occurs. This combination has a well-defined limit 
as the time difference approaches zero (reflecting the 
continuity of the Green’s function as the difference in 
time indices of two y or two y* field variables passes 
through zero). The equation for this submatrix may be 
written explicitly in the form 


(ry rot | Q) ry’ r2't’)+37{[ cot 3 (i—f)7) | 


+Leot (h(a) f at f ar." f dre" 


X (6 Poel [i(d 01) +2¢—hy he} Oe 
Xo(0y! — re”) (ry 12"t| Q| ry’ r2"t’) 
=(6(r1—1ry’)6(ro— re’) 


+6(r,—r2’)6(r2e— 11’) J6(t—’), (5.63) 


by removing a 6-function from the equation for 
(Ty Fol Q ry t)/ Toto’) = (ry Fol Q ry rot)’ )6 (ty! — 1s’). 


Since the single-time functions refer to a pair of equal 
times, the differential operator and the matrix ® satisfy 
periodic boundary conditions in the interval [0,7] for 
fermions as well as bosons. This property is auto- 
matically built into the equation by understanding that 
id/dt is represented by 2(v1+2)/t=2v/7, and that v 
must be even. In terms of this frequency mv/r, Eq. 
(5.63) becomes 


[ (wv/r)+26—Iy—he \(rire| Q(v) | ry'r2") 
—43{[coth(di(d—f)7) }*!+[coth Gi(de—f)7) ]*} 
X o(ry— re) (rire! Q(v)! ry'r0’) 
=[ (av/r)+2¢—Iy— he l[6 (11 — 1y')6(r2— 82’) 


+6(r—re’)6(r2—1’) ]. (5.64) 


Because of the spatial translational invariance of Q, 
and its simple dependence on the center-of-mass coor- 
dinate }{r:+ 12), this equation is best discussed in terms 
of the variables 

R= i (ri+r2), 


r=1— fo, (5.65) 


and the operators 
P= (1 1)(VitVo), 


p= (1/27)(Vi—V2). (5.66) 


Indeed the essential problem is construction of 


G(rr’; P,v) which satisfies 
Pp Vv" 
( +i-—+- Jato’; Po 
T 4m om 
—}{{cothl((4P+ p)?/2m—¢)hi7 ]}#! 
+{cothl((3P— p)?/2m—£)}i7 }}#"] 


Xo(r)G(rr’; Pv) =5(r—r’), (5.67) 
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in the momentum-space representation for the total 
momentum P. In terms of G, © is given by 


(ry! Q(v) | ry’r2") 


dP 
=f eh er Lote Pease, PH] 
27 )° 


9 9 


Tv if . 
x(’ “+23-—+—), (5.68) 


T 4mm 


where the differential operator acts on the primed 
variable from the right. 

The function G is, of course, also a Green’s function. 
Indeed, it is very similar to the Green’s function 
ordinarily encountered in the two-particle problem. 
More specifically, if we make the replacements 
E= (mrv/r)— (P?/4m), ir=B, f=, and p=—a/B we 
recognize G as the inverse of 


E+2yu— (P2/m)—3[{coth3[@(P— p)?/2m+a]}* 


+ {coth3[6(4P-+ p)?/2m+a]}+*"]v. (5.69) 


In this form physical interpretation is possible. As a 
approaches « (which corresponds to vanishing density), 
the operator reduces to the one occurring in the 
Schrédinger equation for the relative coordinate of a 
pair of particles. With uw fixed, the equation describes 
interaction in the presence of a medium which results 
in an effective potential whose value depends qualita- 
tively on the relation of energies of the individual 
particles to the chemical potential, y. 

Following conventional scattering theory, we intro- 
duce the useful auxiliaries 


t(P,v)G (P,v) =vG(P,v), 
SG (P,v)=[ (av/7) +26—M—he |", 


(5.70) 


in a matrix notation in the relative coordinate spatial 
variable. In virtue of (5.68), the matrix ¢ satisfies 


(r/é(P,v) | r’)+ (r|é(P,v) | —r’) 


=1(r) f (nr Q(v) | ry’r2’)d(R—R’)e~'P R-P), 


We may also write é as 
t=0+2G{[coth(3(M1—$)ir) ]! 
+[coth (3 (t2—f)ir) }*!}2, 


or in terms of the equation 


t=0+iG3{[coth(3(41—¢)ir) }*! 
+[coth(}(he—f)ir) J}, (5.73) 


which is exactly soluble for several interesting problems. 
Equation (5.73) also provides a basis for various ap- 
proximation techniques. For example, the solution to 
the ordinary Schrédinger equation for any potential 
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determines / such that 
{ — v1 JO, 
Together with (5.73), this equation implies that 


t=t+iGg {Lexp(i(—f)r) F173 
+[exp(i(do—f)r) FL} yim. (5.75) 


The latter equation, from which the potential has been 
removed, permits an iterative solution when there are 
strong repulsive short-range forces. Indeed in some cir- 
cumstances, it is possible to solve (5.75) approximately 
by other than iterative means, and also to set up an 
effective-range theory for operator f. 

As we have indicated, a major reason for determining 
t is to obtain an approximate equation for the single- 
particle Green’s function which takes certain two- 
particle correlations into account. Such an equation now 
follows from (5.21), (5.60), and (5.71), in the form 


TV Fa 
( +5—hi )Gnw rr) 
T 


i - - 
FDP L(n—re| ttt) (P, vv’) — 12") J 


a 
dP - 
KeiP (RR) ___de "dro! dreGy(re’’v’ ; rev’) 
(2m)’ 
x Gy (1y’v; ry/v) =6(1,— 1’) ; (5.76) 
where 
(rit’\r’)=(r\t|—r’). 
The effective one-particle potential in this equation is 
frequency or time dependent, and consequently the 
one-particle Green’s function can no longer be de- 
scribed, as in the Hartree approximation, by an effective 
single-particle Hamiltonian. 

As in the Hartree approximation, this equation is 
particularly tractable for systems which may be treated 
as uniform, that is, for which the Green’s function is 
given by 
. dp _ 
Gi(r—r’ 5 t-/) = rt! > G,(p,v) 


(29)? 
Xe imv(t—t’ Teip (6 r’) 


For such systems, Eq. (5.76) reduces to 


9 


1 dp’ 
*~ 2. 


T 2m T°’ 


= wv p 
Guine)-| Tt 


(27r)* 


X (3(p—p’)| (tt) (pt+p’, v+v’)|3(p—p’)) 
1 
x Gu(p'r) expt’) yas 
where 


(p|i(P,v) a= fe ip-t(r/i(Pyv) |r’ )e4 "drdr’. (5.7 
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By employing Eq. (5.38), we conclude 
du’ A (p’w’) 
Qe rv—w'r 


Tv p° dp’ 
-| ss beams +- a 


T 2m rv (2m)? 


JS< du’ 


« (}(p—p’)| 40) (pt+p’, v+v’)|4(p—p’)) 


xy’ RB 1 
x(=-«') A(p’w’) exp(iro")| . (5.80) 
T 


In order to proceed with the analysis of this equation 
we suppose that the eigenfunction expansion of G is 
known. In the a, 8 formulation as a— ~%, this ex- 
pansion would be the familiar one for (/—H,2.)~' where 
Hy. is the Hermitian operator that characterizes two- 
particle interactions. The existence of such a represen- 
tation more generally is inferred by treating any poten- 
tial as the limit of a sum of factorizable potentials for 
which its existence can be explicitly demonstrated. 
From the expansion and Eq. (5.72) a representation for 
(P,v) may be obtained: 
(r t(P, v+y’)| r’)=0(r)b(r—r’) 
v(r)p,(rr’)v(r 


tT 2,- — (5.81) 
n w(v+v’)— 


’ 


in terms of coefficients, p,(rr’), and poles, E,,, which, 
in general, need not be real. With the aid of this ex- 
pression the summation that occurs in (5.8) may be 
evaluated. We obtain for the inverse of G,, 


in Tv 
-P(0, ‘) (5.82) 
2m T 
where 


5 7 dp’ 
P(e. ‘)-f n(p’)[o(p=0)+0(p—p’) | 
T J (2) 
dp’ as i, ( P,P’) 
rf Ja : 
(2m) 2x » (xv/t)+w’—E, 


i Tv 
LG, (py) | | +f— 


1 
eter 
[exp(iw’r) #1] “Texpl int) —1] 


(5.83) 


XA (p'w’), 
and 


ra(psp)= fen P’) to (r)p,(rr’)v(r’) 


X[eid(r-P)-'’ + ¢ 3(p p’) +9”). (5.84) 


We may now reduce (5.80) to an integral equation for 
A(pw). This reduction is immediately achieved by ob- 
serving that both the left- and right-hand sides of 
(5.80) define analytic functions when rv/r is replaced 
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by the general complex variable z=w+i6. Since these 
functions agree on a set of points which have a limit 
point at %, and the functions are analytic at «, they 
are equal everywhere and 


1 
A (pw) =lim — ~ fae|— ~- —— |i 
win oni —w is w—w'+i6 
(5.85) 
is given by 
r( 


~ [w—e(po) +04 P(pw) 


(5.86) 


where 


p* 7 _ 
(pw) -—+¢=lim 3[V (p, o+75)+V (p, w—i6) ] (5.87) 
2m 50 


U 


ap 
-f On)' n(p’)[v(p=0)+0( (p—p’) | 


(Zar 


+f= af as *»(PP’) 
(2m) 2m n iia! —F, 
X {Lexp(tw’r) FJ" 


+[exp(iE,.7)—1]}°}A(p’w’), (5.88) 


1 " 
pw)=lim [0 (p, o—i)—V(p, w+i8)+2i8] (5.89) 
1 


im 25 +f- dp’ — f= du’ 
(2m)? 


7»(DP’) 
———{[exp(tw’r) #1]"' 
n lal —E,)?+8 


+ [exp (iF 7) —1} YA (p’o’ | (5.90) 


The latter approaches zero when the denominator, 
w+w’—E,, does not vanish for any choice of w’ and E,, 
for which the numerator is nonvanishing. On the other 
hand, if E,,—w’=w for some w’ and E,, for which the 
numerator is finite, the limit is 


du)’ 
T'(pw) = j- <a D’ 5(E,—w—w’) 7 n(pp’) 


1 1 2 
ft Nines 
ewr—1  eflwte’)r— 4 

The prime on the summation indicates that it contains 
only those poles, E,, which lie on the real axis, since 
these are the only ones for which limit of the second 
term of (5.90) is nonvanishing. Special consideration 
must be given to any real wy which satisfies wo= €(p,wo) 
and 1” (wo) = 26 — 0 since at such a point, (5.86) becomes 
A (pw) = 275 (w—(pw)). This special case applies, for 
example, to the noninteracting system. 


(5.91) 
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Equations (5.86) to (5.89) may also be derived by 


using the summation techniques employed with (5.31). 
When />?' we infer from (5.82) that 


dw en iw(t-t’) 


re (1Fe-*")[w— b—V(pe) +t) 





Gi(p, i—t')= 
(5,92) 


where the contour of integration is chosen to pass above 
the singularities of [w—A+¢—V (pw) ]“ (which lie on 
the real axis since they represent the possible energy 
differences of the system) and below the poles of 
(1 e-‘“"). By closing the contour in the upper half 
plane so that it encircles in a counterclockwise sense the 
points at which (1 e~‘*’) vanishes, we obtain the sum- 
mation (5.77), provided ia <r. By encircling the 
singularities of [w—h+¢—V(pw)]}" we obtain @ for 
all t—?/>0. Comparison of the latter expression with 
(3.26) leads to the relations (5.86), (5.87), and (5.89). 

We note that expressions of the form (5.86) and (5.92) 
follow for the rigorous single-particle function as well 
as the approximate Green’s function discussed above. 
Consequently these equations and those subsequent 
ones which do not employ the forms (5.83), (5.88) and 
(5.90) for V are generally valid. 

If we replace ir by 8 and ¢ by yu, we obtain for the 
real function 4*(p,w) 

18 ( pw) 


[w—e°9(puo) P+[31'°9( pa) 


p 
e*4( pw) =—+Re 
2m 


A*8( pw) = (5.93) 
where 
V8 (pw+i0), (5.94) 


and 


P'°4( pw) =2 Im V24(p, w—10). (5.95) 


When (5.88) and (5.90) are introduced, the functions 


€ (pw) and ['%8(pw) become 


p° dp’ 
€*8( pw) =—+ ie 
2m -/ (2n)8 
X Re[ (3 (p—p’) | t28(pt+p’, wtw’+i0) 
gi gahentat 


8(p’w’) dus 
it. Sind ale 
ext Bu’ 1 ( 29)! Me 
Av*(pia') 
: a (5.96) 
n ‘pba, —f, a8 g2atBEn— | 
dp’ du’ 
(2m)3 


X2 Im(3(p—p’) | *8®+7'*8) |$(p—p’)) 


1 
sani 


grat Blwto" Y seats 1 
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In these expressions, we have introduced the function 
(8 (Pw) =t8(P, v= (w—2u)r/r) and utilized the rela- 
tion ¢%(z*)=/*8(z)* which follows from the reality of 
(5.69) when z is real. 

As the notation is intended to suggest, the quantities 
e and I have simple interpretations, at least in the 
neighborhood of those points wo(p) for which 


(av dw) woK[1 — (de ‘dw) wo |. (5.98) 


€(p,wo)=wo and 


In the neighborhood of these points we may write 


Z (wo) ¥ (wo) 
BN ih ee ccenereemanntnns, (5.99) 
(w—wo)?+[ 37 (wo) |? 
where 


Z=Y r=[1— (de dw) wo |} - (5.100) 


When (5.99) continues to be a good approximation to 
A(pw) for values of w—wo which are sufficiently large 
multiples of y(wo), the frequency integral of A (pw) in 
this neighborhood equals 27Z(wo). Since the entire 
frequency integral of A(pw) has the value 27, a neces- 
sary condition for the dominance of the region is that 
Z(wo)~1, or equivalently that |de/dw|<«<1. For fer- 
mions, the additional restriction A (pw) 20 insures that 
there can be only one such point and that at this point 
Z(wo)<1 and de/dw<0. The limiting case of this be- 
havior arises when A (pw) is rigorously 276(w—wo(p)). 
We shall call any solution of wo=e(p,wo) a resonance 
and y(w) the resonance width. When the statistical 
factor in the expression for G,“(p,t) also varies slowly 
in the neighborhood of its resonances, their contribution 
to G; is given by 


+1G,°9(pt) =>; Z:{+1+Lexp +6wi(p)) F1}"} 
Xe-wsle-tvet,  §>0, 
=>; ZLexp(et+fu,(p)) 1} 


K em wilpe byil tl i<0. 


(5.101) 


We have here allowed for the possibility of several 
resonances, whose weights in the fermion example must 
satisfy }>; Z;<1. Equation (5.101) exhibits the relation 
between the widths of single-particle resonances and the 
corresponding lifetimes. These lifetimes reflect the 
approximate rate at which equilibrium is restored when 
a particle of momentum p is suddenly introduced or 
removed from the medium. We shall not enter here 
into a detailed discussion of the dominance of (5.101) 
but merely observe that it can be the major contri- 
bution only for a restricted time interval. This interval 
cannot extend to very short times since the discon- 
tinuity at ‘/=0 of the approximate form (5.101) does 
not generally reproduce the unit discontinuity of iG. 
This error in the discontinuity reflects the importance 
of the entire frequency spectrum of A(p,w) for very 
short times. The influence of the complete spectrum is 
only transient, however, the nonresonant contribution 
decreasing rapidly with time. Nevertheless, at sufh- 
ciently long times, this contribution overshadows the 
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exponentially decreasing term (5.101). As long as y(wo) 
is not exactly zero, the resonances will have lifetimes 
characteristic of the interaction strength and the par- 
ticle density. If y(wo) — 0 when 6-0, that is, if the 
second term in (5.90) vanishes identically, the lifetimes 
will be limited only macroscopically and superfluid 
properties will result. 

In the low-density limit (a— «), I’ reduces to the 
anticipated collision rate. The function [ is then 
expressed in terms of the imaginary part of the diagonal 
element of the scattering matrix /. The latter is related 
to the total cross section o by the optical theorem which 
states that ¢| p— p’| /2m=Im(3(p—p’) ‘+1’ 3(p—p’)). 
In terms of the velocity, v= p/m, of the particles we 
obtain the elementary low-density result 


/ 


dp 
rin) f ——a(\v—v’|)|v—v'|n(p’). (5.102) 
(23)? 


The expressions (5.96) and (5.97) also contain a term 
which indicates through its characteristic statistical 
denominator [ exp(2a+6(w+w’))—1}" the Bose nature 
of a composite system formed by two fermions or 
bosons. This term also contributes to the width when 
we depart from the limit in which a— ~. 

We note, finally, that Eqs. (5.94) and (5.48) may be 
used to test the validity of a single-level approximation 


A (p,w) = 295(w—wo(p)). (5.103) 


With this weight function, Eq. (5.94) implies that 


9 


or he 
wo( p) = —+] (p,wo(p)) 


2m 


9 


p° dp’ 
=— +f —{|e-br’ 
2m (27)? 


dp’ 1 1 
oo 
(2x)? om ett Bwo(p’) 4 e2ot+BEn—] 


r,,7( pp’) 
wy( P) +wo( p’) —f,, 


\n(p’) 


(5.104) 


Similarly the relation (5.48) for the mean value of the 
frequency yields 

oe dp’ 
ap) —+ I — 


—(|v+0'| )n(p’). 
2m (27)8 


(5.105) 


The difference of these expressions is therefore a 
measure of the dispersion in A. What frequency, if any, 
can be approximately associated with a given mo- 
mentum excitation when (5.104) and (5.105) differ, 
depends upon the time during which that excitation is 
important. This is reflected in the investigations which 
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led to (5.94) and (5.48). In the latter case we considered 
high frequencies and correspondingly short times; in 
the former we were concerned with frequency poles and 
long time behavior. 

Considerations similar to these apply to the multi- 
particle Green’s functions, and in particular, to the 
special density correlation function which we described 
in Sec. IV. Thus, when o is a continuous function the 
conductivity is determined microscopically, while when 
o contains a 6 function the system exhibits supercon- 
ductivity. 


VI. FORMAL SOLUTIONS. OTHER APPROXIMATIONS 


In previous sections we characterized the Green’s 
functions of the system by an infinite sequence of 
coupled equations and showed that numerous properties 
of the system could be obtained from approximate solu- 
tions of the first few of these. We turn now to the more 
general investigation of these equations and their solu- 
tions. We shall carry out this analysis with more power- 
ful field-theoretical techniques, which permit concise 
expression and formal solution of the Green’s function 
equations. From these solutions we shall then derive 
alternative methods of systematic approximation. 

In order to express the Green’s function equations in 
compact form, it is convenient to introduce arbitrary 
functions of space and time &(r/) and n(rt), called source 
functions, which are completely commutative when the 
y field obeys Bose statistics and completely anticom- 
mutative when the y field obeys Fermi statistics. By 
means of these functions, a Green’s functional may be 


defined: 
(m)---&(1) 


? “ ré 
Gleal=145 f —— 
l nN. 
01’) >» n(n’) 
Oe nae 


nN. 


x~G.(1,---: 1':- (6.1) 


In this equation, we have employed the conventions of 
Sec. 
coordinates and extends over all space and the time 
interval [0,7]. The individual Green’s functions are 
thus characterized as coefficients in the expansion of the 


V. Thus, the integration sign applies to all 2n 


functional, or equivalently as its functional derivatives: 


6) 61 61 61 - | 
=v Saini - ——G(é,n ] 
dn(n’) dn(1’) 6E(1) —-GE(n) quis 


=(G,(1---;1'---n’). (6.2) 


The distinction between left and right variational 
derivatives which is necessary for the anticommutative 
fermion source functions is indicated by the subscript / 
or r. With the aid of (6.2), it is easy to verify that the 
set of Green’s function equations (5.4) is equivalent to 
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the single functional differential equation 


0 61 6, 6, 
(i ~ivts) Fi f 0(1,2) — 
dt, d£(1) 5n(2*) 5&(2) 


6; t. 
x——— a(t) [Len] =0, 
5£(1) 
or its adjoint, 


a) 6, = 
[(-i—-s+7) — if (1,.2)————— 
Oly 6n(1) 6&(2- ) in(2 
6 


ae orig =0. 
1) 


dn( 


(6.4) 


In the interval [0,7] to which (6.3) and (6.4) are 
restricted, the boundary condition (5.17) specifies the 
particular solution of the differential equation in time. 
The additional condition necessary for the normaliza- 
tion of these equations is determined from (6.1) by the 


relation 7 
G[0,0]=1. 


A simple illustration of the functional technique is 
afforded by the direct integration of the differential 
equation in the trivial example with no interaction. In 
that case we obtain 
GLEé,n }= exp(&G1"n) 
for the Green’s functional. This functional serves to 
generate the multiparticle Green’s functions in the form 


(6.7) 


(6.5) 


(6.6) 


G,"=perm,n) or det ny)’, 


where perm,,) and det») are the permanent and deter- 
minant of the 2Xn coordinate indexed matrix G,°(i; j’). 

A successive correlation approximation for treating 
interactions may be introduced within this framework. 
As a first approximation we take a Green’s functional 
which contains no dynamical correlations, 


G[é,n]= explEGi7]. 


We then determine the function G, so that Eq. (6.3) 
is satisfied to the lowest order in & and n. The equation 
for G; thus obtained is the expected one, 


[i(0/dt) —y+¢]G,(1; 1’) 


(6.8) 


ste 1')G,(2; 2+) 


2+)G,(2; 1’) ]=8(1,1’). (6.9) 


As a second approximation we take an exponential 
form which reproduces correctly the first two powers 
of — and n, 
GLé,n]=exp(éGin) 

X exp (2!) HEE (G@2— 26141) nn (2!) 


+G,(1 


(6.10) 
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and determine the pair of functions by requiring that 
the coefficients of the two lowest powers of & and 7 in 
the functional equation (6.3) vanish. As a third ap- 
proximation we would include three-particle corre- 
lations, determining the coefficient functions in a 
functional 

GLé,n ]=exp(Gin) exp[ (2!) EE (G2—2G,G)yn(2!) ] 

Xexp[ (3!) 'eké(Gs—9GG4,4- 12G4,4,4)) 


Xnnn(3!)], (6.11) 


by satisfying (6.3) to third orders in — and 7; and so on. 
The pair of equations for G; and G2 which result from 
(6.10) are readily determined. The first equation is 


identical with (5.3), 


[i(d/dt))—Wy+¢]G,(1; 1’) 
if (1,2)09(12; 1'2*)=01,1, (6.12) 


and would be exact if the correct G» were inserted. The 
second equation is 


[i(0/dty) —h+¢]G2(12; 1/2’) 
i f o(1 39064113; 1'3+)G,(2; 2’) 
+ Go(13 ; 3+2')Gi(2; 1) +G2(13 ; 2'1)G(2; 3+) ] 
Fi f o(1,3)(Gal32; 1'3+) —[G4(3; 1’)G,(2; 3+) 


3+)G,(2; 1) G4 


+cyclic permutations of 1’, 2’, 3+ 


Fi f o11,3)(G,(21; 


34)Gi(1 


+G,(3; 


13+) -—[G,(2; 1G (1 


1’) ]}G,(3; 2’ 


+cyclic permutations of 1'2’3+ 


+G,(2; 


6(1,1’)Gi(2; 2')+6(1,2')G,(2; 1’). (6.13) 


lor weak interactions or low densities the last two sets 
of terms on the left-hand side of (6.13), which contain 
the interaction of one pair and the correlation of 
another, are relatively unimportant. If these terms are 
omitted, the equation may be re-expressed as 


[i(0/dt;) —h,+¢]{G2(12; 12’) 
—[G,(1; 1')G,(2 2’)+G,(1; 2')G,(2 


“fw: 2"")0(1,2")Ga(12” 


which we encountered as (5.55). Application of a low- 


; 1’) }} 


-1'2)=0. (6.14) 
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density or weak-interaction approximation again leads 
to the replacement of G,(2; 2’) by G,°(2; 2”) and thus 
to (5.57). We note that terms suggestive of an improved 
equation, in which the converse replacement is partially 
introduced, are included in (6.13). Thus, one term in 
(6.13) is 


+ f011,3)608;3°)G002; 1/2’) 


—[G(1; 1')G,(2; 2)4G(1; 2’)G,(2; 1’) ]}, (6.15) 
which is the direct interaction term that must be added 
to (G,°) in a Hartree approximation for Gy". With 
no further approximations, Eqs. (6.12) and (6.13) 
represent a second step in a systematic method whose 
first step is the Hartree approximation. In general, 
each step of this procedure differs from the corre- 
sponding one in the previously outlined scheme through 
the method of approximating the (+1)-particle 
Green’s function in the first m coupled equations. In 
(6.13), unlike (5.54), the arguments of the Green’s 
function in the interaction term are treated sym- 
metrically. Indeed, it is the terms in (6.13) not present 
in (5.54) which permit discussion of long range collec- 
tive motions. 

The great advantage of the functional formulation 
(6.3) lies in the possibility of obtaining formal ex- 
pressions for the multiparticle Green’s functions and 
for the trace. One form for the formal solution of the 
differential equation is easily obtained. We note that 
the equation contains and a differential operator 
which can be expressed as [A, ], where A is defined by 


a 6 0 6 
A= +f - — (i +5-1)- 
| 5n(1*) 5£(1) 


(6.16) 


1 6 6 6 6 
ee 
bn(1*) 8n(2*)  ” 8E(2) oe 


Since [A,[A,n ]]=0, we may write Eq. (6.3) in the form 
dne~4G(E,n ]=0, (6.17) 


and infer that ne~“G vanishes. From this equation and 
the analogous one for — we obtain 


GLE,nJ=esLé,n], 


where 6[ £,n] is a functional which satisfies né[£,n ]=0 
and £6[ §,n ]=0. 

We arrive at a more useful expression for the Green’s 
functional by a rearrangement of (6.18) which utilizes 
the possibility of describing the effect of the interaction 
potential v in terms of the behavior of the system in the 
presence of an external, single-particle, generally spati- 
ally nonlocal potential U(11’). Using a four-dimensional 
matrix notation in which the time dependence is given 
by 6(4;—t)'—O+) and defining G[é,U] in the 


(6.18) 
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presence of this potential by 
GLE, l ; J a ae lbLE,n J, 
6 (6.19) 


ALU ]= ax f — a)’ 


ial) 
we obtain the identity 


6 


6 - i 
GLé,n,U]=———Gé,n,U]. (6.20) 


6n(1) 5¢(1’) 6U (11’) 


The description of the interaction effects is transferred 
from the source dependence to the potential dependence 
by using this relation to replace 


— 5 . : 2 
Ii bn(1+) 8n(24). Oy (2) 5£(1) 


6 6 
by f v(1,2 
sU(1) ” 6U(2) 


in (6.19). (A local potential Uv has been employed to 
simulate the local interaction v.) We then have the 
expression 


a6 6 
G(é,n,U J= exp(~ — —) 
26U "aU 


6/20 6 
xexp| #—(i—— k+-— v)— fice (6.22) 


dn\ dt 





(6.21) 


The effect of the source derivatives on the 6-functional 
is readily obtained since it is, apart from a functional 
of U, the Green’s functional for a noninteracting system 
subject to an external potential. This functional is the 
analog of (6.6) with G,’ replaced by GU], 


GLU ]=[i(0/at)—h4+¢-U}? 


=[1-Gyu}6G,, (6.23) 


and therefore it follows that 


6/90 67 
exp] #1 ——k+-¢—U ) ten 
6n\ dl 6& 
= D[U] exp(G,"LU ]n), 
where DU ] is a functional of U. A differential equation 


for this functional is obtained by applying the relation 
(6.20) to (6.24) and setting &==0, 


(6.24) 


sD[U]= + f owen’; UYU (1'1). (6.25) 


By means of (6.23) this equation may be written in the 
form 


6 nD[U]=¥tr[ (1-GU)6(1-GU)], (6.26) 
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where tr indicates diagonal summation of the coor- 
dinate-indexed matrix. In view of the relation 
6 In det Y=tr(X"5X), (6.27) 
the solution of (6.25) is, apart from a constant, 
D[U]=det®!(1—-G UV). (6.28) 


An equivalent expression is obtained by letting U — «U 
and 6U — dU in (6.26) 
dInD[U] = Atrl (1—«GyU)GyUd«], (6.29) 
whence, on integrating from x=0 to x=1, we obtain 
DCU ]=exp[¥tr In(1—G,°U) ]. (6.30) 
Apart from a constant, the functional Glén,U] is 
therefore given by 


‘ i 6 6 5 
Len ]-exni - ——) det*!(1-—GU) 
26U 6 


U 
Xexp(tGPLU ]n). (6.31) 

In particular, to within a constant, 

G[0,0,U ]=exp(w[U ]) 


is 3 
00 ohn 


det¥!\(1—GyU). (6.32) 


If we redefine the constant in G[£,»,U] so that 
a td 6 
GLé,n,U ]=exp(— WLU ]) exp -—) 

2 dU 6U 


Xdet*!(1—GyU) exp(tGy[U Jn), (6.33) 
we see that this expression satisfies the analog of (6.5), 
and that its expansion in sources generates the Green’s 
functions 


G,[1,--:0;1'-+-n's U] 


16 6 EY 
=exp(— WLU ]) exp( +) det™!(1—G,°U) 
26U 6U 


X perm n) or det,» G,Li,j’; U]. (6.34) 


When U is set equal to zero, these functions become the 
previously defined Green’s functions. 

We turn now to the determination of the trace, 
expW. The form of the Green’s functions suggest that 
expWL[U ] is related to the trace, expW. This relation 
is made precise by introducing a functional WLU] 
which satisfies the analog of (3.7), 


(6.35) 


0 - 
“WU ]=¥rG,[U]. 
0g 
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In this expression the trace is the sum of diagonal 

matrix elements obtained by letting the left time index 

approach the right time index from earlier times. The 

corresponding derivative of W is given by 

ow i6 6 
—=Fexp(— WLU ]) exp(* —+—) 

og 26U 6U 


Xdet™! (1-4 UV) tr[(1-GyYU) 7G YG yU], (6.36) 


which, by (6.23), is equivalent to 


46 6 
Fexp(— WLU ]) exp(- —+—) 


26U 6U 


Xdet™(1—G,U) tr(GYLU]—G,). (6.37) 
Substitution of (6.34) in (6.37) leads to an expression 
analogous to (6.35), and thus to the relation 


(0/d¢)(W—W)= ¥tr G,’. (6.38) 


The right-hand side of Eq. (6.38) may be replaced by 
its known value (5.27), 
FtrGy=ir trLexp(i(h—-) 1) 1} 

l 


= ber tr In[iFexp(—i(—$)7) J, 
ag 


where tr“ is the trace of the spatial coordinate matrix. 
Integrating with respect to ¢ and determining the 
constant by noting that W, W, and tr“ all approach 
zero when ¢ — 1 (effectively the limit of zero particle 
number), we finally obtain 
r , 1 ) 6 « ~ ry , 
expW =expW y exp{ - —v -) det®!(1-—GU), (6.40) 
26U bl 


(6.39) 


or its equivalent 


16 6 
expW =expWo exp(- —v —) 
26U 6U 


Xexp[1trin(1—GU)]. (6.41) 


These expressions identify expW o, 
expW = det ®F1(1-Fe- ir) 
= [Jal 1Fexp(—i(ha—f)r) |", 


as the trace for a system with no interaction. 

Just as the occurrence of exp[+i(6/6n)U (6/6&) ] in 
(6.19) allowed conversion of source derivatives to 
external potential derivatives (6.20), the appearance 
of exp[_(i/2)(6/6U’)v(6/6U) | permits the replacement of 
external potential derivatives by interaction potential 
derivatives. By employing the relation 


(6.42) 


) i 6 ) 
ev= e”. (6.43) 
6v(12) 2 6U (2) 6U (1) 
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and the identity that follows from (6.2) and (6.20), 


[6/6U (n+1) Je"G,[1,---n; 1'---n’,U] 


=+e"Gas fl, «+n +1; 1'+-(n+1)+;U], (6.44) 


we obtain 

5W /6v(12) = 3iG2(12; 1+2+). (6.45) 
Equations (6.43) and (6.44) may also be employed to 
evaluate higher derivatives of W. For its second 
derivative, they yield 


6° bW OW 


=¢ Ww eVv— 


6v(1,2)80(3,4) 8v(1,2)80(3,4) 
= (44)°(G4(1234; 1+2+3+4+) 


ew 
60(1,2) 60(3,4) 


—G.(12; 1+2+)G_(34; 3+4+)], (6.46) 
and for the higher derivatives, they generate analogous 
but successively longer expressions. 
If we consider variations in the strength of an inter- 
action potential «v, we may specialize (6.45) to 
dW(x) = 
— =4i foa,neer; 1*2*), (6.47) 


dk 


where expW (x) is the trace and G* the Green’s functions 
for the system with potential xv. Since W(x) is propor- 
tional to the occupied volume for all potential strengths 
xv, the right-hand side depends linearly on the volume. 
The expressions for the higher derivatives of W are 
likewise linearly dependent on the volume. On the 
other hand, individual terms in the expression for 
d"W/dx" need not be proportional to the volume. In 
particular, the various terms subtracted from the first 
one encountered in the evaluation of d"W/dx" by the 
procedure (6.46), 


da” 
e W («)__ eW (« — 


dx” 


= (31) J v(1,2)---v(2n—1, 2n) 


X Gont(1,- ++ 2m; 1+,---2nt), (6.48) 
serve to eliminate its dependence on the second to mth 
powers of the volume. 

In the special case x=0, all derivatives may be 
expressed in terms of products of noninteracting single- 
particle Green’s functions. Under this condition, we 
verify by induction that computation of the mth 
derivative of W involves the subtraction from 


Con°=perM(2n) OF det canyG°(7; 7), 
of those terms and only those terms 
Gy9(1 5 1)Gy9(2; 72) + + +GY9(2N; ion) 


? 


(6.49) 
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whose arguments i admit a partition in which %, 2, 
++ +49, < 21 for some 1<n. The eliminated terms are the 
ones whose integrals 


fra. --v(2n—1, 2n)G (1; i3)+ + 


XG,9(2n; i2n) (6.50) 
may be factored and therefore behave as powers of 
the volume higher than the first; the remaining ones 
yield integrals depending linearly on the volume. We 
designate the latter terms as the connected part 
Con0(1,+ ++ 2m; 1,+++2n) of Gon(1,-++2n; 1+,-+-2nt). Tf 
the interaction potential is one which admits a Taylor 
series expansion of W—W» about zero potential, the 
combinations C2,° which characterize the potential 
strength derivatives of W—W po provide the expansion, 


ew 1 1 n 
w+We-X— f ( ) 0(1,2)+- 
1 n! 2 


v(2n—1, 2n)C2,9(1,---2n;1,---2n). (6.51) 

By setting r=—i8, ¢=—a/8, we obtain for the 
pressure, particle density, and energy per unit volume, 
expansions in powers of the potential which are ex- 
plicitly independent of the volume. Special examples of 
expansions of this character (linked cluster expansions) 
have been derived from perturbation theory by several 
authors. Unfortunately, such expansions of W, which 
imply similar expansions of all the Green’s functions, 
are not generally permissible. 

We continue the more general discussion by recasting 
the expressions for the Green’s functions in differential 
form. For this purpose we employ the relation 


(: 6 6 ) 16 6 
exp{ -—? f(U) exp( —- v ) 
26U 6U 2 6U 6U 


0 
= j(u+i ) (6.52) 


and Eq. (6.32) to write (6.34) in the form 


- very PETM(n) ; i 
G,=e "UI Gy) i,j’; U+iv— }e™™, (6.53) 
det (n) 6U 
From this expression we derive differential equations 
for Green’s functions by applying 
(10/dt+$—h—U—iv6/5U) 


to e"G, and using the identity (6.44), 


(Gry 0 Gi fo, nt+DGai 


=D (-1)78(1,7) Gra. (6.54) 


/ 
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We may also employ the relation 


6 6 OW 
wo eet 
6U 6U 6U 
to express (6.53) as 
—  perm(ny _ 6 6W 
G,a= afi # Utie +i | (6.56) 
det (n) 6U) =b6U 


This form leads to the differential characterization 


II 5G, a 1,, 


7=1 


(6.57) 


where 
F= (G,°)- —U—106/6U —i08W /8U, 


1,= perm(,) or det (n)0(J,J’). 


In particular, the one-particle function satisfies 


5,G,[1; 1’; U]=6(1,1’). (6.59) 

Approximate solutions to these equations may be 
determined by methods other than those discussed in 
Sec. V. For example, it is sometimes convenient to 
separate the effects of direct interaction from those 
due to exchange and correlation. The former appear in 
the equation for G, through the average local potential 
associated with the density, 


(iv6W /8U) (1) = +i f'0(1,2)G,(2; 2+); 


the latter, constituting the difference between the 
direct effect and fiv(1,2)G.(12;1'2+), arise from 
(iv6G,/6U)(1). We may introduce an effective local 
potential, V’, 


(6.60) 


6W 
viw=va+i(r—)ay, 
6U 


and rewrite (6.59) as 


6V’(3) 6 
XG[1; 1’; V’]=6(1,1)), (6.61) 


where we regard G as a functional of V’ rather than U. 
Together with the equation for 
k’ (1,2) =6V"(1)/6U (2), 
6 
(2) =8(,2)- f¥(,3)"(4,2) 71(3; 3+), 
6V’ 


(4 
(6.62) 


{which is related to the density correlation function 
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since we also have 


w'(1,2)=0(1,2)+4 f o1,3)06023; 2+3+) 


—G(2; 2+)G(3; 3+)],} (6.63) 
Eq. (6.61) determines the single-particle Green’s 
function and &’. The latter represents the change in 
potential produced by an alteration of the external 
potential, together with the induced change in the 
direct interaction potential. It may therefore be loosely 
described as an inverse dielectric constant. In a first 
approximation, which sets 


6G (1; 1')/6V" (2) = —6(1,1')6(1,2), 
K’ satisfies 
H(1,2)26(1,2)-4 fo(1,3)40(4,2) 


x G,(3; 4)G,(4; 3), (6.64) 


and G correspondingly satisfies 
[(4.9(1)) 7-1) Gi (1; 1)-i fone; 3) 


XK’ (2,3)G1(3; 1’) =6(1,1’). (6.65) 
This approximation is useful for treating the high- 
density electron gas.’ In particular, a knowledge of 
K’—1 at equal times, which we denote by (X’—1)(0), 
serves to characterize the dependence of W on the 
interaction strength and exhibits its linear dependence 
on the total volume, (V): 


bw 1 © 
focal. — G0; 196252] 
6v(1,2) 2 


=1 f oca,ncecrs 1+2+)—G(1; 1*)G(2; 2*)] 


=7(V)(K’—1)(0). (6.66) 

The modifications in the interactions characterized 
by & occur in the expressions for all the Green’s 
functions. So do terms like those discussed below (6.14) 
which describe the effect of the medium on propagation 
over long periods of time. One method for exhibiting 
these features is to introduce an effective potential 
VU ] of the type encountered in Sec. V: 


[(G))7-V]G,=1. (6.67) 
While V’ is inappropriate to the discussion of strong 
short-range interactions for which (6W/6U)G, and 
6G,/6U are separately large and compensating, the 
effective potential V is generally applicable. Associated 


9K. Sawada et al., Phys. Rev. 108, 507 (1957); 106, 372 (1957). 
M. Gell-Mann and K. A. Brueckner, Phys. Rev. 106, 364 (1957) . 
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with V is a response function X, 
6V(1,1’ 6G,71(1; 1’ 
K(1, 1’; 2)= Sas A 
6U (2) 6U(2) 





which also satisfies the equation 


6G,(11'; U)/6U (2)=[GiK(_ ; 2)G,](1; 1’) 
=+[G2(12; 1'2+)—Gi(1; 1)G,(2; 2+)], (6.69) 


where the unspecified indices of K are understood to 
fit into the matrix multiplication. This response function 
is explicitly given by 


5 

K(1,1’; 2) =8(1,2)6(1,1'}% f 0(1,3)— 
6U (2) 

<(G2(13; 1”3)G.7(1"; 1’)], (6.70) 


and may be determined by approximately solving the 
equation analogous to (6.62), 


K(1,1’; 2)—8(1,2)6(1,1))=43 f 01,3004 2) 


«6/6V (4,4) (G2(13; 13)G.7(1"; 1’). (6.71) 


In the weak-correlation approximation we may make 
the replacement 
fares; 1”3)G,7(1”"; 1’) 

65 (1,1/)G,(3; 3)4G,(1; 3)6(3,1'), (6.72) 
and utilize the exact relation 


6 i ‘s 
———G@,(1; 2)=G,(1; 4)G,(4’; 2), 
5V (4,4’) 


(6.73) 


thus obtaining 


K(1,1'; 2)226(1,2)6(1,1) +4 f 0(1,3)G033 4) 
«GA (4; 3)K(4,4’; 2)6(1,1’) 


+i fare ;4)G,(4’; 1’) 

X K(4,4'; 2). (6.74) 
The equations for the Green’s functions may be written 
in terms of K by regarding them as functionals of V 
rather than U’. We first introduce the identity 


a. . = 2. 
sx =30,G0X=|1-i f o( 0G, lax (6.75) 
5U (a) 


for arbitrary X. If we set Y=1, we obtain 


sw , 
V=U+in +i fo a)G,K( ;a), (6.76) 
aay 


AND FF. 


SCHWINGER 


or more explicitly 


v4 1y=[vay+if (1 2h 1’) 
7 eeu)” 


+i fo.yda 1”)K(1",1'; 2). (6.77) | 
By applying (6.75) to (6.57) we obtain 


" 5 
—$ ) 1 ‘es n =tI1n- J 
Il : i fo Oral |@ [U]=1n. (6.78) 


7=1 


We shall henceforth adopt a summation convention for 
repeated literal indices and omit the integration sign 
in expressions like (6.78). 

The equation for G, is then given by 


{{1—io( ,a)G,6/8U (a) G7}, 
X {[1—i0( b)G15/5U (b) G7} GeLU ]=112, (6.79) 


and is reduced to 
(Gr), {1 = iv( b)G6/5l (6) 1G} Gof l [|= 1 12 (6.80) 


on multiplication by the inverse of the functional! 
derivative expression for particle 1. When the corre- 
sponding derivative expression for the second particle 
is moved to the left, (6.80) becomes 


{(1—iv( ,b)G,8/6U (6) }o(G7) (G1) 
+iv(2,6)[6(G17),/6U (6) }}G.[U ]= 1x2, (6.81) 
which, by the argument applied to (6.79), is equal to 
{(G:2):(Gr7)2—[1—i0( ,b)G46/8U (6) J 7 
X iv(2,a)[K(11'; a) 1} Gol U = 1s. 
To approximate the exact (6.82) it is convenient to 


prove an identity which we shall apply to the inter- 
action term. We note that (6.75) implies 


FIX =G,[1—io( ,b)G16/6U (b) JX. 


(6.82) 


(6.83) 


If we set X=V’ we obtain an equivalent expression 
for the combination 


$V" (a) =[(G4)— V’— i086 /5U FV’ (a) 
=[((G.)7— V’— iv /65U}? 
X[V’(a)+i0(a,b)5/5U (6) ], 
which may be written as 


$V! (a)=[V’ (a) +iv(a,b)5/5U (b) G4, 


(6.84) 


(6.85) 


(6.86) 


in the virtue of the commutativity of the two operators 
in (6.85). 
The identity obtained from (6.83) and (6.86), 


[V'(a)+iv(a,b)5/6U (b) 1G, 


=G,[1—io( ,b)G\6/6U (6) }-°V'(a) (6.87) 
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may be reduced to 
iv(a,b)6G,/5U (b) = iv(a,b)G,K( ; b)G, 
=G,[1—iv( ,c)Gs6/8U (c) 
Xiv( ,b)G,6V'(a)/6U(b) (6.88) 
by subtracting V’(a)G,, so that 


[1—iv( ,c)G,6/5U (c) Piv( ,b)G,K(a,b) 
=iv(a,b)K( ;b)G;. (6.89) 
An equation for G: is obtained from (6.82) and (6.89) 
by two approximations. The first involves neglecting 
the dependence on U of the function ®’, in (6.89), and 
leads to 
[1—iv( ,c)G16/5U (c) iv(_,a)G, 


~iK’(a,c)v(c,b)K( ;b)G,. (6.90) 
We describe by v the symmetric combination 


v(a,b)= K’(a,c)v(c,b) (6.91) 


occurring in (6.90), and prove its asserted symmetry 
by observing that 


olIT 


frassas=vt.9+i foi1see9 


(6.92) 


= fo11,40002,9). 
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The matrix equivalent of this identity, K’»=vk’', 
implies this symmetry since 


v=K v= 0(K'T)=0(K'") T= (Kv) T=vT. (6.93) 


The second approximation involves replacing the inter- 
action term in (6.82), 


1G.=[1—iv( ,b)G,6/6U (b) ]iv( ,a)e 
XK( ;a):G[U], (6.94) 


by 
IG2={[1—iv( ,b)G6/5U (b) iv ,a)G} 2 


xK( 5a)(Gi)G:, (6.95) 


in which the differential operator acts only on the 
bracketed expression, thus ignoring the U’ dependence 
associated with the other particle. 

From (6.90) and (6.95) we obtain an approximate 
symmetrical interaction kernel, 


~iK( ;a)iv(a,b)K( 3 d)o, (6.96) 


and an equation for é,, 

G.= (Gy)1(Gy)o1 wt i(G;) 1 (G,)oIGo, (6.97) 
which indicates more explicitly than (5.56) modifying 
effects of the medium on the interaction characteristics 
of particles. 
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A general method of calculation is described for quantum statistical mechanics. It is based on a simpli- 
fication of the Laplace transform of the density matrix which follows from a theorem due to Hugenholtz. 
The basic result is that an element of the density matrix can be written as a sum over graphs, with the 
contribution of each graph factored into contributions from connected or linked graphs. Applied to the 
grand partition function, the exponential formula of Bloch and DeDominicis is obtained in a simple way. 
A similar formula is then derived for the canonical ensemble for the case of a nondegenerate gas. In this 
way the familiar result of Uhlenbeck and Beth is obtained for the second virial coefficient. Techniques are 
also introduced for evaluating ensemble averages of operators. In this connection, some care must be 
exercised in the case of diagonal operators. Finally, these methods are used to calculate the pair-correlation 
function for a system of fermions interacting through short-range forces. 





I, INTRODUCTION 


HIS paper describes a general method of calcu- 
lation for the quantum statistical mechanics of a 
system of interacting particles. It is based on a theorem 
due to Hugenholtz,! which allows simplification of the 
Laplace transform of the density matrix.” From this we 
derive a cluster expansion for the density matrix, and 
give several applications of the result. 

Brueckner has called attention to the fact that per- 
turbation methods for many-body systems involve 
formal complications not present in the conventional 
perturbation methods.’ The source of these difficulties 
is the possibility—for extended systems—that many 
particles interact simultaneously. Thus a straight- 
forward expansion in powers of the interaction energy 
and the retention of only low powers is not expected to 
yield a good description of a many-particle system. On 
the other hand, different groups of the simultaneously 
interacting particles act independently in the sense 
that particles in one group do not interact with those 
of another. Thus the wave function may be factored 
into terms referring to independent clusters of inter- 
acting particles. Brueckner introduced the term “linked 
clusters” to describe such factors. 

The same difficulty also arises in statistical mechanics 
when expansions in powers of the interaction energy are 
attempted. For classical statistical mechanics, a sys- 
tematic expansion applicable to gases has been pre- 
sented by Ursell and Mayer.‘ Recently a number of 
new methods have been proposed for studying the 
equation of state of quantum-mechanical systems. Some 
of these procedures are adaptations of techniques that 


* Work done under the auspices of the U. S. Atomic Energy 
Commission. 

1N. Hugenholtz, Physica 23, 481 (1957). 

? Kenneth M. Watson, Phys. Rev. 103, 489 (1956). 

3K. A. Brueckner, Phys. Rev. 100, 36 (1955). 

*H. D. Ursell, Proc. Cambridge Phil. Soc. 23, 685 (1927). 

5 J. E. Mayer, J. Chem. Phys. 5, 67 (1937). 

°F. E. Salpeter, Ann. phys. 5, 183 (1958). 


have proved useful in quantum field theory and nuclear 
physics. For example, there is the recent work of Lee 
and Yang’ which is closely related to the classical ex- 
pansions. Matsubara has given an extensive field- 
theoretic exposition of the grand partition function*® 
which makes use of time-dependent perturbation theory 
and Wick’s theorem. Bloch and deDominicis have 
extended this work in several recent papers."-” In 
particular, they have solved the difficulty mentioned 
above and given a linked-cluster expansion for the 
Gibbs potential. Similar results have been reported by 
van Hove." Montroll and Ward" have also obtained 
expansions in terms of graphs for use in quantum statis- 
tical mechanics. 

The methods presented in this paper are related to 
some of the above work, particularly that of Bloch and 
deDominicis.” In addition to giving a novel and simple 
development, we have also extended the general results 
and presented some new applications. 

In Sec. II the theorem of Hugenholtz is reviewed 
and then used to give the expansion of the density 
matrix. In Sec. III this result is used to evaluate the 
grand partition function, leading to a result similar to 
that of Bloch and deDominicis.” The evaluation of the 
partition function for the canonical ensemble is then 
considered in Sec. IV. Techniques for obtaining the 
ensemble average of an operator are considered in Sec. 
V. In the last section this result is applied to the cal- 
culation of the pair correlation function for a system 


of interacting fermions. 


7T. D. Lee and C. N. Yang, Phys. Rev. 113, 1165 (1959). 

8 T. Matsubara, Progr. Theoret. Phys. (Kyoto) 14, 351 (1955). 

®D. J. Thouless, Phys. Rev. 107, 1162 (1957). 

0G. C. Wick, Phys. Rev. 80, 268 (1950). 

1 Claude Bloch, Nuclear Phys. 7, 451 (1958). 

2C. Bloch and C. deDominicis, Nuclear Phys. 7, 459 (1958); 
10, 181 (1958) ; 10, 509 (1958). 

‘8 Leon van Hove, lecture given at the University of California, 
September, 1958 (unpublished). 

‘4 E. Montroll and J. Ward, Phys. Fluids 1, 55 (1958). 
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QUANTUM 


II. EXPANSION OF THE DENSITY MATRIX 
INTO CLUSTERS 

We consider a system of NW similar particles, each of 

mass M, in interaction within a large volume 0. The 

Hamiltonian describing their motion is 

H=K+V. (2.1) 

In this expression K is the kinetic energy: 

K=Yieditar, «x=h/2M. 

The quantities a, and a, are, respectively, creation and 

annihilation operators satisfying the usual commutation 
relations 


(2.2) 


[ax,axt ]4=5x, k’y 
Lax,ax Je =Laxt,axt =O. 


The + sign refers to Fermi-Dirac statistics, while the 
— sign refers to Bose-Einstein statistics. The quantity 
k represents the momentum of a single free particle 
(A= 1); when used as the label of a state or an operator, 
the label & represents the momentum of a single free 
particle (4=1) and its spin (if any). 

The interaction energy V is 


V=2 D> (kl V\mn)ax'ar'andmn, 


k,l,m,n 


(2.3) 


(2.4) 


where 
(kL| V | mn) = (X;.(x1)X1(x2), 0(%1— X2)Xm(X1)X n(X2)) 
F (Xu (41)X 1x2), V(X1— X2)Xm(X2)Xn(41)). (2.5) 


The quantity 2(r) is the potential energy of two par- 
ticles separated by a distance 7; it may also include spin 
interactions. The wave functions x are 


Xe (x)= U-le*™: 2S, (2.6) 


where S is a spin function if the particles have spin. 
Finally, the — or + sign refers to Fermi-Dirac or 
Bose-Einstein statistics, respectively. 

We shall often use a momentum representation for the 
entire system. Thus the ket! p) specifies the momenta 
of the .V particles; of course it also is an eigenstate of 
the kinetic energy operator: 

K|p)=E,|9). (2.7) 
Here E, is the eigenvalue of the kinetic energy for the 
state | p). 

The equilibrium properties of the V particle system 
are completely described in terms of the operator e~*”, 
where 8! is the temperature times the Boltzmann 
constant. It is useful to introduce the Laplace transform 





Fic. 1. Contour of inte- 
gration @ for Eq. (2.8). 
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ice spf 
>< 


Fic. 2. Typical 
low-order graphs in 
the expansion of 
(E—H)7: (a) simple 
scattering graph, (b) 
repeated scattering 
of two particles, (c) 
example of a discon- 
nected graph. 


of this operator,” 


pu — 


e — | dke**W(E), 
2ri Ye 
where 
W (E)=1/(E—#). 

Here E is a complex number, and the contour of inte- 
gration @ is illustrated in Fig. 1. First one integrates 
parallel to and above the real axis from + to a point 
to the left of the lowest eigenvalue of H. At this point 
the contour crosses the real axis and returns to + 
below the real axis. 

We now consider the expansion of the operator W (EZ) 
in powers of V. A typical term in the expansion has 
matrix elements 


1 1 | 
*" yy a ?). (2.10) 
Co — E-K E-K E-K| 


For the terms in this expansion we use the graphical 
representation of Hugenholtz.' A typical matrix element 
of V may be represented by two directed lines, crossing 
at a point, as in Fig. 2(a). Definite states are associated 
with the two lines, both before and after the scattering. 
We shall refer to this as a simple ‘‘scattering” graph. A 
typical term in an expansion such as Eq. (2.10) is then 
represented by a combination of the single-scattering 
graphs of Fig. 2(a). For example, we represent the term 


1 1 
(Riko! |v | Ry 'k. 2’) = 
E-e(k; \ —e(ke ') 
1 
|v | Rik2) s ; 
E— (ki) —€(k2) 


(2.9) 


E — (hy) — ef (ke’") 


X (Riko! (2.11) 


by the graph of Fig. 2(b), using the notation €(k) = ex. 
A graph, every part of which is connected to every 
other part by lines, is said to be a ‘‘connected” graph. 
The graphs of expansions, such as (2.10), need not be 
connected. This is illustrated in Fig. 2(c). Graphs that 
are not connected are called “disconnected.” 
If a graph contains “disconnected parts,” 
due to Hugenholtz! permits us to “factor” 
into parts each containing only ‘“‘connected graphs. 
This procedure is the cornerstone of our method, 


a theorem 
this graph 


” 
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The validity of this factorization may be understood 
as follows. Let a typical term (2.10) contain a particular 
connected graph G2 and any number of other graphs 
(in general disconnected) which we shall call G,. The 
contribution of all these graphs to W, summed over all 
permutations of the order in which the interactions of 
G, and G» occur, is indicated by 


| I ! 
WG G)=(9|— = ») . 
e~8=7,~¥60 1 03 


Here the subscript (1,2) means that we pick out the 
expansion of (E—K—V,—V>2)~' only those terms which 
give the required graphs. For the graph G, alone, or G2 
alone, we have the corresponding expressions, 


(2.12) 


| 1 
wG)=( pi — »). 
E-K-J;, 1 
| 1 
wiG:)={ py ——_— p). 
|\E—-K—V2] 7 


Here | p;’) and | po’) represent the states obtained from 

p) by the transitions of graphs G, and Go, respectively. 
Likewise the interaction terms V; and V»2 represent the 
particular terms in V that are required for the transi- 
tions in G, and Go, respectively. 

The graphs in this discussion, i.e., those in Eqs. 
(2.12) and (2.13), involve a specific set of intermediate 
states.’® The sum over these virtual states will be carried 
out at a later stage of the calculation. Now for a Jarge 
interaction volume VU, the interactions associated with 
the two parts of the graph, i.e., V; and V2, may be con- 
sidered to refer to different states. It is true, of course, 
that the sum over intermediate states occasionally 
gives terms in which the same states are involved. 
However, these cases are usually less important by a 
factor UO", assumed to be very small. Further discussion 
of this question is discussed in Appendix C. 

Thus we are justified in treating the interactions V, 
and V2 as referring to different states. We can next 
introduce a kinetic energy operator K2 which refers only 
to the states occurring in the connected graph G2, and 
also define the operator K,=K— Kg. Because the two 
graphs commute, direct application of Cauchy’s 
theorem gives 


(2.13) 


1 1 


1 1 
— fe 
E-K-V;-V; Oni E-t—K,-Vit—Ks wer 
(2.14) 


The contour of integration is similar to that of Fig. 1, 
except that it is sufficiently close to the real axis that 
no singularities of (E-—¢{—K,—V;,)" are enclosed 
within the ¢ contour. More specifically, we may choose 
a representation in which (K,;+V;) and (Ko+V2) are 
both diagonal with eigenvalues A; and A». Then (2.14) 


16 Thus the examples of graphs in Fig. 2 include definite mo- 
menta for all the states involved. 


HECKROTTE, 


AND WATSON 
obviously holds in the form 
1 1 1 


' ‘ 
a Omi E- — Ato i. 


By nie the matrix elements of Eq. (2.14) appro- 
priate to the graphs G, and Gz, we obtain immediately’® 


1 1 
2mi E-§—K,-—V,l /1 


1 | 
x{ ‘ >- (2.15) 
es if —Ke—- vi 2 


We simplify this relation by introducing the notation 
Ki|p)=Ei|p), K2|p)=E2|p), 
K | p)=(E:+E2)|p)=E,| p). 


We also define {=2+ £2, and now Eq. (2.15) becomes 


1 
WGG)=— f ack ps saiencniinnds *) 
2mi aan 


| »). | 
x — . (2.16 
(>: eh 


Equation (2.16) is just the statement of the Hugenholtz 
theorem.! We have developed it in a manner suggested 
by Hugenholtz! and also by Riesenfeld and Watson." 
A special case of Eq. (2.16) obtains when there is no 
graph G, and V; consequently vanishes. This factoriza- 
tion is not exactly correct because of those cases when 
G, and G2 contain the same momentum. The corre- 
sponding errors in the thermodynamic functions are 
discussed in Appendix C with an argument which 
involves the “slight” relaxation of momentum con- 
servation. 

We now use Hugenholtz’s theorem (2.16) to reduce 
e~*# to a sum of terms, each containing factors involving 
only connected graphs. First, let us suppose a typical 
graph in the expression (2.10) contains Z connected 
graphs, Gi, G2, ---Gzx. By induction, from Eq. (2.16), 
we obtain for their contribution to W(£), 


w(G ie 


| Fadfi. 
~ (mi) 
,? 
Blectaat: rm rs 
‘4 ae 1 


—(K+Vi-E,)\ 


'6 Tf either |p’) or | p2’) is the same as |p), the eisai 
graph will be repeated indefinitely in the expansion of both sides 
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We next carry out the sum over virtual states in each 
graph on both sides of this equation and also sum over 
all graphs to obtain 


W (GiG2: - Gz) 


1 
= PR fess f dee ep 
graphs (271) L 


ermine 
1 ‘i - K| 


X {wi(z1)we(s2)-+-wz(2z)}, 


wile) =p =P), 


In w;(z;) we include only terms of a (K+V-—E,) }° 
contributing to G,;, but sum over all virtual states. In 
Eq. (2.17) the sum over all graphs includes a sum over 
all topologically different graphs. It also includes a sum 
for each graph, over the available particle states in the 
states in the states |p) and (p’|. 

To obtain e~## it is necessary only to substitute Eq. 
(2.17) into Eq. (2.8) 


BH| p)= 1E j dz] --- | dz 
(p’\e ?)= ose (mie — f f f i 


(2.17) 
where 





(2.18) 
s—(K+V— 


e BE 
’) 
 E- (et eet- **B1)— K| 
 wy(s1)we(go)* ++ 


wy(Zz). 
Then 


(p" |e |p) 


e BL E- ~(zitzet >>> 
“Er fee sR?) 
SV ties ex baeh- +n) =i 
1 
an - fase Bais (Z,)+- — f ane Belqy (Zz). 
2ri Qri 


Now define the quantities 


1 
gi=— | dz,eF*w,(2,), 
2m 


(2.19) 


where i refers to a graph of the 7th type, defined by its 
topological structure. 
With these definitions, the above expression reduces 


to 
(p’ |eP4| p)=(plePX| p){ D0 gige-+ gz}. 


graphs 


(2.20) 


of Eq. (2.14) in powers of Vi and V2. This omen no difficulty, 
since we may replace V; by g:Vi and V2 by g2V2 and equate cor- 
responding powers of g; and gz on both sides of the equation. 

17 W. Riesenfeld and K. M. Watson, Phys. Rev. 104, 492 (1956). 
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This equation represents the fundamental result of this 
investigation. It achieves the stated goal of reducing the 
interaction of many particles to a sum of products, each 
factor describing the interaction of a much smaller 
number of particies. 

Before discussing a number of applications of this 
result, we discuss Eq. (2.20) in somewhat greater detail. 
Each factor g; represents a contribution to e~6” from 
a single connected graph. We may suppose this graph to 
involve the scattering of r; particles. Then g; is a matrix, 
(ky'ho! ++ +k,!| g;| Riko: ++), 


leading in general to a change of state for each of the r; 
particles. This matrix is 


(ki’:- ere “+k 
1 


——_——__)... 


dze~#? (o"' =, 
ri s—(K—E,) s—(K—E,) 
1 
—(K—Ep,) 


, - 


The last factor may of course be simplified, since 
[s—(K—E,) }"|p)=2"| p). 

The sum over “all graphs” in Eq. (2.20) implies first 
a sum over all (ky'ko’:+-k,’|g;| kiko: ++k,y) for each gy. 
In addition to this, we must sum over all topologically 
different graphs. To clarify notation on this point we 
use a symbol G to denote the topological structure of a 
graph. Then a ‘“‘sum over G” implies only a sum over all 
topologically different graphs. This kind of summation 
is then mot equivalent to the sum over “all graphs.” 
Only when a sum over all (4,’---k,’) and all (R:---k,) 
is also performed is the “sum over all graphs” complete. 


III. EVALUATION OF THE GRAND 
PARTITION FUNCTION 

We shall now derive an expression for the grand par- 
tition function using the basic result given in Eq. (2.20). 
This is the same problem solved by Bloch and deDo- 
minicis and, indeed, our result is similar to theirs. We 
shall treat separately the two cases of Fermi-Dirac and 
Bose-Einstein statistics. The two discussions are, how- 
ever, quite similar, as are the final formal expansions in 
terms of connected graphs. 


A. Fermi-Dirac Statistics 
The grand partition function is 


b=Trlew%e-6# ], 


(3.1) 


where a is the chemical polential. In performance of the 
trace operation (Tr[---]) a sum is carried out over 
all number of particles V. The essential simplification 
of the grand partition function is now accomplished by 
the use of Eq. (2.20) for the diagonal elements of e~8” : 


b= Tele OT (gre -gs) 


graphs 


(3.2) 
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For the operators N and K we use the familiar ex- 


pressions 
N => ar Nk, K= k €kNk, 


with n,=a,'a,. For Fermi-Dirac statistics, of course, 1; 
can only be zero or one. 

Only the diagonal elements of e~§” are involved in 
the trace of Eq. (3.1). Then, according to the discussion 
of the preceding section, only the diagonal matrix ele- 
ments of the g; (diagonal graphs) are required in Eq. 
(3.2). A typical diagonal g then has the form 


g= DL alkim---;stu---)[(1—n,)(1—m,)-- +] 


ttu--- 
XL ++). (3.3) 


This expression follows directly from Eq. (2.21) on sub- 
stitution of the explicit expression, Eq. (2.4), for the 
interaction V. Between each interaction a sum over 
intermediate states s, 4, wu, --- is introduced, and this 
gives rise to the sums over intermediate states stu: -- 
in (3.3). The states klm--- are the initial states which 
have been suppressed on the left side of (3.3). The coef- 
ficients a(klm-- +; stu---) involve energy denominators 
and matrix elements of the potential. Their form is not 
important for the general discussion of this section. 

Equation (3.2) may be written in a more explicit 
form as 


LX ILexpl(a—Bex)me l{gige: - + gr}. 


nmkni---+ graphs k 


a= (3.4) 
The summation over ,4”;- -- means that the occupation 
number for every state assumes the value zero or one. As 
a first step in evaluating Eq. (3.4) we insert the appro- 
priate expressions (3.3) for the graphs but, for the 
moment, do not carry out the sums over intermediate 
states s, t, u, ---. Instead, we first carry out the sum 
over the n,’s. To do this, we observe that each factor 


1 eb 
Ci(kike: + +k,) =— [aC baker ke (fV)- 
wi 2? | 


Here Ey=e(k:)+ (ko) +--+ -€(k,). The quantity f is an 
operator acting only on the V to its right; a typical 
matrix element of fV is 


(kl| fV | mn) = fi fi (Rl| V | mn). (3.9) 


If k (or /) is a virtual momentum state, then f,+ [or 
fi] is chosen. If k (or 2) is a member of the set 
(ki---k,) then fi [or fi] is used. Equation (3.8) 
provides a formal expression of the content of Eq. (3.6). 

To repeat, each intermediate state s in a graph is 
weighted by a factor f,“ (the probability that s is 
empty), while each initial momentum state is weighted 
by fi (the probability that & is occupied). This 
feature was obtained previously in the work of Matsu- 
bara® and by Bloch and deDominicis.” 

To summarize, we have obtained the following ex- 
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exp[ (a—ex)nx | falls into one of three classes. If the 
state k does not occur in any graph, 


> im e(aBee) mk = [J + e(aBee) |. 

Nk 
If k occurs somewhere as an initial state, it has, ac- 
cording to Eq. (3.3), a factor m;. Thus the sum is 


>» e(a-Ber) ky), = ofa Ber). 
nk 


For k occurring as a virtual state, there is now a factor 
of (1—n,) [again referring to Eq. (3.3) ]. In this case, 
then, 

Dd efe be) (1 —n,) = 1. 


nk 


Taking account of these results we may extract from 
Eq. (3.4) for 3 the factor'® 


do=[[[1t+e* ]=Trlew***], (3.5) 
: 


Of course corrections have to be made to Eq. (3.5) for 
states occurring in the graphs. When this has been 
done, it is seen that each summand in (3.3) is replaced 
by 


a(kjlm,+ ++; slurs Lf. P fr? +++] 
XA f+], (3.6) 


where 


fet ) =[1+e Bex) | < fx =[1 + €44-© ] ™ (3 7) 


The factors f;~ and f;, are simply the probabilities 
that state & is occupied or empty for a Fermi gas. Note 
also that fx? +f? =1. 

Referring to Eqs. (3.3) and (2.21) we see that, when 
the sum over the m,’s is completed, each g; in Eq. (3.4) 
is replaced by a quantity 


1 | 
————{f¥)-+>(f0 iby: - A) 
2— (K—E,) 


a 


pansion for the grand partition function in terms of 
connected graphs (or linked clusters) 


ga=3. 3 (Ci-«-Cz) 


graphs 


(3.10) 


As described in Sec. II, the sum over graphs is to be 
done in two parts. A given graph, defined topologically, 
must be summed over all states (kik2---k,). Then a 
sum must be made over all topologically different 
graphs. Define 


VG= > Ca(ki- + -k,), (3.11) 
Aye ++ky 


and let there be Vg graphs of a given type G ina typical 


'*Of course 3o is simply the grand partition function for the 
noninteracting system. 
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term in Eq. (3.10). Now, a given graph G is counted 
Ng! times in carrying out sumssuch as those in Eq. (3.11) 
The reason is that these sums give Vg! terms that cor- 
respond to permutations of the Cq’s among themselves. 
This permutation was already carried out once, how- 
ever, in using Hugenholtz’s theorem (2.16). Therefore 
if a graph G occurs Ng times, its contribution is 
QgN¢/Ne!. Finally, the grand partition function is 
a OgN¢ 
=k H—, 
Ne G Ng! 


which can be written as a simple exponential, 


a= 30 expL>. Og]. (3.12) 
G 
We can also introduce the Gibbs’ potential through 


the equation 
= exp ( — BQ). 


It can be simply expressed as a sum of connected 
diagrams, 
Q%=—B" Indo—B DY Oa. (3.13) 
G 
For convenience, we here rewrite Qg, defined by Eq. 


(3.11), 


1 1 
Qc= DX — fas Bz 
kiss ckr Qart 


1 


X (fakes keV —fV---fV 
| s—(K—-£,) 


To illustrate Eq. (3.14), we write out the first two 
terms (1st and 2nd order in v). Sums are converted to 
integrals by the usual prescription, 


i V 
fel 
k (2r)8 


i= = (9/2)L0/(anyr Ff are f anc] o}an wO-N, 


(3.15) 
Q2=4LV/ ney fare far fam farn 
X (RI| v| mn) (mn | v| Rl) fe fr fa? fa 


X {d~Le*4—1 ]—Bd}, 
d= e+ €1— €m— En. 


In the limit of zero temperature (8 ), 2) becomes 


the energy of the lowest state of the system. From Eqs. | 


(3.13) and (3.14) one can then deduce linked-diagram 
expansions for the ground state energy which are related 
to Goldstone’s formula.! 

19 J. Goldstone, Proc. Roy. Soc. (London) A239, 267 (1957). We 
have been informed by Dr. W. Kohn and Dr. J. M. Luttinger 


that, under certain conditions, Goldstone’s expression is not cor- 
rect, whereas the results derived from Eqs. (3.13) and (3.14) are. 
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B. Bose-Einstein Systems 


We begin our discussion of Bose-Einstein statistics 
with Eq. (3.4), which is also correct in this case. Now 
the sums over occupation numbers must go over all 
positive integral values of the m,, however. In addition 
Eq. (3.3) must be replaced by 


g= > b(kim---; stu--+)L(n,+1)(m+1)--- ] 


8,t,u 


Xmen]. (3.16) 


We again first sum over the n,’s in Eq. (3.4) before 
doing the sums over intermediate state (s, /, “, +-:). 
As before, the factors exp(a—e,) fall into three classes 
according to whether & occurs later in a graph (as an 
initial or intermediate state) or not. For a k not appear- 
ing in a graph, the sum is 


1 
b ee Bek) nk — 


— =),", 
nk i—¢er Fs 


(3.17) 


If k occurs as an initial state, Eq. (3.16) gives 


> ela Beb) ky, = 5,15, —, 


Nk 


(3.18) 


where 
b= 1/(e*—1), (3.19) 


Finally, if & is an intermediate state, we have 


Dd ee Bee) k(n, 4+-1) =[b, YP, 


nk 


(3.20) 


We see then that each term in Eq. (3.16) is replaced 
by 


b(R,Lm,- ++; sjtuy+ ++) Lb. Pb ++ + Lb. Ob +++]. 
(3.21) 
The contribution of the 7th graph is thus 


1 


dz—e#? 
9 


1 
- bV---BV eke.) 
s—(K— Ex) | ‘ 
(3.22) 
Here, in direct analogy to Eq. (3.9), b is a symbolic 
operator which introduces the appropriate weighting 
factors into the above products. For example, 


(kl| BV | mn) = by ‘“*b,*> (R1| v| mn). 


The + sign is used for a virtual state and the — sign 
for an initial (or final state). 
Finally, we introduce, as in Eq. (3.11), 


Qc= LX 


kikg+ + +k 


Cag(ki- ++ h,). (3.23) 


The partition function is given by Eq. (3.12), except 
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that Eq. (3.22) is used for Cg, and now 


do=]] & =][[1—er #4}. (3.24) 
k A 

The entire development parallels that for Fermi-Dirac 

statistics. The only modification which has to be made 

is to replace the Fermi-Dirac weight factors f,‘*) by 

the 5,‘*) appropriate to Bose-Einstein statistics. 

A comment is required for the case of degenerate 
Bose-Einstein systems. Let us suppose, for example, 
that a finite fraction of all the particles is in the lowest 
state k=0. In this case we can treat mo as a large number 
as the creation and annihilation operators for this state 
commute. This permits us to use the Hugenholtz 
theorem to separate graphs even though many graphs 
involve interactions with particles in the state ‘0”’. 

Care must also be exercised in carrying out statistical 
sums, such as in Eqs. (3.18) and (3.20). That is, many 
graphs may involve particles in the state “0” but only 
one sum is to be carried out over mo. First let us set 
not+1~no. Now in a typical term of Eq. (3.4) let us 
suppose that mo occurs w times when all graphs are 
considered. Now, instead of Eqs. (3.18) and (3.20) we 


have 


ps e\@ Beo mong" = [bot ](tio)*, (3.25) 


where fi is the average number of particles in state “0”. 
[ Because mp is assumed large, the average of the product 
is set equal to the product of the averages. ] The quan- 
tity tio, by Eq. (3.18), is just bo. We are thus again 
led to Eqs. (3.22), (3.23), etc., so our conclusions are 
valid for degenerate Bose-Einstein systems. 


IV. NONDEGENERATE GAS 


The general method discussed in Sec. II can be 
applied to ordinary canonical ensembles as well as 
grand ensembles. Thus we now consider a nonde- 
generate gas and evaluate the partition function for a 
canonical ensemble. Let the number of particles in the 
system be V. The states of the system are specified by 
the N individual momenta, | p)= | pi---pw). Thus the 
partition function is 


-€(py) J} 


X DX (gige--- gr}. (4.1) 


graphs 


Z=1/M! > 


Pl-+-PN 


exp{— BLe( pi) t+ €( pe) ie 


This expression follows directly from the fundamental 
result for the matrix elements of e~®” given in Eq. 
(2.20), with the g; given by Eq. (2.21) for the case 
ky’ =h, k,’=ks, etc. 

As before, the sum over graphs consists of two parts. 
First, for each g;, the variables (ki, ko, ---k,) are each 
summed over all momentum variables in the state 
|p)=|pi-:*pw). Then a sum is made over all graphs 
that are topologically different. In taking the trace, one 
finally sums over all (~:---pw). The first sum merely 
duplicates terms that occur in the sum over (1: -- py), 


HECKROTTE, 
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and may therefore be evaluated by simply counting the 
number of terms occurring. 

In the approximation that the gas is nondegenerate, 
each k may take any of the values (1: - -pw). Hence the 
possibility is ignored of more than one particle 
occupying the same state. The sum over all k then gives 


a factor 
N= 


(4.2) 
where r; is the number of ingoing (or outgoing) lines in 
the graph g,. In addition to this factor, the sum over 
(ki---k,) permutes the order of topologically equivalent 
graphs. Thus, we must introduce [as was done before 
in Eq. (3.12) ] the factor 


(4.3) 


’ 
@ Ne! 


where Ng is the number of graphs topologically equi- 
valent to G in a typical term in Eq. (4.1). We recall 
that this factor arises because these permutations are 
already included in the use of Hugenholtz’s theorem. 

For each p; that is not equal to a k; in one of the 
graphs, the sum over in (4.1) gives a factor 


v¥=>d eF, (4.4) 
p 


Next, we introduce 
3g=(N/y) > | exp{ —BLe(ki) +--+ €(k,) ]} 


Xgalki:--k,), (4.5) 


and the partition function in the absence of interactions, 


Zo=(1/N!) YL exp{—bLe(p:)+---€(pw) J}. 
Pl+**PN 
This leads to the following expression for the partition 


function, 
9,NG 
4G 


Z=L6 pe II an ee = Zo expL>_ 3a |. (4.6) 
No G 


7 G Na! 


Here the sum over G implies, as usual, a sum over 
graphs which are topologically different. This is the 
same kind of expansion as given above for the grand 
function, i.e., an exponential of a sum of linked clusters. 
For applications, we require the free energy, which is 
F=—B"|lnZ, or F=—B"InZ,.—8"D je. (4.7) 
G 
For convenient reference, we write in full the expression 


for AG: 


3g=(.V/y)" DX exp{—BLe(ki) +--+ e(k,) J} 


kis+-k 


1 
x— fas z* exp{ —z} 


Qni 


| 1 | 
«(bs ++h,| V-—_———V- - > iy - 
| s—(K—E,) 


‘t) , (48) 
G 
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with 


=[0/(2m)*](2mp-'M)}, (4.9) 


Ey,=e(ki) +--+ e(k,). 


and 


We note the absence of weight factors in intermediate 
states (nondegenerate gas) and the Boltzmann factor 
for the initial states of a graph. 

As a simple application of this result, we evaluate the 
second virial coefficient and obtain the result of 
Uhlenbeck and Beth.” For the second virial coefficient, 
we need consider only those terms in which a single pair 
of particles interact. Let these have momenta k and I. 
Then 


iy te” 


G 


(V/y)? 2, =P slate 


x fa z~* exp(—£z) 


| 
x Ca: ed doe 


1 
ae —-* 
e+ e+e 


os hn), (4.10) 


where only those terms in V are kept which describe 
the scattering of the two given particles. Introduce the 
variables 

P=k+, 


x=k-I, (4.11) 


to replace k and I. Since P is a constant of the motion, 
the energy denominators in Eq. (4.10) become 


ttetea—K=2+T —T, 


where T)=x’/M and T is the kinetic energy operator 
for the relative motion of the two particles. 
The quantity 


1 
R(s)=V+V———_V 
2t+To—T 
1 1 


Pets, 
st+To—-T 2+To— 


which appears in Eq. (4.10), is the two-body level shift 
operator. On introducing a representation (x, /, m), 
where / is the angular momentum of the two particles 
and m is its component along the axis of quantization, 
one can perform the integration over z in a straight- 
forward way”: 

1 


4nB x= 
paca Palle oA *(x| R(z)| «x)= —— >> (21+1) \51(k). 
27 UKM i=0 


(4.13) 


Here 6;(x) is the scattering phase shift for the /th partial 
wave. If the two-particle system has bound states, some 
additional terms are required in Eq. (4.13). 


2 G. E. Uhlenbeck and E. Beth, Physica 3, 729 (1936). 
*1 See, for example, the discussion in reference 16. 
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When Eq. (4.13) is substituted into Eq. (4.10), the 
result is 


er kK 


N: 
> i wal” _ 


G 


X exp( al M)éi(x). (4.14) 
We can now evaluate the free energy, using Eq. (4.7), 
and thus the equation of state, from the familiar relation 


(=) - [—) 
e=—{- — P 
oe 8 


Here & is the pressure and, of course, 8 '=kT. The 
immediate result is the first two terms in the virial 
expansion. 


V. EXPECTATION VALUE OF AN OPERATOR 


In this section we consider the ensemble average of 
operators of the form 


O=>- O,(ay'a,'a (5.1) 


*)(@eGrde°**). 


In this expansion, the vth term is a product of v 
creation and v annihilation operators. The coefficient 
O, depends on the states of these 2v operators and the 
sum goes over these variables also. This expectation 
value of O is denoted by O. For the grand ensemble we 
have 


30= Trl e** Oe ## J. (5.2) 


For the canonical ensemble, this expression is replaced 
by 


ZO=Tr[Oc-** J. (5.3) 


We give two different techniques for evaluating O. 
The first is formally very simple, but leads to somewhat 
more cumbersome expressions to evaluate than does 
the second method. In Appendix A the equivalence of 
the two methods is demonstrated. 

It is convenient to think of O as a generalized scat- 
tering operator with v incoming and y outgoing lines, 
as is illustrated in Fig. 3. Connected graphs may then 
be constructed, as in Fig. 4 by connecting O to graphs 
involving the matrix elements of V. 

In evaluating O, we first suppose that the diagonal 
matrix elements of O either vanish or give a negligible 

q 














q 


v 


‘1G. 3. Generalized scattering operator with v ingoing and v 
outgoing lines. 
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Fic. 4. Simple scattering graph linked to the 
pair-correlation operator O. 


contribution to O. At the close of this section, the con- 
tribution of the diagonal elements is treated separately. 
The reason why the diagonal elements require special 
consideration is clarified in Appendix B by a simple 
example whose solution can be found by other (ele- 
mentary) means 

We define the function, for real X, 


Y(\)=Tr [e* Nem p(H+ 0), 


a 


(5.4) 


For \=0, this simply reduces to the grand partition 
function, i.e., Y(O)= 4. Furthermore, the first derivative 
of this function yields the average value of O defined 
in Eq. (5.2): 


aed, 


—| =Trle*%Oe-*#]= 30. 
dy A=—0 


(5.5) 


—p 


If Y can be determined as a function of A, the problem 
of finding the average value of an operator is reduced 
to once differentiating Y(A). The evaluation of Y (A) is 
achieved by simply taking over the development of 
Sec. IIT, except that V is replaced by 


V’=V+ 0, (5.6a) 


and thus 


H+\0=K+V". (5.6b) 


Of course the connected graphs involve both the gener- 
alized O interactions and the pair interactions V. 

To be specific, the contribution of a particular type 
of graph Qc(A), defined by Eq. (3.14), must now be 
considered a function of \. Of course these quantities 
reduce to the previous functions for \=0: 


0o(0)=Qe. (5.7) 
Again, in analogy to Eq. (3.12), we obtain an exponen- 
tial form for Y (A), 


Y (A) oat by expL> Qe(A) }. (5.8) 
Therefore, 
dY(X)| 


A=0 G h=0 


dQ«(A) 


and the average value of O is 


dQg(A) | 


lA=0 
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To obtain O for the canonical ensemble, we may 
carry out a similar analysis and consider 
X (A) = Tre PF +90) 7, 
Now X(0)=Z and 


(5.10) 


= Tr[Oe-## ]=ZO. (5.11) 


= 
In analogy to Eq. (4.6), 
X(A) =Zo expL>_ G 3a(A) ], 


where 3¢ is the quantity defined by Eq. (4.5), but with 
V replaced by V’. Differentiating, one obtains 


(5.12) 


dX (Xd) d3a(X)| 


dx A=0 j dd A= " 
The average of O is thus 
D d3a(A)| 


6=-#° 5 — 
G dx 


(5.13) 


i\A=0 


Before discussing the contribution of the diagonal 
elements we present an alternative procedure for evalu- 
ating O. We first recall the basic expansion in graphs 
for a matrix element of e~®”, which is given in Eq. 
(2.20). In evaluating the average value of an operator, 
we are concerned with matrix elements of the form 
(p|Oe** | p). The direct generalization of Eq. (2.20) is 
then simply 


(p|Oe-*| p)=(ple-P"|p) 


all graphs 


[Ogi-+-gr]. (5.14) 
The “sum over all graphs” has the following meaning 
in this situation. We refer to the vth term of the sum 
in Eq. (5.1), which goes over all the states of the 2» 
operators (d,y’'a,/ta,/t+ ++) (a a,a,:++) in O. In perform- 
ing this sum we obtain all possibilities of connecting O 
to the original graphs. The graphs g which are con- 
nected to O in this way may now be separated from the 
remaining factors in Eq. (5.14) with the result 


2 [en--gz), 


all graphs 


(p|Oe*# | p) = (p| e-PX | b)x (5.15) 


where 


x=(p ee [Ogi---giJa\p). (5.16) 


all graphs linked to O 


In Eq. (5.16) each of the graphs gi---gz is linked 
directly to O. The sum includes all connected graphs 
involving matrix elements of O. Since O has v incoming 
lines, we must always have L’ < p. 

In performing the sums over graphs, we sum inde- 
pendently over the graphs of x and over the graphs of 
the last factor San grapns[gi-**gr] in Eq. (5.15). 
Therefore, 


Trle**%Oce-®# ] = 3x, (5.17) 
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where 


g=Tr Y  [0C:::-Crlel. (5.18) 


all graphs 
The C’s were defined in Eq. (3.8) and fO contains the 
weight factors f~ appropriate to the outgoing lines 
of O. Thus, as in Eq. (3.9), the matrix elements of fO are 


-| fO|grs---) 
= fy Ofr' Y fe) ° -(q'r's'- — |O|grs: ° -). 


For Bose-Einstein statistics the operator f in Eq. (5.18) 
is replaced by 8, defined in Eq. (3.22). 

An alternative way of writing Eq. (5.18), valid for 
either Fermi-Dirac or Bose-Einstein statistics, is 


Ff 


(qrs:: 


2 1 
O= a= Tr| —ent-08 DR [ Ogi: A lo} (5.19) 


50 G 


The sum over graphs here is for all graphs linked to O. 
This expression makes use of the notation of Bloch and 
deDominicis. 

We have previously indicated that the diagonal parts 
of O have to be treated in a special way. This may be 
illustrated with the particular example 


O= Da'a(q’ 'O! gay ‘ay, 


i.e., With an operator which has just one ingoing and 


one outgoing line. The leading term in x is clearly 
Da fa (gO! 4g). (5.20) 


This does not conform, however, to the general result 
of Eq. (5.14). The correct result is obtained by omitting 
from the sum over g in Eq. (5.20) those terms occurring 
in intermediate or initial states in the [gi---gz ]. 

The same conclusion may be reached more simply 
the following way. Let us separate from O a typical 
diagonal term, i.e., one for which some ingoing line 
equals some outgoing line. Then Eq. (5.1) may be 
rewritten as 


Oa= dog O(g) Ma, (5.21) 


where we have simply contracted the two operators 
referring to the same state and incorporated all other 
factors into O(g). Now, 


30. = Trie* v( die “BH | 


(5.22) 


d 
=> has ie Ogher hi 


qa ¢ A=0 


which leads us to define the following function of g 
andj, 
A(q,A) = Tre \ tAna()( qe onl * (5.23) 
This quantity may be evaluated by either Eq. (5.9) or 
Eq. (5.18). The new feature implied by Eq. (5.23) 
2 If O(g) itself contains diagonal elements of importance, the 
procedure described here must be applied again. 
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is that, in the evaluation of the diagrams of A(q,\), 
a is replaced by (a+A) in the weight factors f,‘*?. 
Carrying out the differentiation in Eq. (5.22), one 
obtains the following formula for the average: 
OA(q,A) | 
Oa= fi) : : - — (5.24) 
qa A=0 


To illustrate this result, we consider the very simple 
example, 
Osa=N=)og te: 
Now, 


a | 
2 a [eeN tine }| =—er 
a OA 


A=0 da 
so that the familiar result is obtained by this procedure, 
aN = 03/da. 


For a nondegenerate gas, we use the canonical en- 
semble (Sec. IV) and, by analogy with Eq. (5.18), write 


0=x= > ¥ ( ) 
graphs ky-+-kr NY 


X (ki: : -k, (Og: ‘eu le ky: : ‘k,). (5.25) 


In conclusion we note that the final formulas for the 
two procedures described in this section, Eqs. (5.9) 
and (5.18), are quite different. It is shown in Appendix 
A that these actually give the same result. In Appendix 
B we discuss in more detail the source of the compli- 
cations that occur when O has diagonal elements. 


VI. PAIR-CORRELATION FUNCTION FOR A SYSTEM 
OF INTERACTING FERMIONS 


We now illustrate the above discussion of expectation 
values with the example of the pair-correlation function. 
We consider a system of interacting fermions and, in 
doing so, go beyond the familiar results for noninter- 
acting particles.?’ On the other hand, we treat the inter- 
actions only in terms of the simple scattering graph 
shown in Fig. 1, and give a definite numerical result 
for the nondegenerate case. The reason for these re- 
strictions is that we are now primarily concerned with 
illustrating the formal procedures discussed in this 
paper. This particular example has been studied with 
other methods by Blatt and Karplus and Watson.” 

We define the pair-correlation operator as 


1 
O(rr2) = + 


(NV?) ay pp’ .aa’ 


exp{iL(p—p’)-n 


+(q—q’)-r, ]ay'a,'a,ay, (6.1) 
3 See, for example, the discussion by L. D. Landau and E. M 
Lifshitz, in Statistical Physics (Pergamon Press, New York, 1958), 

paragraph 117. 
* J. Blatt, Nuovo cimento 4, 430 (1950). 
25 R. Karplus and K. M. Watson, Phys. Rev. 107, 1205 (1957). 
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where (.V*), is the average of the square of the number 
of particles. As discussed in detail in the previous 
section, the diagonal part of © will have to be discussed 
apart from the nondiagonal part. The diagonal part of 
0 1s 

1 


O4=— 
(N? Te 


~ b(q,p)nny, (6.2) 


where 


b(q,p)=1—exp[i(p—q)-r], (6.3) 


and r=1,—fo. We note b(g,g)=0, so that the sum in 
Eq. (6.2) involves only the terms for which p ¥g. 

In treating the diagonal operator Oz we recall the 
pope of the preceding section, particularly Eqs. 
(5.22) and (5.23). In this case we have a sum of products 
of two numbers operators and the expressions are only 
slightly more complicated than in Sec. V. In this case 
we introduce the function 


A( p,q 3 ApAg) = 50(P,93 ApAg) 
XexplL G Valp,9 ; A pAg) J, 


and evaluate the average value of Oz with the formula 


(6.4) 


Na) my = b( p,q)- 


Or pq 


logA( p,q; ApyAg) | Ap=0, Aq =0. 
(6.5) 


The evaluation of A(p,g;A,,A,) follows directly the de- 
velopment in Sec. III for the grand partition function 
[ Eq. (3.12) ]. The only essential change is the replace- 
ment in the statistical factors f,*), [Eq. (3.7) ] of a by 
(a+A pb; ptAPxq) ° 


Se (P95 XpAg) [1+exp{Bex— (atApdip 


+ big)} F', (6.6) 


fe (2,95 ApyAq) = 1— fe (P75 Ap Ag)- 
Thus the function 40(p,g;A,,A,), which occurs in Eq. 


(6.4), is 


30(P,93 Ap Ay) =O fe? (p,q; Ap Ag). (6.7) 
k 


In a similar way the contribution Q¢@(p,g;Ap,Aq) of a 
graph of type G is evaluated from Eqs. (3.10) and (3.8) 
by using the new statistical factors of Eq. (6.6). 
Finally we note that the differentiations in Eq. (6.5) 
lead to an average value for Og which consists of five 
terms, 


0” by Ja.\ 0 
Oa) v= = b(b,9) aw ( )\—¥ @. 


OA pA, OpT Org & 
Oa0\ O | 
+( =~) DX Oa t+ -¥ 00) 
Og Or p P an p G 


0 0 
x (— ye Ou) + :® 0e| 
Org 


Dp, @ 


Ap=0, Ag =0 


HECKROTTE, 


AND WATSON 
In this expression the dependence on (f,g,A,,Aq) has 
been suppressed inside the square brackets. 

In evaluating the function A(p,qg;A,,A,) as the ex- 
ponential of a sum of topologically different graphs, we 
now make the main assumption of this section, which 
is to consider only the simple scattering diagram of 
Fig. 1. Its contribution Q, was evaluated in Eq. (3.15), 
and now becomes 


8 
O29; ApAg) = 5% Melee) 
lk 


X fe (P1953 Noa) fv (G5 Nvda). (6.9) 
To complete the evaluation of (Oz), we need only 
perform the differentiations indicated by Eq. (6.8) on 
39 and Q,, which are given in Eq. (6.7) and Eq. (6.9), 
respectively. Without going into further detail, the 
final result is 


Hp Lk fee 4-8 E expl ik] 
(N?)m @ 


(Oa Dav — i 


X (ki v| RI) fk O fir P fr P. (6.10) 
The statistical factors in this equation are the original 
ones given in Eq. (3.7), since now A,=A,=0. The con- 
tribution of the fifth term in Eq. (6.8), i.e., the term 
containing (0°/dA,0A,)>_ ¢ Oe, has been dropped since 
it is smaller than the others by a factor of 1/0. In the 
derivation of this result it has also been shown 

y= ~LLEh' fie fi‘ 


—B > (kl \ v| Rl) fe P. (6.11) 
kl 


In the limit of 0 — @ one also verifies (N?)y=(V)*w. 
Finally, for complete degeneracy, Eq. (6.10) reduces to 
the familiar formula 


(Oa)w=1—[ - é tar P/L x iy. 


q<PF q<PF 


(6.12) 


We next evaluate the contribution from the nondiag- 
onal part of the pair-correlation operator. We shall use the 
second method of calculating average values described in 
the preceding section and summarized by Eq. (5.18). The 
only nonvanishing graph involving a single scattering is 
illustrated in Fig. 4. In accordance with the discussion 
in Sec. III, the contribution of this diagram is obtained 
from 


1 
i(kl) =— fos 2? exp(—z) 


2ri 


vit) (6.13) 


where the corresponding graph is in Fig. 4. When Eq. 
(2.4) is substituted for V and Eq. (6.1) for © and the 


«(al catia 
2—(K— Nh €1) 
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contour integration is carried out, this becomes 


C1(kl) =¥ b(Rl,ml) (| 0] RD) fe fr fim fx? 


mn 


X (€m+ €n— &— €1) 


X[exp{—B(emt+en—€—€:)}—1]. (6.14) 


Here 6 is simply the coefficient of the operator appearing 
in the definition of © in Eq. (6.1), 


b(k’q’ ; kg) =exp{il (k—k’)-11+(q—q’)-12]}. (6.15) 


The last factor in Eq. (6.14) comes from the contour 
integration. The complete nondiagonal contribution is 
then obtained by summary over k; and ko: 
(Ond)w=—2 D b(kl,mn)(nm|v| kl) 

kimn 


XK fem fr fa fa (emt en— e— €1)71. 


We note that when these last summations are intro- 
duced, the two terms of Eq. (6.14) give equal contri- 
butions. This may be shown with the help of the relation 


Sr f m*" , exp —B(€m— ex) = fet i m' ’ 


and by relabeling the sums in Eq. (6.16). Finally, this 
expression may be simplified by using Eq. (2.5) for 
(nm|v| kl) and Eq. (6.15) for b(kl:mn), and by replacing 
sums by integrals in the usual way, 


(Ona) = — (02/(N2)q)2(2mr)-9 i dk f d'] f dm f din 


X6(k+1—m—n) fi fi fin fn 
Xexp[—i(k—m)-r](€m+€n— €:— €1) 


X[w(k—m)—w(k—n)]. (6.16) 


Here w represents the Fourier transform of the potential 


w(q)= far v(rje7'4*, 


The complete expression for the pair-correlation 
function is the sum of the diagonal and nondiagonal 
contributions, i.e., the sum of Eq. (6.10) and Eq. (6.16). 
We shall now obtain a quantitative estimate of the 
pair-correlation function by making the further ap- 
proximation of nondegeneracy. Thus the statistical 


v 


, 


(6.17) 


factors are now 


Se => e* Bek 


fem. 


In addition we shall ignore the exchange part of the 
potential. 
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The diagonal contribution, Eq. (6.10), now assumes 
the form 


(0a(r) w= 1— (V?/ (V)} 2(2m)-8 f d¥y 


Xexpl—iq-r+a—y"q"] 


~pw(0) f ak f dl 


2 


expt —she 20-744 P)}] , (6.18) 


and Eq. (6.11) becomes 


(N) w= ((N?) ay)? = | 22%) fay exp(a—‘y’q’) 


— pw(0)(2n)-8 f Pk f @'l 


xexpl2a—4(e+P)}}, (6.19) 
The length y is defined as 


v= Bh? 


2m). 


The square brackets in Eq. (6.18) is the same as that 
in Eq. (6.19) except for the dependence on r. But this 
dependence may be removed by completing the square 
in the exponentials in Eq. (6.18), and leads to a factor 
exp(—r’/4y*) for each bracket. Thus Eq. (6.18) is 
simply 


(Oa(r))w= 1—exp(—7r?/2y’). (6.20) 
The second term is a quantum-mechanical correction 
arising from the repulsive effect of the exclusion prin- 
ciple. It is important only for particle separations of the 
order of the deBroglie wavlength, i.e., for r~y. 
Returning to the nondiagonal contribution in Eq. 
(6.16), two integrations can be done immediately with 
the aid of the transformation k=x«+k’, l=x—k’, 
m= «’—k”, n=x«’+k”. The result is 
= — (U2/(N*)y) By exp(2a)[ (2x)? }"7. (6.21) 


( 0 nd) Av 


J= fas u(s) f ok exp(—277k?—ik- x] 


eik’-x 
x f ak! ——. 
kk 


We have used Eq. (6.17) for the Fourier transform of 
the potential and put x= r—s. The integration over k’ is 


(6.22) 


D 


Dri p ek'z 1 
= ff dk’ k’— -1(2n)?-(ei##+e-*) 
x Jy bh * 


where the contour corresponding to the principal value 
has been chosen. The integration over k can also be 
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carried out and the final result for J is 


v(|r—x]) 
I= (2m)7*(2y) fas exp(— 27/277) 


x 


(6.23) 


If we are interested in correlations for separations con- 
siderably larger than y, we may regard v as constant in 
doing the integration and obtain a closed form for the 
integral : 


I= (29)**(4y)~0(r). (6.24) 

The complete calculation of the nondiagonal con- 
tribution also requires evaluation of the chemical poten- 
tial a from the total number of particles. Neglect of the 
first-order effect of the potential in Eq. (6.19) leads to 


e~*= (U/(N)w)2(2x!y)-. (6.25) 
Collecting all these results, one finds the nondiagonal 
contribution to the pair-correlation function to be 
simply v(r)/kT. Adding this to the diagonal part for 
r>>y, one has for the pair-correlation function 


(O)\w=1—2(r)/RT. (6.26) 
This is just the classical result and it serves to verify 
the techniques introduced in the preceding section. In 
addition it should be noted that the result of Karplus 
and Watson” for the nondegenerate case may also be 
shown to lead to this result. 

In conclusion we should like to point out that the 
entire discussion of this paper refers to general anni- 
hilation and creation operators satisfying the commu- 
tation relations for Fermi and Bose statistics, and not 
just to those referring to free-particle states. They may, 
for example, refer to collective variables such as those 
introduced by Bogolubov”* and Sawada.*’ 
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APPENDIX A 


We demonstrate here (for Fermi-Dirac statistics) the 
equivalence of the two methods given in Sec. V for 
evaluating O. An analogous demonstration can be given 
for the Bose-Einstein case. The contribution of a 
typical graph to Eq. (5.18) may be written 


1 fo fi fs 
= fagiie: ©, row gu Sree | 
2ri Z st+Ao.1 stAos 


(A.1) 


26 N. N. Bogolubov, Nuovo cimento 7, 794 (1958). 

27 K, Sawada, Phys. Rev. 106, 1372 (1957). 

*8 David J. Thouless, University of California Radiation Labora 
tory Report UCRL-8696, March 23, 1959 (unpublished). 
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where 

Av, :=Eo— Ei, (A.2) 
and 


Q= (0/0}1)(1| V|2)(2{V{3)---(J| V0). (A.3) 


The label zero refers to the initial state, and the inter- 
mediate states are enumerated with the integers from 
1 to J. In the carrying out of the comparison, sums are 
not made over the intermediate or initial states in this 
-xpression. The factors fo, fi: +: fy represent the appro- 
priate statistical factors f‘*?. 

The analogous term in Eq. (5.9) is 


1 fo fi 
—BT =— |dz ed (=)(= : ). Be: 
Qri 2? stAo1 
x()}. wo 
2t+Ao,s 


In addition to this, there are J other terms obtained by 
cyclic permutation of the factors in Q. A typical permu- 
tation of Eq. (A.4) is 


re 1 fa-u' 
—BTo=— dz e—F — 
2ri s 


fa-u2 
x cata — 
2+As_141,54142 
fa 
2tAys, l+1,J 
x (0|O0}1)(1| V|2)--- 


x (J—1| V|J—1+1). 


l 


(J-141| V|J—/+2) 


(J~14-2\V|J—i+-3):-- 


fo 
0)—— 


(J V'0)- --— 
s+Ay+141,0 


x 


(A.5) 


Now, the matrix elements of V and O in this expression 
may be combined and set equal to Q, defined by Eq. 
(A.3), so that one has 


1 1 1 
—BT =Q— fe e Bs f(t) fC) a 
2ri 2 = s+Ay—41,7-142 
1 


S-Ag17=4 


TEED i (A.6) 


If the new variable, 
2'= stAysy 14+1,0 (A.7) 


is introduced, the energy denominators in Eq. (A.6) 
may be rewritten as 


1 1 1 1 1 
3! o/+Ao1 2/+Ap.» o/+Ao.s4041 3’ +Aos4 +1 
1 1 


Yee . (A8) 
s’+Aos41 a’ +Ao.7 
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At the same time, the following equation holds: 
e782 = e— Bz’ eB(Es+iti—Eo) (A.9) 
Because we have 
fF ela—Bek) = f, (4), 

the second factor in Eq. (A.9) may be used to convert 
the fi’s in Eq. (A.6) into the fo--- fy of Eq. (A.4). 

Finally, the complete term associated with the second 
method is 


Fi 1 
ToO=— 1 oe fs l— dz 
3 PLofe--fL— f 


l=0 


1 1 





1 
xen — ck a 
3? 2+Aor 2+Ao,s 


1 1 1 
pA (s+Ao,1)? 2t+Ao,s 





a: 4 1 
set oisel 
z 2+Ao (s+Ao,s)? 


This may be rewritten as 


BLO fo: - Je fo om Si : i ' 


dzz2+Ao1 


1 
x——}, (A.10) 
stAo,s 


which, after partial integration, is simply 7. We have 
thus shown that the terms of Eq. (5.9) combine in 
groups to give the terms of Eq. (5.18). 


APPENDIX B 


We attempt to clarify here the complications which 
arose in Sec. V in handling diagonal terms in the 
operator O. For purposes of illustration, we consider 
the evaluation of the grand partition function for the 
Hamiltonian 

H=K+V, V=Xx beng. 
Here each 6; is a real number representing the ‘“poten- 
tial energy” of a particle with momentum &. This 
example is chosen because the grand partition function 
may be evaluated directly to give 


3=[][1+expfa—B(ato)} 1. 


(B.1) 


(B.2) 


We now evaluate 4 by the methods of Sec. III, using 
Eq. (2.20), which now takes the specific form 


> IL eatk,k). 


li,lga--+ k 


e-bH = ¢~BK (B.3) 


Here, for /;,=1, 2, 3, --+, we have 


1 
g(k,l.) =— - 


2mi 


fa exp(—B2z)Z~-("#,"*n,., ~— (B.4) 
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and for /,=0, 
g(k,O)=1. (B.5) 
Equation (B.3) is easily verified directly, since we have 
21 g(k,l) =e bmn, (B.6) 


Now, following the notation of Sec. III, one obtains 


x TICws), (B.7) 


ly,le,-+- 


b= Trl e*%e-F4 |= do 


where (for /,=1, 2, 3, ---) 


71 


1 
Clb) =—— fa exp(—Bz)Z— (HF) By" f, Tt (B.8) 


For Ly = 0, 


C(Rkl.) = ‘# 


Then sum over these factors is easily carried out, 


(B.9) 


¥ Ck) =14+ fk-[e-#*—-1]=14+L,, (B.10) 
l= 


where L; is simply the second term of the result. 
Equation (B.7) is clearly equivalent to Eq. (B.2), since 


[1+exp(a—Bex) [1+L:. ]=1+exp{a—B(ect+d:)}, 


the first factor on the left comes from the 40 in Eq. 
(B.7). 

To see the relation of this development to the ex- 
ponential formula of Sec. III, we consider 


WlitJ=14+ Let SE Lele 
k kl <k!! 


5 SE alert: 


kh’ ck! '<k’ 


(B.11) 


Now, if terms such as L,? are negligible, we may write 
this as 


1 
Uf1+4:J=14+d L+—(E Li)*+-:- 
- 2s (B.12) 


a exp(doe Lx). 


This shows the relation to the graphical analysis of Sec. 
III, since each L; is a sum of connected graphs. The 
factorials here arise just as they did in the sums of 
Sec. III. 

The error in Eq. (B.12) may be seen by examining 


Inf +L.) ]= ¥ In(i+L) =CH[b+(L2)).  (B.13) 


For 

pan LPKD Lay 
the expression (B.12) is valid. This condition is satisfied 
when the 4; are very small. The corresponding case in 
Secs. III and V obtains when the matrix elements of V 
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and O are individually very small (for example, of order 
U"!), a finite result being obtained only on performing 
a sum. 


APPENDIX C 


It was mentioned previously in applying the Hugen- 
holtz theorem that it is convenient to suppose that 
momentum conservation is violated by a small amount 
at each interaction. This effectively decreases the 
likelihood that particles in different graphs will have 
the same momentum. It also means that we may ignore 
the possibility that two lines in the same graph refer 
to the same momentum state. This is not at all an 
unphysical assumption because a small violation in 
momentum conservation cannot have a macroscopic 
effect. To see this, we need only recall that two particles 
interacting in a real gas always do so in the field of 
other particles, which then receive some momentum 
transfer from the two colliding particles. 

We shall now illustrate this idea with a definite model 
for the relaxation of momentum conservation. The 
usual 6-function, in a box of volume UV, satisfies 

lim 6(P’—P)=/(2r)*. (C.1) 


P’—P 


We replace this by a new function A,(P’—P) which 
depends on a parameter a and is normalized to unity, 


for A,(I) =1. 


The matrix element of the interaction between two 
particles is now written 


v= v9(k’—k)A,(P’— P) (C.2) 


where v% is a function of the relative momentum k of 
the colliding particles and P is the total momentum 
(and similar primed variables after the collision). The 
corresponding coordinate-space representation is 


b= 8(r’—1)5(x—x’)(d0(r))n(Aa(x)) wv. (C.3) 


Here r is the relative—and x the center-of-mass—coor- 
dinate, and (do), and (Aa), are the Fourier transforms 
of v and Ag, respectively. 
For example, we might choose 
A.(P’— P) = (2rd?)-! exp[— (P’— P)?/2d? 
+i(P’—P)-a], (C.4) 
which has the transform 


Aa(x) =exp[ — (x—a)*d?/2]. (C3) 
With this form, the interaction between two particles 
is important only if their center of mass is within a 
distance d~' about the point a. Thus a linked-cluster 
now involves only particles interacting in the neigh- 
borhood of a. This is consistent with our physical inter- 
pretation of momentum being transferred to other 
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particles near a. Finally an appropriate sum must be 
carried out over all regions of size d~* at all points a 
within U. We may think of this as being an average 
over all local fluctuations in the force-field in which 
clusters interact. 

We define 


H= (2nd?)-}, 
so that 


A.(0) =H. (C.6) 


Equation (C.6) should be compared with Eq. (C.1), 
noting that H has the dimensions of a volume, It is 
important to assume that 


HKU. (C.7) 


We may write the matrix element 1 as 


(2m)? 


vo™ TH, 
\2 


(C.8) 


where yw is the average interaction energy of two par- 
ticles and 7 is the corresponding ‘interaction volume.” 

Consider now a diagonal graph with a number v 
interactions v. Let the’ingoing lines be “tied” to the 
outgoing lines. There are then 2y lines and v 6-functions 
(or A,-functions). If there are g external lines, (ingoing 
or outgoing) there are 2y—g integrations over virtual 
states. One of the 6-functions (A,-functions) is re- 
dundant. Integration over the remaining (v—1) 
6-functions (A,-functions) leaves v—q+1 free inte- 
grations, each one giving a factor 


VD 1 


~w—- — 


(Qn) r 


(2x)? 


Furthermore, there are (v—1) energy denominators, to 

each of which will be assigned some average energy e. 
If momentum is conserved exactly, i.e., if the normal 

6-function is used, this graph gives a contribution, 


OM 
HYIOO 


If we sum over all N possibilities for each of the g¢ 


incident lines, we obtain for the contribution of the 
graph under consideration 


By’ (Nry2t 7/0 
cvend’ CVC) 
€ x 6) T (C.10) 


=IN, 


where J is of order unity. 
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On the other hand, if we relax momentum conserva- 
tion and use the A,-function, the above estimate is 
changed only by the replacement of U in Eq. (C.9) 
with H defined by Eq. (C.7). The resulting contri- 
butions would be 


H H 
Go =—Go, G@=-—G. 
V V 


(C.11) 


The superscript a indicates a quantity calculated with 
the A,-function. 

On summing over all points a of volume H, as men- 
tioned above, an additional factor U/H is obtained in 
Eqs. (C.9) and (C.10). This is in agreement with our 
assertion that a small violation of energy conservation 
cannot affect the macroscopic properties of the gas. 

If one virtual integration is suppressed in the graph, 
a factor of U/r must be removed from G and G™, with 
the result, 


GHD) = “Ge, G' D=—G. 
V Vv 


(C.12) 


Consider next the contribution from two discon- 
nected graphs G; and G2, each of which has one line 
describing a virtual particle in the same state ». When 
we sum over p and use a 6-function, the two graphs 
give a contribution which is the same order as obtained 
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from a single graph: 


UV 
(Gi1G2)4~=—— feste DG. D7] 
(2r)8 . 


(C.13) 
=[°N2(r/V)2(V/1) 


=]?(Nr/V)N. 


With the A,-function, we find, however for the same 
quantity 


(Gi G2 w= (H/0)7L7(N7/V)N. — (C.14) 
When we sum over “regions” a, this is multiplied by 
U/H. The result is then of order H/V times that of Eq. 
(C.13). From this we see that the two graphs give a 
negligible contribution to the thermodynamic functions 
if the A,-function is used. That is, the relaxation of 
momentum conservation permits us to neglect the pos- 
sibility that two or more virtual momenta have the same 
value. 

The important practical consequence of this result is 
that all graphs must be evaluated just as if no two lines 
referred to the same state k, each getting its proper 
statistical factor of fi.“ or f,*. In this way the 
Hugenholtz result can be used and linked-cluster ex- 
pansions obtained for the thermodynamic functions. 
Finally, these expansions will give the same results as 
those obtained by Thouless.”* 
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It is shown that the pseudopotential method can be extended to yield further terms in the low-density 
expansion of the ground-state energy of a system of Boltzmann or Bose particles with hard-sphere inter- 
action. Two terms beyond the known result are found, and the expansion is no longer a power series in 
(a*p)*. Other related properties of the system are discussed. 


1. INTRODUCTION 


LTHOUGH it is an old problem to investigate 

the properties of a system of a large number of 
particles interacting pairwise, there exists only a few 
explicit answers for the quantum mechanical case. In 
the case of a system of identical bosons interacting 
through the two-body hard-sphere potential, virtually 
none of its properties were known until the recent work 
of Yang, Lee, and Huang.’ In the investigation of 
these authors, two independent methods have been 
used. One is the method of binary collision,’:*.’ where 
the evaluation of the grand partition function is reduced 
to the solution of the two-body problem. In the other 
method,'*-*> use is made of the pseudopotential of 
Fermi. In both methods, the particle density is assumed 
to be low. 

It is the aim of this paper to study the possibility of 
getting higher order terms by the method of the 
pseudopotential. One difficulty of the extension of this 
method is the inclusion of triple collision.* To the lowest 
order, this difficulty is resolved in Sec. 2. Another 
complication of the method of the pseudopotential, 
already apparent in reference 3, is the removal of the 
divergence in the expression for the ground-state energy 
per particle. This involves a comparision of the per- 
turbation series with and without the operator (0/dr)r. 
This task becomes impossibly complicated in higher- 
order terms. In Sec. 3, a slight modification of the 
pseudopotential is proposed so that to the lowest order 
this removal of divergence is automatic. This is done 
at the expense of dealing with a non-Hermitian Hamil- 
tonian at all times. In Sec. 4, this modified form of the 
pseudopotential is used to determine the next two terms 
in the expression for the ground-state energy per 
particle. However, in order to interpret the final 
formula, it is necessary to make a more detailed study 
of the three-body problem. This is done in Sec. 5, and 
the result is used in Sec. 6 to express the ground-state 
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energy per particle for the many-particle system in a 
definitive form. In Sec. 7, some of the simple properties 
of the ground state are calculated. And finally Sec. 8 
consists of some qualitative discussions of the present 
problem. This entire paper deals with the ground state 
only. 
2. THREE-BODY PSEUDOPOTENTIAL 
A system of hard spheres is a collection of pairwise 
interacting particles with the Hamiltonian (h=2m=1) 
H=T+V, (2.1) 
with 
V= DX Volris), (2.2) 
i<7 
where 


rij=|ri—r;|, (2.3) 
and 
0, r>a 
Vo(r) = 


0, r<a. 


(2.4) 


These particles are, as usual, assumed to be confined 
to a cubical box of volume Q=Z', with periodic 
boundary conditions on its surface. 

When .V=2, the interaction potential may be re- 
placed exactly by the pseudopotential! 


8m tanka © 0 
V»=————6(1 — re) —rio + terms corresponding 
Ori 


to partial waves with />1, (2.5) 


where the meaning of & has been discussed in reference 
1. When applied to the case V>2, the replacement of 
Vo by V2 is accurate only when the interacting pair of 
particles are far away from all the other particles. 
Therefore, if Vo is to be correctly represented by a 
pseudopotential, it is necessary to include a three-body 
pseudopotential in addition to V2. To find this three- 
body pseudopotential, consider the case V=3. The 
problem is to find V; such that the eigenfunctions of 


H3=prt+pret+pet Vo(re3)+Vol(rs1)+Vo(riz) (2.6) 
and 
Hs) = prt pot pet V2(r23)+ Vo(rs1) 
+ Vo(ri2)+ V 3(41,¥o,83) 


coincide in the region f23, 731, 712> a. 


(2.7) 
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The pseudopotential V2 as given by (2.5) is ex- 


ceedingly complicated, but when 


ka, (2.8) 


the leading term is simply 
(2.9) 


Vo~ 8rra6(ri a f2)—Tio. 
Oris 


In almost all applications of the pseudopotential, this 
approximation is used. In the same sense, the leading 
term of V; is, by dimensional arguments,! 


V 3(11,Fe,%3) ~ constant a46(1r1—r2)6(re—13)O3, (2.10) 


where O; is some generalization of (0/0rj2)ri2 to three 
bodies. It is the purpose of this section to find the 
constant in (2.10). 

The meaning of the assumption (2.8) is that momenta 
comparable to or larger than a~ are not important. In 
coordinate space, this means that distances comparable 
to or smaller than a are neglected. Therefore for the 
present purpose, the comparison of the two Hamil- 
tonians in (2.6) and (2.7) may be carried out in the 
following manner. Instead of the eigenfunction problem 
originally posed, consider the scattering problem with 
the same Hamiltonian. In accordance with (2.8), the 
energy of the incident field is chosen to be small com- 
pared with a~*. And the constant in (2.10) is to be so 
determined that the scattered fields for the two Hamil- 
tonians coincide as well as possible for points far away 
from the region r;=r2=r;. If the first Born approxi- 
mation is used with respect to V3, then the O; of (2.10) 
may be replaced by 1. 

In order to carry out this program, the formalism of 
binary collision is used. Write (2.6) in the form 


H3= Hot ] oe + V+ Vo, 
and consider the Schrédinger equation 


(H;—E)Wo=0 


(2.11) 


(2.12) 


for a scattering problem. Let yi"° be the incident field 
satisfying (Ho»—E)yi"’=0 and define the following 
operators: 

Go= —(Ho— EE)", (2.13) 
and 


S03 = — (Hot Vo%— E) "V0", etc. (2.14) 


Note that the solution of 


(Hot Vo%—E)y=0, (2.15) 


P= (14.S523)pine, (2.16) 


Equation (2.12) is equivalent to the integral equation 


Yo=vire+Go( Vo%+ V+ V 0!) Wo. (2.17) 


But 


GoV oF = (1+S0%) "So. (2.18) 


Thus the first two iterations of (2.17) give 
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Yo = (1 + Sp So3! + Sy? + SS? + So? S03 + S oF So! 

+ S07 So + SoS 02+ So So) Pim. (2.19) 
It is required to compare Yo with Ye, which is defined 
by the right-hand side of (2.19) with the subscript 0 
replaced everywhere by 2. Here, for example, 


S23 = — (Hot V23— E) V2. (2.20) 


From the way the pseudopotential V» is defined, it 
follows that 
(Sy?9—S.%)yine=0) (2.21) 
for r2;>a. Thus for the present purpose it is only 
necessary to study the quantity 
D=D,+Dz, 
where 
dD, = So? (S08 - So) ine 
and (2.2. 
Do= (So2—So!2)S24pir’, 
In the determination of D,, some properties of 
following Green’s function are needed : 
(HM )— E)G= —6(r,—12)6(re— Fs), 
PG=kG, 


where P is the total momentum and k?<3E. Let 


R,= rtre— 2rs, 


the 


and 


R= ne +93? +nie= 9 (3riv+R2"), 
then a solution of (2.24) is 
E\3 
G=—i——R?-H. (RE|3/N3) 
481° p 
Xexp[4ik- (ri +re+rs) |, 
where 
E,\= E—}k’. 
The behavior of G for small R is 


v3 
G=——R~ exp[4ik: (ri+4e+r) J. (2.29) 
4r' 


The equality r12=a defines a hypersurface in the six- 
dimensional space of relative coordinates (t—r2, Rs). 
It follows from (2.29) that 


v3 
f GdS= f dS f aRi( sec ) 
r12=a rj2=a 4rr? 


X 4(3r12?+ R;”) 2 expLjik- (ry +ret+rs) | 


= —4a exp[4ik-(r+1et+r) ]. (2.30) 


For the purpose of obtaining D;, assume 


pine exp[7(k, ° rt+k.: ro+k;-rs) |, (2.31) 
with 


kP+ke+k?=E. (2.32) 
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To first order, it follows from the definition of the 


pseudopotential that 


a 
(sim—seyyme= (“1 h(a-ra) 
723 


Xexp{ilk, ° r+3 (ke+k;) . (ro+r;) ]} ; (2.33) 


where / is the Heaviside function defined by 


h(x)=1 for x>0 


=0 for x<0. (2.34) 


Therefore, for r;2> a, D, is a solution of 
(Hy—E)D,=0 
with the boundary condition that, for r:.,=a, —Dy, is 
equal to the right-hand side of (2.33). With 
k=k,+k.+ks, (2.36) 
the right-hand side of (2.33) is approximately, for 
rye=4, 


(2.35) 


a 
(- -1)i(a-rs) exp[4ik- (rit+re+r;) J. (2.37) 


T23 


In particular, for fixed r;—fe, 


a 
for(—- 1 )i(a—rn) 
To23 


- a 
-32n f rotdra(. -1)i(a~rs) = (169/3)e%, (2.38) 
723 


which is independent of r;—r2. A comparison of (2.38) 
with (2.30) gives the result that 
D,= — (16/3)2°’a'G. (2.39) 
The quantity D2 may be calculated as follows. First 
it follows from (2.20) that 
So7Apine= 82aG05 (r2—F3). (2.40) 
For a point source in r12>a, So'’—S," gives 0. For a 
point source in 7;2<a, So'* may be replaced by —1, 
while S;'* may be replaced by —a/ri2. Therefore 


D:/G= bf e.- ro)@°R;87ra5(re— r;) 


a 
x (- 1+- ilar) = (16?/3)a*. (2.41) 


T12 


Finally, the substitution of (2.39) and (2.41) into 
(2.22) yields 
D=0. (2.42) 
This means that for the purpose of the present calcu- 
lation, there is no difference between Yo and 2. Since 
this result is independent of E. 


V 3(81,¥ 0,83) =(, (2.43) 


TSUN 


W U 


This means that the constant of (2.10) is zero, and 
consequently it is irrelevant what QO; is. 

In particular, there is no correction to the ground- 
state energy due to the three-body pseudopotential, to 
the order p*a*N, where p= N/Q is the particle density. 


3. TWO-BODY PSEUDOPOTENTIAL 


Even in the case N=2, the pseudopotential 
8ra5(r;—F2)(0/Arj2)ri2 of (2.9) cannot be replaced by 
82aé(r;—f2). However, in the treatment of the N-body 
system with large NV, this replacement is made,’ and 
leads to a divergence in, for example, the expression for 
the ground-state energy per particle. The removal of 
this divergence is a rather complicated process. This 
difficulty stems from the fact that the operator 


fe) 0 
0=8(1}—r=a(0)|—*] (3.1) 
0 r=0 


r Or 
does not commute with the infinite Fourier series; for 
example: 
m sinmr 
m>—n* 


m sinmr 
0oy> ——# X 


men m—n? mn 


(3.2) 


This inequality causes trouble because a typical y for 
O to operate on is a solution of 


(V2+ k°)y = 4rkad(r). (3.3) 


It is the purpose of this section to see how the operator 
O can be slightly modified so that this particular di- 
vergence does not occur. The modified form of the 
pseudopotential is then used to get the phonon spectrum 
near the ground state and the ground-state energy. 

Imagine that the 6(r) in (3.1) is replaced by, for 
example, 


5 (r) = (8re)'e-"€'", (3.4) 


where e’<a. To be consistant, the 6(r) in (3.3) has to 
be so replaced also. In this case, y is distorted up to a 
distance several times ¢’. In order that the pseudo- 
potential can serve its purpose, the part (0/dr)r of 
(3.1) should be evaluated at a distance ¢, where 
e’Ke<a. Since only an § wave is under consideration, 
the result is more precisely 


1 rs) 
Ove =be(0)|— f ir] | 
4r OF Bic 


Since ¢’ is much smaller than any other length in the 
problem, it may be set to zero. Then (3.5) becomes 


1 0 
0,=6 — J dQ— ; 
| J =e 


For a y with no singularity at r=e, O, commutes with 
the Fourier decomposition; for example, in contrast to 
(3.2), 


(3.5) 


(3.6) 


m sinmr m sinmr 
0. > ——=Z O. 


mézn m>—n? 


(3.7) 


men m?—n? 
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It is proposed here to use the O, of (3.6) in the two- 
body pseudopotential instead of O. Only after the 
desired answers are obtained will the limit e— 0+ be 
taken. More precisely, the order of the limits are 
lim.o+ limg.., where the right-hand limit means 
N— @ so that p=N/Q is fixed. 
It may be noted that 


O2=0. 


With the result of Sec. 2 and this interpretation of 
the pseudopotential (2.9), the Hamiltonian of a Bose 
system of hard spheres may be approximated by 


H'=T+V'. 


(3.8) 


(3.9) 
with 


(3.10) 


f~~ f ry*(1)VV("), 


and 


V'=a J dry *rp* (11)¥* (12)5(11 — Fe) 


‘ 


Here the language of quantized fields is used, and y 
satisfies the usual commutation rules for a boson field. 
In (3.11), 0/dri2 is taken with fixed r+re. If the 
annihilation operators in the momentum space are 
defined by 


= a # 
1 =). ky Gy ay, 


0 
dQ\2— neviev(e)| . Bae 
rig=€ 


Or}2 


(3.12) 
then 
(3.13) 


and 
V'=2" 4a DO aa*as*aya,5(kat+ks—k,—k,) 


a,Bi yyy 


Xcos(se|/k,—k,!). (3.14) 


This differs from the V’ of Lee, Huang, and Yang? only 
in the appearance of the cosine factor. When the 
occupation of the k=0 state is almost complete, and 
only pair excitation is included, an approximate formula 
for V’ is 


V’=4rapN +4rap > [ax*ax(4 costek— 2) 
k+~0 


+a,*a_.*+a,a_, cosek ]. (3.15) 


If the following notations are used, 
(3.16) 
(3.17) 


ky? = 8rap, 
_— Ske LRe+ ke? (2 cos} ek— 1)] - 
then 
H'=4mrapN+ > [k?+he?(2 cossek—1) ] 
kx 


X Lax*aut ye (ay*a_y*+aya_x cosek) |. (3.18) 


The eigenvalues of this non-Hermitian Hamiltonian 
give the energies of the low-lying states of the Bose 
system. 
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In view of the form of the Hamiltonian of (3.18), let 


Ay= ay*a,+a_,*a ct 29x (a,*a_,* 
+a,a_, cosek), (3.19) 


where k¥0. This Hamiltonian, being non-Hermitian, 
cannot be diagonalized by a canonical transformation 
of the form’ 


&= (1—ay”) 3 (ay +axa_x*), 
é _\_= (1 —a;’)~ Va wtay.a,*). 
However, if a; is chosen to be 


a= (2, cosek)'[1— (1—4y,? cosek)* ], 


(3.20) 


(3.21) 
then 
H,=—[1—(1—4y.2 cosek)!] 
+ (1—4y,? cosek)!(En*Ex +E n*E_x) 
—2y.(1—cosek) Exé~. (3.22) 
On the other hand, if the choice is 


a.= (2y,.)“[1 — (1 —4y,? cosek) '], (3.23) 


then 
Hy, = —[1 on (1 —4y,? cosek)! ] 
+ (1—4y.? cosek)!(Ex* Ext & n* Ex) 
+ 2.(1—cosek) E*E_4*. 

Since & and && separately satisfy the same commu- 
tation rules as dx, it follows from either (3.22) or (3.24) 
that the energy eigenvalues of Hy are 
Ey (my,m_,) = —[1— (1—4 2 cosek)*] 

+ (my+m_,) (1—4y.2 cosek)!. 


The substitution into (3.18) then yields the phonon 
spectrum near the ground state 


E(my) = Eot+ Epnonon(mx), 


(3.24) 


(5:20) 


(3.26) 


where 
Ey=4rapN +3 > ([R?+ho?(2 cossek—1) | 


k#0 
xX [—-1+(1—4y,2 cosek)!], (3.27) 


and 
Eptonon(M) = } x my_k?+ ko? (2 cos} ek— 1) ] 
k+0 


X (1—4y,? cosek)!. (3.28) 


Equation (3.26) is valid when m,=O(1), and 
> «xo m=0(N). In the limit e— 0+, (3.28) yields 
the well-known formula*4 

Epnonon(Mx) = >> myk(k?+ 16rap)?. 


k+o 


(3.29) 


In the same limit, (3.27) yields the ground-state energy 
per particle 


8 N. N. Bogoliubov, J. Phys. (U.S.S.R.) 11, 23 (1947). 
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Eo/N=4nrap+ lim lim (2pQ)- 


«0+ 2-0 
xX Db [ke + ke? (2 cosek—1) ] 
k~O 
x[—1+(1—4y,2 cosek)#] 
=4irap+ lim (47’p)"! 
xf dkk*Lk?+- ko?(2 cossek—1) ] 
" - x [-14(1—4y,2 cosek)#]. (3.30) 


As k— , the integrand is asymptotically —}ko' cosek, 
which yields zero when integrated from 0 to ~ for any 
«~(. If this asymptotic expression is subtracted from 
the integrand, the limit e—+>0+ may be taken under 
the integral sign to give the result 


ko / N =4ap+ (41r’p) f dk{ k?(k?+ ky?) 
0 


X[—1+ (1—4y,2| <0)! ]+3hot}. (3.31) 


This is identical with the result of Lee, Huang, and 

Yang. After integration, the result is 
128 

Eo/N =4nap| 1+——(a'p)' ]. (3.32) 

15/7 

The present procedure involves no removal of di- 

vergence. Note also that in this calculation the com- 

bination 2cos}ek—1 in (3.17) and (3.18) may be 

replaced by 1 at the beginning. 

Now consider the lowest state of Hy. Since it is 
necessary to distinguish the right and left states, define 
(Ox! and |0,) by 

&,|0,)=é k Ox) =0, 
(Ox| Ex*=(Ox| E-41*=0. 
In terms of ay, (3.33) is, with (3.20), 


(ay+axa_1*) | Ox) = (4_nt+-axdx*) |Ox)=0, (3.34) 
(0, | (a,*+&,a .) = (0; | (a. ct +&.a,) =0. i 


(3.33) 


From (3.34), except for normalizations, the states 
0.) and (Ox! may be expressed in terms of the states 
) and ¢ | defined by 


a,| )=0, ( |a,*=0. 


The results are 


(3.35) 


0.) = K, exp(—a,a,*a_,*) | ), 
ae. ae (3.36) 
(Ox! =A | exp(—&,aya_,). 


The normalization 
(Ox Oy ome 1 (3.37) 
then yields the condition 


K,Ki.= 1—a,d,.. (3.38) 


TSUN 


WU 


The state of Hy, with one phonon of momentum k is 
related to the lowest state by 
11,)= Ku’ Ex*| Ox), 
| 1x) my bx | Dx) (3.39) 
(1a | =Ax(Ox! Ex. 
Since 
(Ox | dQ | Ox) = —ax(1—and), 
(Ox | a,*a- «* | Ox) = — &(1—ay&)", 
(Ox | @x*ax | Ox) = (Ox | dn *a_x | Ox) 


=x (1—ap&), 


(3.40) 


as consequences of (3.36) and (3.38), the normalization 
(1x 1k)=1 (3.41) 

yields the condition 
K,’Ri!= (1—ax2)3(1—&2) 4 (1-day). (3.42) 


The states |m,,m_,) with both numbers different from 
zero are somewhat more complicated, although explicit 
formulas can be found for them. They do not appear 
in the following discussion. It may be noted that, as a 
consequence of the pair excitation, it is possible to 
choose the normalization 


(3.43) 


(my,m _k | my’ ,m_,’) = Omy,my’dm__,m_,’ 


even though H, is not Hermitian. 


4. GROUND-STATE ENERGY PER PARTICLE 


It is the purpose of this section to improve the 
calculation of the previous section for the ground-state 
energy per particle by applying the second-order 
perturbation theory after the canonical transformation. 
For this purpose, attention is returned to (3.13) and 
(3.14). A more accurate approximation of V’ than 
(3.15) may be obtained as follows. 

A. Let mo=(ao*ao), where the expectation value is 
taken in the ground state; then 


o*a*aydo~ (do*do)? 
= [0+ (ao*ay— no) | ~ No? + 2no(ao*ao— no) 


=no(2N—no)—2no D> ax*ax. (4.1) 
k+~O 


Here, use has been made of the relation N= )°, ay*ax. 
t B. In view of the remark following (3.32), « may be 
made equal to 0 in those terms of V’ that can be com- 
bined directly with kinetic energy terms. Thus 
o*d,* yd, COSSRE~ Ay* dy *A0d,~ Nody* ay — (4.2) 

for k¥0. 

C. The same approximation may also be used in the 
numerical part of the Hamiltonian. Thus 


SY ag*ag*agas cos(de|Ka—ks|)~ XY aa*as*aaas 


a, Bo a, Bro 


~ (N—ao*ao)?~ (N—m)". (4.3) 
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D. For the off-diagonal terms, the approximation 


ady~ay*~ no} (4.4) 
may be used. 

E. For the off-diagonal terms, those with four mo- 
menta all different from zero are neglected. 

These approximations are based on the fact that 
1—mo/N is small; more precisely, Lee, Huang, and 
Yang? have given the result 


8 
£=no/N =1———(a'p)!+ oL_(a'p)!]. (4.5) 
3/1 
This quantity is to be studied in more detail in Sec. 7. 
It is convenient at this stage to modify (3.16) to 
{y= Srapé, (4.6) 
and, according to (4.2), to simplify (3.17) to read 
= Sho? (k?+ Ro?) (4.7) 
With these notations and the approximations A — £, the 
Hamiltonian given by (3.9), (3.13), and (3.14) becomes 


H’ = 4rapN[1+(1—£)? J+> (k?+ Ro?) 


k+0 
X Lau*ant yi (ax*a_y*+a,a_x coske) | 


+2-78rani > 


k,k’ ,k+k’+0 
+dusx* ayy COS( + /k— k’ | « )] 


+02 49a DY ay*a_y*ay-a_y cosk’e. (4.8) 


k,k’ +0 


>| k+k’ | ¢) 


[ax* dy’ *Ois ke! cos( 


The first two terms of this Hamiltonian are identical 
in form with the Hamiltonian of (3.18). Therefore, with 
some trivial modifications, the canonical transforma- 
tions of last section may also be applied to the present 
case. The program is to treat these two terms as the 
unperturbed Hamiltonian and then carry out a second- 
order perturbation calculation to get the ground-state 
energy per particle. For this purpose, certain matrix 
elements have to be obtained. 

With reference to (3.33), the right and left un- 
perturbed ground states may be defined by 


£,|0)=0, 
(0 &,= = =(), 


(4.9) 


for all k#0, where & and & are defined by (3.20), 
(3.21), and (3.23) with the y, of (4.7) and (4.6). In 
view of the form of (4.8), one matrix element to be 
calculated is 


AF, =(0|Q"'42a p a,*a td, a k’ cosk’e 0). (4.10) 
k,k’+0 


This may be evaluated using |0,) and the identities 
(3.40) as follows: 


9 See Eq. (400) of reference 3, There the a should be a, 
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AF, = Q-4rra[_ a (Ox | a,*a_,* | Ox) | 
k#0 


XL te (Ox: yA,’ | One) cosk’e } 
k’ 0 


=O" 4mral Do &(1—apay)*] 
k0 
XLS ax(1—apad,) 


k+0 


'coske]. (4.11) 


This is easily evaluated in the limit 2— « and then 
er 
AE,/N = 4rap(64a'p/7). (4.12) 


Accordingly, the diagonal matrix element of the H’ of 
(4.8) is 


(0| H’ 0) trap | 14 (1 


128 a'p 
aa (= "Y+oe(— 4) | (4.13) 


is substituted in (4.13), the result is more 


When (4.5) 
explicitly 


128 f a*p\ 3} 128 a'p 
(O| H’ 0)—4rap\| 1+ (~)+ —(- ") (4.14) 
15 \ 7 


It is seen from (4.8) that the relevant intermediate 
states for a second-order perturbation calculation are 
the three-phonon states. In view of (3.39), they may 
be defined by 

Vilar lar) = K,’Ky 
eb ep ae ree =K,’K, 


it ane To 
OK bee 


) ae 1 é rs 
OKO! bebe Exe, 


FEL *! 0), 


where the three k’s are assumed to be distinct. With the 
help of (3.20) and (3.40), it is not difficult to verify that 
for k+k’+k”’=0 


0)=2" 16man'K,’R, 
aya h 

om »cos(ak” €) 
k—k’ |e 


ger 


(1 — apr & ye) 


(Lasley dae H'|\( (1—4&,”)! 
XK (1—-&”” 


X (lay 


)3(1—ap&) 


oe cos(3 )+symm. ], 


(4.16) 


and 


(O| AH’ 141k, oa = 
X (1—-a,? 
died ams sonal €) 

k—k’! «)+symm. ], 


Q NGwane'K Kye Kye’ (1 —a?)! 


ML ap) 


1 1—a,” 1(1—a,.a&;) 


—& cos(4 (4.17) 
where “symm.” denotes additional terms obtained 
from those explicitly written down by the cyclic 
permutations of the three indices. So far as the energy 
denominator is concerned, it is sufficient the 
phonon spectrum (3.29). With (3.42), the ground-state 
energy shift due to the three-phonon processes is found 


to use 


to be 
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AE,=— > (Expt Ey +E) 256907 


k>k’>k”" >0 
k+k’ +k” =0 


X (1 erie) (1 ape) 1 (1 ae) 
X [apa cos($k’’€) —& cos(}|k—k’| €)+symm. ] 
X Lax cos($k’’e)+-a,a, cos(} | k—k’ | &) 


+symm.], (4.18) 
where 


E,.=k(k’+16rap)!. (4.19) 


In (4.18) the approximation £=1 may be used. Un- 
fortunately, as it stands, the formula (4.18) is not 
meaningful, because lim,..o; limo.,.4E2 does not exist. 
This difficulty is to be resolved in the next two sections. 


5. THE THREE-BODY PROBLEM 


The difficulty with (4.18) is not peculiar to the 
N-body problem for large N, in the sense that a very 
similar trouble already appears in the three-body 
problem. Rather, this difficulty may be considered to 
be a fundamental limitation of the method of the 
pseudopotential. This section is devoted to a study of 
the ground-state energy of a system of three particles 
interacting pairwise through the hard-sphere potential 
Vy) and confined to a large periodic cubic box of volume 
0) 

This limitation of the pseudopotential of (2.9) may 
be seen from the perturbation calculation of the N-body 
ground-state energy by Huang and Yang.' There, the 
energy is calculated to the third order. If one attempts 
to carry the procedure further to get the fourth-order 
energy, the result is divergent for \ >2. The divergence 
is unrelated to the one arising from the omission of the 
operator (0/dr)r. Furthermore, the situation here is 
not improved by the introduction of €: the fourth-order 
energy is convergent for e>0, but as e— 0+, the limit 
does not exist. Since this fourth-order energy is useful 
in spite of this difficulty, this perturbation calculation 
is repeated here with ¢«. Only the case V=3 is treated 
since further generalization does not seem to be 
necessary. 

Consider the three-body Hamiltonian 

H=pP+pe+pr+M, (5.1) 
with 


1 0 
H.=8refa(r.—r9)| ic am rs| 
dr Ore;  Sro3=e 


(5.2) 


+symm. , 


where “symm.” again denotes the other two terms 
obtained from the first one by the cyclic permutation 
of the indices 1, 2, 3. The three particles are confined 
in a cubical box of sizes LXLXL with periodic 
boundary conditions. It is assumed that 


cKAKL. (5.3) 


TSUN WU 


For the perturbation calculation of the ground-state 
energy, the three particles may be assumed to be 
distinguishable. Then the eigenstates of the kinetic 
energy part of the Hamiltonian (5.1) are the momentum 
states |k,,ko,ks) with the coordinate representation 


(11,82,Ts | ki, ke,k;) 
=(-! exp[i(ki “rntk, ° ro+k;: rs) . 


In terms of these momentum states, the matrix elements 
of H, of (5.2) are 
(ky’ ke’ Jk’ | A, | k,,ko,k;) 

=Q8ara[_ bye’ cos (4 | ko—k;| €)+symm. ]. 


(5.4) 


(5.5) 


The first three orders of perturbation yield just the 
following results by Huang and Yang,’ in the limit 
e—0): 

EV= 241ra/ Q, 

E®=C(a/L)E™, 

E® = (C?+&)(a/L)*E™, 
where C is a constant approximately equal to 2.37 and 
£, with a>2 is‘defined by 


(5.6) 


tant SF! (P+ mtn). (5.7) 


l,m,n=—® 


This perturbation calculation is now to be pushed 
to fourth order. Let Go be the operator defined by 
(2.13) with E=0: 


Go=—(pr+p?t+ps)"; 
and (Q be the projection operator 
Q=1— |0,0,0)(0,0,0'. 


Then E™ consists of two parts, one of which can be 
easily obtained in the limit e— 0+: 


E®=F, +E, 


(5.8) 


(5.9) 


(5.10) 
where 


E, = (—9CEo+15€;) (a/L)7E® (5.11) 


in the limit e— 0+, and 
E, = (0,0,0| H,:GoQH :GoQH GoQH | 0,0,0). (5.12) 


With (5.5) the right-hand side of (5.12) may be found 
explicitly. It again consists of two parts, one of them 
being in a form not dissimilar to E,: 

FE. =E;,9%+E™, (5.13) 
where 


Ey + FE, — (C?—Ci+ 15£;) (a, ‘L)°E' 1) 


in the limit e— 0+, and 


(5.14) 


EO=-EOYSray> 


kit+kot+k3=0 


(kv +k2e+ k;*)"! 


X[k? cos(hie) cos(} | ke—k;| &)-+symm. | 


X[k? cos(}kie)+symm. ]. (5.15) 
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In (5.15), each component of each k is of the form 
2rL™ X (integer), and the prime means the omission 
of those ki, ke, k; that makes any one of the energy 
denominators vanish. 

There remains the rather complicated task of finding 
the behavior of £, when e<L. Only the leading term 


will be calculated here. Consider first the sum 


| akg (ke+ he? +k;?) ky 4 


k)}+ke+k3=0 
Xcoskie cosdkye cos(4 


7 oe 
~ 


1= 
k.—k; |). (5.16) 


k,=k.—k:;, (5.17) 


yg 


ki+ke+k3=0 


(Skv+ke) 1k 


Xcoskie cosskye cosskye. (5.18) 
Since the terms with k,=0 do not contribute to the 
leading term, they may be omitted. Furthermore, when 
the decomposition 


(3k +h) = ke? —S3kPRe (Ske +e) (5.19) 


9 


is used in (5.18), those terms coming from ky? 
do not contribute to the leading term. Therefore 


b ing (3k +key) ky Phy 
ki +k:+k; =0 
k, ~0 


again 


ys I'm —6 


=1 


(5.20) 


X coskye cosshky€ cost hye. 


This may next be replaced by the integral 


n~— 10°(2n)-* f dh hy (3kst+ ke) lpi ky? 
(5.21) 


X coskye cosky€ cos} kye, 


where the domain of integration is ki>L™, ky>L". 
Now it is straightforward to get the result that as 
e(/L—0 
D1! = —2?(2r)-*[ 2n*v3 In(L/e)+O(1) ]. (5.22) 
The other part of the sum 
b iy 
kitket+k3=0 


X coskie cos}hkye cos(}|k,—k;| €) 


(RP+R2 +3?) hr ky 
(5.23) 


may be replaced immediately by the integral 


2 ~$00n)-* f Phu, 
X (k++ ke — ki: ke) kr ke, (5.24) 


where the domain of integration is L'<ki<e"', 
I <ko<e". The angular integral may be evaluated 
to give 


e7l 


Yo’ ~0? (23) dkydko( kyk2) . 
kP+ke+kike 


X In ee 
ki+he?—kike 


(5.25) 
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Introduction of the variable «= k2/k; yields the result 


dx Lx 1+a+2 
D2! ~ 202 (29) f In— In————._ (5.26) 
e/L - 


: € 1—x+2? 


Since the integral 


ldx I+a+2? 
f — Inx In———— 
o 2 1—*+2" 


is convergent, a contour integration gives finally for 
(/L—-0O 

Yo’ =? (2r) ‘Ln (L/e)x?/3+0(1) J. (5.27) 
The substitution of (5.16), (5.22), (5.23), and (5.27) 


in (5.15) yields 


E. = —E®(a/L)3 


[8 (8r/3—2v3) In(L/e)+O(1)]. (5.28) 


In this calculation e~! acts as a high-momentum 
cutoff. Since, in the actual problem of hard spheres, 
the only quantity of high momentum is a~', the form 
of the correct expression for the three-body ground-state 
energy may be expected to be 


24na{ 1-+C(a/L) + (+E) (a/L)? 
+ (a/ Ly (C?—Ck&+ 15é; 


—8(8x/3—2v3) In(L/a)+ &3]+o[(a/L)*}}, (5.29) 


where (5.6) and (5.14) have been used. In (5.29), &3 is 
a number that cannot be determined by the method of 
the pseudopotential. A comparison of (5.28) and (5.29) 
gives the following interpretation of the sum in (5.15)- 


(RP +R +R?) 


kit+ke+k3=0 
X [ki-® cos(kie) cos(4 | ko—ks| €)+symm. ] 
X[k? cos}kie+symm. | 


—+ 302(4ar)-[ (84 /3—2V3) In(L/a)— 3/8]. (5.30) 


This interpretation is to be used in the V-body problem. 


6. GROUND-STATE ENERGY PER 
PARTICLE—CONTINUED 


The asymptotic forms of a, and E; are,ask— © and 
§= i, 
ap~ 4rapk~, (6.1) 
and 


E.~k’, (6.2) 


as seen from (3.21), (4.6), (4.7), and (4.19). Further- 
more 


&=az coske. (6.3) 


Therefore, when the three momenta are all large, the 
summand of AE, as expressed in (4.18) behaves the 
same as that of the left-hand side of (5.30), except for 
a numerical factor. To make a more detailed compari- 
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son, consider first the difference 


A= LD! (hit+h?+k?) 


ki + ke+ks=0 


'Lky'— (4rap) “ax? | 


X coskie cos}kie cos($|k2—k;| 6). (6.4) 


This quantity is studied in Appendix A. The result there 
is 


Ay = $2?(2r)-*[ —2v3 In(12map)'L+C,] (6.5) 


in the limit « — 0+ and Q large. C; is a number defined 
in Appendix A. Next consider the difference 
A:= >’ 

k,+ ket+ks=0 


X LAr *k: 


(ky +ko+k;") : 
°— (4rap) *akyaks | coskye cos} koe 


\k.—k;|«). (6.6) 


Xcos(4 


In view of (6.1), the limit e— 0+ may be taken under 
the summation sign. Thus 


A»= » tig (ky?+ ko? +k") 


kit+ke+k3=0 
X [Rr tke? — (4rap)akyaks |. (6.7) 


As a consequence of (5.27), A, must be of the form, for 


large Q, 
As=((2r)[ dar In(122ap)'L+Cz | (6.8) 


where C2 is a number defined by (6.8). oleae (6.6), 
(6.8), and (5.30) then give the interpretation 


(4rrap) 2 > img (ky? + ko? +k,”) i 


k)+ket+k3=0 


X [a@1 cos(} | ko—k;| e)+symm. | 


X [ar cos}kye+symm. | 


—> —30°(4r)[(4—Vv3) In(12ra*p) +36;+C14+16C2 ]. 
(6.9) 


Note that InZ does not appear in (6.9). With (6.9), it is 
possible to write AF, in the limit 2— © and e— 0+ 
explicitly as follows: 
AE,=AE2,+AE2t+AE2;, (6.10) 
where 
AE 2, — 32ra‘p’N[_($4—V3) In(122a%p) 
+$634+C) +16C» 


2a’ 2N 
ff fone k’d*k’’6(k+k’ +k’) 


a nine ee '(1—a,”) 
1 (p24 p24 p’2)- 


(6.11) 


K (1—ay?) "(1 —a,”) (6.12) 


and 


2a’ »N 
: ~~ ff fenewon orn tn’) 


K (Ft Bee t+ Ep) '(1—a,”) 1(1—a,/") ! 
X (1—ay/*) 
XK (Qe Zap: + 2erye — Opp — Ap AE — AQ} ). 


Vara tayrray +ayay’) 


(6.13) 


TSU 


N WU 


In (6.12) and (6.13), € may be put to zero in ay. There 


only remains a slight simplification of these formulas: 


loa2V 7” ‘ : ’ 
—_—— f dk f dk’ f dk! kk’ U(k,k’,R’”) 
37? 0 0 


X (anton tax) (Let Ee + Ee) —a;*)* 
y-1~ (ke+k2+k')-], (6.14) 


AEs= 


'(1—a,? 


xX (1-—a:”) 


16a* NV 
ff ef wwf dk” kR’R'U(k, RR”) 
3m? 0 


11 —a,-?) 


and 


x ( Ey AE pet E pe) a € | —a;') 
X (1-aye?)- 
XK (Qa t+ Zaye + Zap tapray 


Meprapr Fag ap tara) 


(6.15) 


1 AKA; — AA’), 


where U(k,k’,k’’)=1 if the numbers &, k’, k”” can form 
a triangle, and =0 otherwise. For dimensional reasons, 
AEFo.+AE»:; is of the form 

AEo2+AEo3= 4a to NC 3, (6. 16) 
is a dimensionless number. Finally, let 


C= 8C,+128C2+C3+ (128 Or), 


then it follows from (4.14) and (6.10) that 


where C3 
(6.17) 


E)=(0| H’|0) +E: 


128 
(a%p)} 
154, os 


=4rapN E +— 
+8(42—v3)(a%p) In( 127a*p) + (a'p) ( s+e)| (6.18) 


Note that the three-body energy 6&3 of (3.72) has not 
been explicitly found. On the other hand, @ is known 
in principle by (6.17) in terms of C1, C2, and C3, which 
in turn are defined in Appendix A, Eq. (6.8), and Eqs. 
(6.14-16), respectively. 


7. PROPERTIES OF THE GROUND STATE 
In this section, a few simple properties of the ground 


state are to be studied. 


A. The Depletion Factor & 


First the formula (4.5) may be improved. 
(3.40), it follows that 


From 


(O ao*ao|0)=N— s ay.).(1—a,.&,) l 
k+0 


8 
-s[1-—en)e] (7.1) 


34/4 


as e— 0+ and Q large. Next it is necessary to calculate 
the further depletion due to the creation of phonons. 
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The one-phonon states have the property 


(O ay*ay|0)—(1y dy*dy 1) 
=(1,! ay*a,! 1,)—(O| ay, *a,| 0) 
+(1,) a x*a k 1,)—(Ola «*a_,|0). (7.2) 
Since the one-phonon state is explicitly known, it is 
tedious but straightforward to verify that 


(14 | ay*ay | 1,)= (1—apd.) (1+ a07n&;), 


(7.3) 
and 

(1, a4*G > 1) = 2a, (1—ap&) I (7.4) 
Accordingly 
(0| ao*ao|0)—(1 | @o*ao| 1,)= (1 — ajax)! (1t+a,&;,). (7.5) 


A perturbation calculation with (4.18) then yields 
NE—(0| ao*ay|0) 


k>k'>k" >0 
k +k’ +k” =0 


X (lands) (1 oye) —eydtar 


XC(1 apd)! (1+0,8;,) +symm. | 


2569? a7pQ" (Ey + Ey +E) 


X [aa cos(dR€) —& cos(4|k—k’| &) 
+symm. ] 
X Lax cos($k’’€)+a,a, cos(4|\k—k’ | €) 


+symm. ]. (7.6) 


Because of the extra energy denominator, the limit 
¢— 0+ may be taken directly here. The result for 
Q— x“ ande— 0+ is 

8 32 


€=1———-(a'p)!+ 
3/1 


xf ak f ax’ f dk" kk'k''U(k,k’,k’’) 
0 =9 0 


x ( Ey tke tke) 2( 1—a,’) 1(1—ay?) “tk 
X (1—ay?)“[(1—a,2) 1 (1+0,2)-+symm. ] 


16a° 
a’p—— 
3r 3m 


XK (apap tage: — apg — AK ey. —ayery’)?, (7.7) 
where U is defined after (6.15). Note that no logarithm 
term appears in (7.7). As before, in the a, under the 
integral sign, it is understood that e=0 and ¢=1. 


B. Three-Particle Wave Function 


In the work of Lee, Huang, and Yang,’ the ground- 
state wave function is expressed in terms of a two- 
particle function 

1 


f(rn)=——__ J dk aye**"". (7.8) 


Srp 
It is noted there that terms of the form 


f(ri2) f (ris) 
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are missing in the wave function. In the present calcu- 
lation, due to the admixing of the three-phonon states, 
a certain three-particle function takes the place of the 
combination f(ri2) f(r13)+symm. It is the purpose here 
to make a partial study of this three-particle function. 

A phonon does not differ very much from a free 
particle if its momentum is much larger than (ap)!. 
This approximation may be used if the three relative 
distances are much less than (ap)~!. In view of the 
difficulties arising from the three-body problem, the 
results are not expected to be valid when the relative 
distances are of the order of a. Thus it is hereby 
assumed that 


a<Kro3,r31,71x< (ap). (7.9) 


Under this assumption, define the required function by 
f3(t1,¥2,83) 


=-N-° > 


k +k’ +k’ =0 
X(Ligler,Lerr|H’|0)(Er t+ Ee tee)“. (7.10) 


expli(k-m+k’-r.+k’ +43) ] 


Since only momenta larger than (ap)! are of concern, 
(4.16) simplifies greatly with (6.1) and (6.3). With ¢ 
put to zero, the result is 


(Li Lice Lar | H’| 0) 


~ — 6490? pQ" N(R? +R’). (7.11) 


Therefore, asQ— ~, 


f3(11,¥2,%3) 


= ea) 


X (kit Rett hs) (keke + ket) 


X expla (Ki: ri t+ke: rot+k;-4) J. (7.12) 


This integral is explicitly evaluated in Appendix B. 
The result is 


fs(t1,¥2,03) = 893 L7y'4' (yy? — 117) 4 
Xsin“(r;//y)+symm. ], (7.13) 
where 

, 


I; =r;—3(ritretrs), (7.14) 


and y is defined in (B.15). Under the assumption (7.9) 
the “missing” terms are 


1 


f—Fe|— || t1—Te 


+symm. ]. 


f (rie) f (ris) + symm. = a, 
(7.15) 


Thus it may be seen that /3(11,r2,r3) is not quantitatively 
close to f(ri12) f(ri3) +Ssymm. 

The conclusion is then reached that even at 
densitites, the ground-state wave function is not of the 
form 


low 


(7.16) 


t<) 


In particular, it seems unprofitable to push any further 
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the procedure of Aviles," who calculated the ground- 
state energy per particle by a variational formula using 
a trial wave function of the form (7.16). 


C. Pair Distribution Function 


The pair distribution function, normalized to be 1 at 
infinite distance, is defined by 


D(r)=0-(G\¥* (0)y¥* (Hy (OY () |G) 


for the ground state, where |G) is the ground-state wave 
function. For the determination of D(r), the following 
formulas are of use: 


(7.17) 


(Li | @xQ—x| 1c) = — 20x (1—ards), (7.18) 


and 


(7.19) 


As in the calculation of &, it is permissible for the present 
purpose to set e=0 at the beginning; in particular, 
ax=&. Following Lee, Huang, and Yang,’ define 


(1x | @ye*a_4*| 14) = — 244 (1—andy) +. 


i 
Pyne f dk e*(Glay*ax|G), (7.20) 
82*p 


Oban fer e**-*(G| aya_y|G) 
8r°p 


ak e*k-1(G| ay*a_y*|G). 
8r°p 


(7.21) 
It follows from (7.6) that 


1 
F(r) =—— fox eik-r 
82°p 


16a*p 1+a,;2 


x| ie +H} (7.22) 
1-—a,2 wr 1—a,2 


where 
o(k)= f #k'[ (Ex t+ Ey + Ey) ?*(1—ay?) 


X (1—ay*) (1 age”) (ag tag tap apa 
(7.23) 


— cry tp — pete)” Jit meg ke’ 


Similarly, it follows from (7.18) or (7.19) that 
1 ; 
G(r) =— fox gt 
82°p 


—a,? 16ap — 2a; 
x| +——_ -———_¢ 
2 - 2 


1—a; 1—a; 


w| (7.24) 


In addition to F(r) and G(r), the following function is 
also needed : 


J. B. Aviles, Jr., doctorate dissertation, Johns Hopkins 
University, 1958 (unpublished). 


TSUN 


WU 


n= J fener e pi G| Ak4k’ * yy? |G) 


(82'*p)? 
f fenow efk-r 


(8a)? | ad 
X(G| Oy* Oy * Oye’ |G). (7.25) 


A calculation quite similar to the above gives 


H()=-— ff fernerarn 0 k+k’+k”)e* 


4r'p 
X (Ext Ev +Ey)“(1—ax)'(1—ay?) 
X (Lage?) (nr —eryern’) (an tartar 


p06 K — Ap Lp — AO’). 


(7.26) 


In terms of the functions F(r), G(r) and H(r), the pair 
distribution function of (7.17) may be written as 


D(r)=1+ 2é[F (r)+G(r) ]+[F(r) P 
+[G(r) P+4H(r). 


However, note that except for the exponential factor, 
H(r) is very similar to the AE» of (4.18). Therefore, 
due to the complication arising from the three-body 
problem, (7.26) for H(r) is inaccurate for r~a. Com- 
pared with the result of Lee, Huang, and Yang, the 
more complicated formula (7.27) gives no extra in- 
formation for r~a, but is probably more accurate for 
ar. 

It remains to evaluate D(r) approximately for 
(ap)-*«Kr. Since 


(7.27) 


1 
F+6)=— [ak et 
82° p 


—a, 16a’p 
x{ + 

Ita, m (1+a,)? 
where $(k) is that of (C.1), it follows from (3.13) that 


asr—> @ 


ro sap\} 8sa'p\! 
F(r)+G(r) > -s-(=) [:--(—) | (7.29) 
rmXrXN 2 3\ x 


where 


sm) (7.28) 


ry= (8rap)~*. (7.30) 
On the other hand, in terms of the @'(k) of (C.14), 


H(r) of (7.26) is 
a . — 
H()=-— fet e'*t(1—q,2)16'(k). (7.31) 
4rp 
Application of (C.18) gives, asr— ~, 


ro -(2C) 


(7.32) 
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So far as the terms [F(r) ? and [G(r) } in (7.27) are 
concerned, it is sufficient to use the result of Lee, 


Huang, and Yang? that as r— 
F (r) > (x°’pror*)—, 
os ( i ; (7.33) 
G(r) > —(x°pror’); 


therefore 


(Fin) P+[G() PO ion(~*). (7.34) 
r! 


T 


Finally the substitution of (7.29), (7.32), and (7.34) 
into (7.27) gives, asr— 2, 


ro! ap 3 
poywi—“[16(—*) 
r3 T 
80 a’p\3 
x|1- (16r+—) (=) | (7.35) 
3 wT 


This is the desired answer. 


D. Sound Speed 


Although strictly not a property of the ground state, 
the speed of propagation of a compressional wave in the 
limit of infinite wavelength is related to the ground- 
state energy per particle.* Thus it follows from (6.18) 
that this speed is 


8 
(167ap)! 1+—(a'p)!+12(§r—v3)a%p In(127a'p) 
Jr 


+ (a*p)[$(6;+C)+10(44—v3)—32/m]}. (7.36) 


8. DISCUSSIONS 


Since the preceding calculation is concerned with an 
improvement over the results of Lee, Huang, and Yang,’ 
it only remains to discuss a few miscellaneous topics 
peculiar to these higher order terms. 

A. Although this calculation has been carried out 
only for the case of hard-sphere interaction, it also 
applies to any repulsive interaction of short range. In 
fact, the only modifications are: (a) & is in general 
different, and (b) the length a appearing in the calcu- 
lation is to be interpreted as scattering length of the 
two-body potential. In particular, the logarithm term 
of (6.18) appears in the more general case. 

B. This logarithm term may be obtained by an 
argument as follows. From (5.29) for the three-body 
problem, the three-body ground-state energy has a term 


— 192x[ (8/3)x—2v3 Ja*Z-* In(L/a). 


In the logarithm, Z~ plays the role of a low-momentum 


cutoff. Since 
N 
3 
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for large NV, there should be a term of the following form 
in Eo: 


Ne 8 
——1924(—r— 6 atl In(L’/a), 
6 3 


where L’—! is some low-momentum cutoff. Since this 
should be proportional to V, L’ must be independent of 
the size of the box. This leaves just one possible choice, 
namely 


L'= (ap) a 
With this choice of L’, this energy becomes 
162[_(8/3)xr—2v3 Ja‘p?N In(a*p), 


which is identical to the logarithm term of (6.18). In 
fact, this is the argument originally used to get this 
term, before the present calculation was carried out. 
However, it is much more satisfying to see the explicit 
cancellation of Z under the logarithm sign, as exhibited 
in (6.9). 

C. It may also be noted that no logarithm appears 
in the depletion factor £ of (7.7). Thus the logarithm 
term of the ground-state energy does not come from the 
depletion of the single-particle k=0 state. 

D. Contrary to @, no formula has been found for the 
number 6&3. After repeated failure to find &3, the author 
conjectures that &; cannot be expressed in terms of a 
finite number of integrals involving only well-known 
functions. 

E. If an unknown number &; has to be introduced 
for the ground-state energy per particle, why is & of 
(7.7) well defined? The reason is as follows. The com- 
plication due to the three-body problem implies that 
the matrix elements of (4.16) and (4.17) are inaccurate 
for any k of the order of a~'. For large k, the energy 
behaves like k? but the depletion behaves like 1, from 
(7.5). Thus the unknown region of large momenta 
contributes much more to the ground-state energy per 
particle than to the depletion factor & To the order 
calculated, it turns out that the contribution is of 
importance for the former quantity but not for the 
latter. 

F. The existence of a sound speed (7.36) implies the 
absence of an energy gap immediately above the ground 
state. How can this be verified directly? Even though 
the phonon spectrum is unstable due to the three- 
phonon perturbation, for a given 2 the one-phonon 
state with a momentum of 27/ZL is stable. It is possible 
to modify the present calculation to find the energy of 
this state. Here it is important to note that the ground- 
state depletion factor for this state is different from &. 
When this is taken into account, a very tedious calcu- 
lation yields the result that there is no energy gap to 
order ap(a’p)}. 

G. So far as the asymptotic behavior of the pair 
distribution function D(r) is concerned, although the 
first two terms of the coefficient of (ro/r)* are given in 
(7.35), there seem to exist no proof that it is indeed 
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of the form 1+O[(ro/r)*] as r— «. In particular, a 
term of the form (pa*)?(ro/r)*In(r/ro) has not been 
excluded. 
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APPENDIX A 
In this Appendix, the quantity A; defined by (6.4) is 
studied. Since «— 0+ eventually, two of the cosine 
factors are ineffective; thus 
Ai= »’ 
kit ke+k3=0 


x [ki *— (4rap)ax;? | cos (3 | ko—ks)| €). 


(ki +h? +k;*) 
(A.1) 


The first part of A; is identical with the 2,’ of (5.16). 
According to (5.18) and (5.19), A; may be broken up 
into 

(A.2) 


~ y» 7 wu 
~o 11 +212 “aml oy 


. / 
Ai = > 10 
where 


a ae 


(ke+ke+hk;"?) 1k 
ki +ke+k; =0 
ky =k; 


(A.3) 
(A.4) 


Xcos (4 ko—k; €), 


ou 2 By ks hy coshhue, 
ki +ke+k3=0 
>1'= —6 aa 


ki+ke+k3=0 


(3k +k,7) lk, 2k > cost hye, (A.5) 


and 


¥,"=(4eap)? 


ki +ket+k3=—0 


(ke +ke+hk;") lovky? 
Xcos(}|ko—ks| €). 


Here ky is that of (5.17). Clearly with (5.7) 


(A.7) 


ap = 


3. 
12 (27) 


For the evaluation of Y,)’, define the following variations 
of the constants C and £: 


ra 


- 7 2 = 
he = & > 
$2 


l.mn=—® 


[?+m?+ (n+3)?}°, (A.8) 


y [P+ mtn are, —(A9) 


x 


a 


> (Cd+3)?+(m4+3)" 
+(n+3)?}°, 


l,m,n 


(A.10) 


TSU 


N WU 


C=r'lim > 


e-0+ 1 m,n 


[P+ m?+ (n+3)? } 
; Xcosle(?-+ m+ n*)! |, 
[P+ (m+})?+ (n+3)?}? 
Xcos[e (2+ m?+n?)!], 


(A.11) 


C"=r' lim > 


e+ 1.m.n=—s 
(A.12) 
and 


«x 
CV=_" lim > 


e+ l,m, n=—s 


[(/+-3)?+ (m+3)?+ (n+3)?}7 
X cos[e(?-+-m?+n?)?]. (A.13) 
These constants are related by 


fa! + £2!" + $2!” = 5s, (A.14) 


and 


C'+0"+3C'"=C. (A.15) 


In terms of these constants, 21,’ is, in the limit « —~ 0+ 
’ J ’ 


1 ? 


(CE2+3C' Eo! +30" E2!""+-C'""Eo!""). (A.16) 
3 


In a similar manner, ¥,’’ may be separated into 


(A.17) 


vos ow 


1] “12 5 


vi 


= 
@W!| —s2 
where 
Yi’ =2(4arap)* ay kg ak)" Coss k4e€, (A.18) 
k)+ ke+k3=0 
and 
>.’ = — 6(41ap) 2 » My 
ki+k2+ k3=0 


ped kivks “ak ? cost ke. 


(3k r+k,) 1 
(A.19) 


In the limit Q— ~, 


1 95 32 
f a? k?dk = ——(4ap)'. (A.20) 
Qn? 1052? 


0 


Therefore 1)’ is of the order 2° as Q— «& and hence 
may be omitted. Finally, as «— 0, 


ee > 
pa 2 


(3ky+k4") \Ry 2k, . 
k)+ ke+ k3=0 


X[1— (4rap) hy 'ax:? }. 


is =—0 


12 
(A.21) 


In order to consider the two terms separately, it is 
convenient to introduce a high-momentum cutoff for 


ky. As Q—- we, 


o% >’ 
ki+ ko+k3=0 
6 
—- ss f ehh (3ke+ke) Rik; ?(4rap) "ak; 


(2ar)® 


(3k +hs?) RP ky? (4rrap) ak? 


3 K ‘ 
= f ans f dk, ky (3kP +e) (4rap) ar? 
1674 0 
\ 


3 19 
= in(125ap 1K—- +0(1)| (A.22) 
12 


32? 
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as K — o, On the other hand, in view of (5.22), it is 
possible to define 
Cy’= lim {2’[3(P+m?+n?)+ (l?+m"?+n”?) 
a X [P+ om? +n? 12 +m? +n? 
—7n3-)In(2rN)}, (A.23) 


where the ¥’ extends over 1, m, n, I’, m’, and n’ such 
that 1?+-m"+n?<N?, and 1+/', m+m’, n+n’ are all 
even. With (A.22) and (A.23), (A.21) reduces to 


12” = —0?(2r)—*[ 2v3x' In(12map)4L 
+6C,'+ (19V3/6)r*].  (A.24) 


Now (A.7), (A.16), and (A.24) may be substituted into 
(A.2) to yield 


1 


A= [—2v3 In(12map)'L+C,], (A.25) 


8 (2m)8 
where 

: 1 

C= iit (CEa+3C' fo! +30" E 2" +" E0!") 


19v3 
- (A.26) 


—6n-*C|'—- 


6 
APPENDIX B 


In this appendix, the following integral, which 
appears in (7.12), is evaluated: 


= ff fh sPhsdh, o(k-+hetths) (be-bk +h , 


Kk ? exp[i(kKi-rit+ke-re+k;- rs) ]. (B.1) 


When the integral representations 


6(k,+k.+k;) 


= (2m) far exp[iR: (ki+ks-+k;)]  (B.2) 
and 
(k2-+he-+kz)1= f dg expl—f(ke-+he-+k2)] (B.3) 


0 


are used in (B.1), J becomes 


1=(24)-3 f @R 


xf d¢J (R+n, O)K(R+n, OK (R+r;, 6), (B.4) 


where 


WR) = fork k exp(ik-R—¢#), (B.S) 
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and 
0 
K(R,¢) es J(R,¢). 
ag 


c 
The function K may be integrated to give 


K(R,f)= (1/5)! exp(—4R?/), (B.7) 


and hence 


JR)= f (a/texp(—1R/y ae 
t 


(B.8) 


When (B.7) amd (B.8) are substituted into (B.4), the 
¢ integral and the angular part of the R integral may 
be carried out to give 


iy, if dR(R?+a’) 4(—R+n,) 


A X (R°+-Rrni+p’)", (B.9) 
where 
=} (r2+r2+rz), (B.10) 
and 


(B.11) 


In (B.9), it has been assumed that ry-+ro+r;=0 
without loss of generality. The evaluation of the re- 
maining integral is straightforward but tedious. The 
relation 


B?=3 (r2?-+1;"). 


3a? — 28°—1r=0 (B.12) 
simplifies the calculation greatly. The result is 
I = 89°3-'7, (2a?—1,?)-! sin (2-47,/a).  (B.13) 


In order to remove the restriction r;+ro+r;=9, it is 
sufficient to make the following substitution in (B.13): 


re rth (ritret+rs). (B.14) 
Under this substitution 
2a? 3 y?=3 (re tret+rs?) + (7/27) (rit re+rs)?. (B.15) 
APPENDIX C 
Consider the integral 


$(k)= (1—a,2)6(k) = fan’ M (k,k’), 


where 
M (k,k’)=[ (2+ Ee +e)? (1 ay?) 
K (Large?) (antag fap —apage 
— apr Oty, — yay)? |r +k’ 
Assume 
kK (ap)}. 


When k’<(ap)!, the behavior of M(k,k’) is 
M (k,k’) = (16mapt)~' (4k' | k+k’| )~'. 
When k<k’, it is convenient to use 


p=k’+3k, 


and 
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M (k,k’) = [2p (16mapt+ p*)!+-k(16mapt)! |? 
X (1—a,”)*[1—a,?—k(4rapt)—3(1—2a,) }? 
k(16mapé)* 


=[4p?(p?-+ 16xapt) [1-—_—* = 
[4p°(p" apt) } pt Torape)) 


. k(1—2a,) 
or 
Take a momentum K such that 
kKK<K (ap)}, 
and write (C.1) in the form 
o(k) =91(k) +42(k), 


where 


bu(k)= f @k' M(kk’), 
p<K 


bt)= f d*k’ M (k,k’). (C.10) 
p>K 


It is then found from (C.4) that 


rg 
oi(k) =———_[ K+ O(k?/K) ]. 
16mapé 
On the other hand, ¢2(k) is 


$o(k) =x (16mrapt) if d6 seché 


sinh 1K (16rapt)-4 
X [1—4k (16rapt)—4 (14-679) J 
= —K (16mapt) +32? (16mapt)—! 


—4nrk(16rapt)"(49r+}). (C.12) 


TSUN 


WU 


Therefore, for k<(ap)!, the first two terms of $(k) are 
$(k) = 40? (16rapt)—!—4ark(16mapt)— (44+3). (C.13) 


In (7.26), a function similar to ¢(k) appears: 


¢' (k)= fox M’'(k,k’) (C.14) 


where 


M' (k,k’)=[((Ert Ev t+ Ex)“ (1—ax?) 
X (Larne?) (apr —erperns) (ax tary Fay 
= Opp! = e1— Ok!) |r mez’. (C.15) 
Again assume (C.3). For k’<(ap)}, 
M' (k,k’) = (16mapt)(2k' |k+k’ |)“ 
% (—[h-Ek'|-+e+#); (C16) 
and for k<k’ 
M'(k,k’) =[4p?(p?+ 16mapé) }-3 


k(16mapt)! ims 
p(p?+16mapt)! (4rapt)*(1—a,”) 


d 
x| ({k+k’| —k’)—a,+ (1—ax) 
dp 


d 
x (art (k!— per) | (C.17) 
dp 


The situation here is very similar to the case of $(k). 
Thus it is permissible to use (C.17) in (C.14). The 
result is then 


$' (k) = 24 (1—ax)[4 (16mapé)!— k(34r—1) ]. (C.18) 
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The description of steady-state phenomena, near equilibrium, in terms of Gibbs ensembles is discussed. 
The Liouville equation is modified to include external nonconservative forces which prevent the system 
from reaching equilibrium. The steady state ensembles are then obtained as (approximately) time-inde- 
pendent solutions to the Liouville equation; such ensembles depend linearly on the thermodynamic pa- 
rameters which characterize the deviation from equilibrium. With the aid of the steady state ensembles 
the linear relations between the thermodynamic fluxes and forces are obtained. 


1, INTRODUCTION 


HE description of transport phenomena in gases 

at low density may be based on the Boltzmann 
equation. Discussions which are applicable to gases at 
higher density and to liquids have been given by a 
number of authors. In particular, Born and Green,? 
Bogoliubov,? and Kirkwood‘ have obtained kinetic 
equations which provide a generalization of the Boltz- 
mann equation, and Green’ and Hashitsume® have 
discussed transport phenomena from the point of view 
of Markoff random processes. The methods of Kirk- 
wood! and of Green® have been carried to the point of 
yielding explicit formulas for such transport coefficients 
as the thermal conductivity and viscosity; however, 
general methods for evaluating these formulas have 
not been given. 

The discussions of Born and Green, Bogoliubov, and 
Kirkwood are based on the distribution function 
method, the quantities of basic interest being the 
molecular distribution functions of low order. In the 
treatments of Green and Hashitsume the time de- 
pendence of the pertinent macroscopic variables is 
discussed directly. It is expected that some formal 
advantages may be offered by an approach to non- 
equilibrium phenomena in which the Gibbs ensemble 
plays a more prominent role. Here we wish to discuss 
the construction of Gibbs ensembles for the description 
of the steady state, near equilibrium; some comment 
will also be devoted to nonsteady phenomena. It will 
be seen that a steady-state ensemble can be simply 
expressed in terms of the external forces which maintain 
the deviation from equilibrium. With the aid of the 
steady-state ensemble one can obtain linear relations 
between the thermodynamic fluxes and forces, the 
coefficients in these relations being the transport 
coefficients. 


1S. Chapman and T. G. Cowling, The Mathematical Theory of 
Nonuniform Gases (Cambridge University Press, Cambridge, 
1939). 

2M. Born and H. S. Green, A General Kinetic Theory of Liquids 
(Cambridge University Press, Cambridge, 1949). 
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5M. S. Green, J. Chem. Phys. 20, 1281 (1952) ; 22, 398 (1954). 

6 N. Hashitsume, Prog. Theoret. Phys. (Kyoto) 8, 461 (1952). 


Certain of the methods used here have also been 
applied by other authors. In particular we mention the 
work of Kubo,’ Kohn and Luttinger,’ Lax,® and Block” 
on electrical conductivity, and of Mori! on transport 
phenomena in fluids. In addition Bergmann and Lebo- 
witz” have given a discussion of the steady state which 
in some respects is similar to ours. 


2. LIOUVILLE EQUATION INCLUDING 
EXTERNAL FORCES 

We consider first a classical system, surrounded by 
and interacting with external reservoirs and mechanical 
devices; the system and surroundings collectively will 
on occasion be referred to as the universe. We suppose 
the surroundings to be large compared to the system 
so that an approximate steady state can be attained in 
which variables referring to the system change slowly. 
We assume the Hamiltonian for the universe to be 
separated as 


H,=H+H,+V, (1) 


where H and H, depend on variables referring only to 
the system and surroundings, respectively, and the 
energy of interaction V is independent of the momentum 
variables. A Gibbs ensemble for the universe is described 
by a distribution function p satisfying the Liouville 
equation 

Op/dt+ {p,H7.} =(0), (2) 


where the curly brackets represent the Poisson bracket. 
An ensemble for the universe entails one for the system 
and surroundings separately, according to 


j= fo, = f deo. (3) 


Here the integrations are taken over that part of phase 
space referring to the surroundings and to the system, 


7R. Kubo, J. Phys. Soc. (Japan) 12, 570 (1957) ; Kubo, Yokota, 
and Nakajima, J. Phys. Soc. (Japan) 12, 1203 (1957). 

8 W. Kohn and J. M. Luttinger, Phys. Rev. 108, 590 (1957); 
109, 1892 (1958). 

9M. Lax, Phys. Rev. 109, 1921 (1958). 

0B. Block, Ann. Phys. 6, 37 (1959). 

11H. Mori, J. Phys. Soc. (Japan) 11, 1029 (1956); Phys. Rev. 
112, 1829 (1958). 

2 P. G. Bergmann and J. L. 
(1955); Ann. Phys. 1, 1 (1957). 


Lebowitz, Phys. Rev. 99, 578 
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respectively, dy and dg being the corresponding 
elements of volume. If p is normalized, 


favaeo=1, (4) 


then it follows immediately that f and g are normalized, 


fae set, fave=t. (5) 


p= fg(1+x), (6) 


If we write 


then x is a measure of the statistical correlation between 
the system and surroundings. 

We now wish to obtain a Liouville equation for /. 
Integration of Eq. (2) over the phase space for the 
surroundings yields 


af/attt{ fA} + frost fare.) -0. (7) 


The integrand in the third term above is a divergence 
in the phase space of the surroundings; the integral 
then vanishes if p is assumed to vanish sufficiently 
rapidly in the remote regions of phase space. The last 
term may be transformed to give 


df/dt+{ f,H}+0/fF,/dp,=0, (8) 


where 


Fi = - favgatyav Ogi. (9) 


The g; and p; here are the coordinates and momenta of 
the system. We have used the summation convention 
that terms containing repeated indices are to be 
summed; the number of terms in the sum will be 
evident from the context. 

The F; are effective external forces representing the 
effect of the surroundings on the system. They are in 
general time-dependent, if the universe is not in 
equilibrium. However, if the surroundings are large 
compared to the system we expect that in some situ- 
ations the time dependence of the F; will be sufficiently 
slow that over long periods of time they may be treated 
as constant; we further assume that under such condi- 
tions the system will attain an approximately steady 
state in which its distribution function is substantially 
constant. 

If x=0, that is, if the system and surroundings are 
uncorrelated, the external forces are conservative. Their 
effect may then be described by adding a suitable term 
to the system Hamiltonian; this will simply lead to a 
modification of the equilibrium properties of the 
system. In general, however, 


OF, Op; +9, (10) 
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and the effect of the external forces cannot be included 
in the Hamiltonian formalism. We note that in such a 
case {= 1 is not a solution to the Liouville equation (8) ; 
as a consequence the principle of equal a priori proba- 
bilities is not valid, in the system phase space. 


3. STEADY-STATE SOLUTION TO THE 
LIOUVILLE EQUATION 


Our problem is now to obtain solutions to the 
Liouville equation (8), under the assumption that F; 
and f are nearly time-independent. We further assume 
the F; to be small in the sense that the steady-state 
solution is near equilibrium, that is, that 


where An is small and fo corresponds to an equilibrium 
ensemble. We first choose for fo the canonical ensemble, 


(12) 


fo=etF4 ; 


the grand canonical ensemble will be introduced below. 
Here B=1/kT where k is Boltzmann’s constant and T 
the Kelvin temperature, and @ is 1/kT times the 
Helmholtz function. 

The equation for the determination of An is 


dAn/0t+{An,H} —BF,0H/0p;+ OF ;/0p;=0, 


where terms containing the product of An and F; have 
been neglected. Rewrite this equation as 


0An/0t+ {An,H}=h, (13) 
where 


h=BF AH /dp,—dF/ dpi. (14) 


We solve Eq. (13) by introducing an adiabatic 
turning-on of the external forces. That is we imagine 
that in the remote past (4=—°%) the system is in 
equilibrium without interaction with the surroundings. 
The external forces are then slowly turned on (for 
example by the removal of adiabatic walls), reaching 
their full value at ‘=0. If the turning-on is sufficiently 
slow we expect that the system will at all times adjust 
itself to the instantaneous values of the external forces, 
reaching the desired steady state at /=0. Presumably 
the results will be independent of the particular manner 
in which the forces are increased; we will make the 
choice 

F(t) =e*'F,, 


h(t) =e*'h. (15) 


Here € is small, that is, 1/e is large compared to the 
time necessary for the system to attain a steady state. 

With the assumption (15) for /, a solution to Eq. 
(13) is 


t 
An( Pf) -{ ef hig ( Py pdt’, 


Here P denotes the position of a point in phase space 
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and P; is the position at time ¢ of that point which was 
at P initially. At =0 we have 


(16) 


0 
An( P) -{ eh (Py)dt’, 


and the steady-state ensemble is 


ji f ei Pedat | 


4. SPECIFICATION OF THE EXTERNAL FORCES 


(17) 


To proceed further it is necessary to specify the 
external forces, or /4o, in more detail. For this purpose 
we consider first the interaction of the system with a 
single reservoir. We suppose the forces Ff,” due to this 
reservoir to be such that the system can reach an 
equilibrium state described by a canonical distribution 


, = e(a-BrH) 
Jr=e x—B a. 


where 1/8, is the temperature of the reservoir. Substi- 
tuting this expression into the Liouville equation (8) 
(with only the forces F;” being included) we find the 
condition 

der) Fi" /0p;=0, 
or 


OF '/ 0p; =B8,F OH /0p;. (18) 


In the presence of more than one reservoir the quantity 
h of Eq. (14) is 
h=>. [BF "0H /0p,— OF; Op; |, 
which reduces with the help of Eq. (18) to 
h=>°-(8—B,)F;"0H/90p;. 
Now F,'0H/0p; is the rate at which energy is trans- 
ferred from the rth reservoir to the system. Denoting 
this quantity by q,, 
gr=F'0H/ dpi, 
we have 
h= > r(B—Br)qr- 
Consider next the interaction with an 
mechanical device. For concreteness we may imagine 
the system to be a fluid enclosed in the annular region 


external 


between two concentric cylinders. A relative rotation | 
of the two cylinders will produce a transport of mo- 


mentum across the fluid. Evidently for such mechanical 
devices one can choose the external forces to be mo- 
mentum independent, 

OF ;/dp.=0. (19) 


(However, the forces are still nonconservative and 
cannot be represented by a potential added to the 
Hamiltonian.) Their contribution to / is simply 


BF 0H /dp,= —BW, 
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where W is the rate at which work is done by the 
system. 

In the presence of both reservoirs and mechanical 
devices 


h=>).(8—B,)qr—BW. 


For any particular system, g, and W can be expressed 
as functions of the g; and p;; the specification of h is 
then complete. 

Maintenance of a steady state requires that, on the 
average, the energy of the system be constant, or 


(20) 


> g7)—(W)=0, 


where the angular brackets denote an average taken 
with respect to the steady state ensemble. Thus 


(h)= —- >, B(qr). 
This is just 1/& times the entropy production o familiar 
in nonequilibrium thermodynamics," 


(h\=o/k. 


The formalism has so far been restricted to closed 
systems. For the treatment of such phenomena as 
electrical conduction and diffusion it is necessary to 
consider open systems, able to interchange particles 
with the environment. We will defer the discussion of 
open systems, as they are somewhat more easily handled 
in the quantum theory. It will be shown below that the 
quantity / for an open system is 


h=>>.(8—Br)qr—BW — > -(Bu—Brtr) jr (21) 


Here /, is the rate at which particles are added to the 
system from the rth reservoir; uw and u, are the chemical 
potentials for the system and rth reservoir. The average 
of h is 

(h)= —> + BAgr)—->d r(Bu—B rr) (jr), 


or, for the steady state, 


and 
(h) > ->, B<qr) +o + Britr{ jr). 


This is again 1/k times the entropy production o. 


5. LINEAR RELATIONS BETWEEN 
FLUXES AND FORCES 


The quantity / can be written in the form 


h ™ (1 k)> a ahd 


(22) 


where the XY, are thermodynamic forces representing 
the deviation from equilibrium, and the /,(P) are phase 
functions whose averages are the thermodynamic fluxes. 
Combining Eqs. (17) and (22) gives for the steady 
state ensemble 


fond 
f{(P) jr) 4 ee | 
Bk a 


8S. R. De Groot, Thermodynamics of Irreversible 
(Interscience Publishers, Inc., New York, 1952). 
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With the help of this ensemble we find for the thermo- 
dynamic fluxes 


1 0 
(Js)= (210) 1+ x. f e"'Jq( pat}) 


where ( )o denotes an average taken with respect to 
the equilibrium ensemble fo. Since the fluxes vanish in 
equilibrium, 

0 


1 
(Is)=-¥ Xe f e'dt(Js(P)Ja(P,))o. 
ka ‘ 


—2 


(23) 


With Eq. (23) we have derived the familiar linear 
relations between fluxes and forces, 


(Js) - Lie LgaX a, 


with the transport coefficients being given by 


(24) 


0 


1 
Lpa=- f e(Is(P)Ja(P,))odl. 
k 


—® 


(25) 


Onsager’s theorem" is an immediate consequence of 
Eq. (25) and the reversibility of the mechanical 
equations of motion. 

The positive-definite nature of the entropy production 
is readily demonstrated. Consider the inequality 


0 2 
1=<| f es(P at] ) >0. 
—® 0 


It follows from invariance under a translation of the 
origin of time that 


Ja(P)Ja(Pr))o=(Ja(P)Ja(Pr—s))o, 


and therefore 


0 0 
I -f ent f edt’ Ja(P)Ja(Pr—t))o 


0 —t 
-f eva f e**+9ds(Ja(P)Ja(Ps))o. 


An integration by parts yields 


1 0 
[= f e( J q(P)Ja(P1))odl. 
€ 


6) 


Consequently, 

Lea= el /k20. 
Thus the diagonal elements of Z are non-negative, in 
any representation. This has the result that the entropy 
production, which can be written 


= >» X alapX By 
is non-negative. 


McLENNAN, 


JR. 


6. NONSTEADY PHENOMENA 


The above formalism is readily modified to include 
slowly varying time-dependent phenomena. Supposing 
the external forces to depend on the time, we consider 
a single Fourier component. Including also the adiabatic 
factor, we have 

X a(t) = elt OlX ao. 
The frequency w must be assumed to be sufficiently 
small that the system can at all times adjust itself to 
the instantaneous values of the external forces, that is 
1/w must be large compared to the relaxation times 
characteristic of the system. As in the previous discus- 
sion, we then find the ensemble 


1 0 
f(P) = fo + es Xan f aero (Pat 


—2 


The linear relations between fluxes and forces are 


(Js)=Die Lga(w)X a; 
with the frequency-dependent transport coefficients 


being 


1 0 
Leeta)=— f etOXT5(P)Ja(Pr))odt. (26) 


7. QUANTUM THEORY 


We consider next the modifications necessary for 
quantum mechanical systems. An ensemble for the 
universe is described by the density matrix p, satisfying 
the Liouville equation 


1 
dp/dt+—[p,H,, |=0. 
ih 


Let gq’ and Q’ be the eigenvalues of a complete set of 
commuting variables referring to the system and 
surroundings, respectively. Then the density matrix f 
for the system is given by 


g'| fla =Ledg',0'|e|9",0'). 


Assuming the Hamiltonian operator for the universe 
to be separated as in Eq. (1), we find the Liouville 
equation for f 


1 1 
0 f/dt+—(f,H ]+— Ts[,V ]=0. 
th ih 


Here ‘T's denotes a trace taken over the eigenvalues of 
variables referring to the surroundings, that is 
(q'|TsA |9q”)=Leq’,0"|A19",0’). 
If we define X by 
p= fX, 
we have 


1 1 
af/ai+—[f,H]+—Ts[fX,V]=0. (27) 
th ih 





STATISTICAL 


Proceeding now as before we assume f to be close to 
an equilibrium ensemble 


f= fol +n). 
In order to treat open systems we choose for fy a grand 
canonical ensemble, 


H+ 
exB8 Bus. 


where yu is the chemical potential and » the number of 
particles. (For simplicity we consider systems con- 
taining one type of particle only; the generalization to 
a variety of particles is immediate.) We then find the 
equation for An 


1 
sana” aaa =h, 
th 


(28) 
where to lowest order 


1 
h=— — Ts[ foX,V]. 
thfo 
Equation (28) can be solved as before; we find 


0 


An -{ e“h(t)dl, 


—x 


where /(/) is the Heisenberg representation operator 
h(t) =e MA tye CUM) At, 


To relate h to the thermodynamic forces, consider as 
before the interaction with a single reservoir (thermal 
or particle). Suppose this interaction to be such that 
the equilibrium state of the system is described by the 
grand canonical ensemble. 


f. =e BrH+Bruy n (29) 


Here 1/k@, is the temperature of the rth reservoir and 
uy its chemical potential. Including in Eq. (27) the 
interaction with the rth reservoir only, and requiring 
that the ensemble (29) be a solution to this equation, 
we find the condition 


Ts[ foX,,V,]=0. 
To first order in B—£, and u—u,, 
fo= [1 + (8-—8)H— (8,4r—Bxu) |f,, 


and, with the help of Eq. (30), / is reduced to 


(30) 


1 
k= — —¥THYS-X.Ve] 
th fo r 
1 
=— ——Y{(8,—8) Ts[H,V-]f-X; 
th fo * . 
— (B,4r— Bp) TsLn, V, lf-X} . 


Now [H,V,]/ih is the rate of change of the system 
energy, due to the interaction with the rth reservoir; 
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that is, it is the flux of energy into the system from the 
rth reservoir. Similarly [,V,]/ih is the flux of particles 
into the system from the rth reservoir. While such 
fluxes presumably do not commute with H/, n, or f,, 
we may in the present context neglect the effects of the 
uncertainty principle as applied to H, n, and the fluxes. 
One may in fact replace these quantities by commuting 
“macroscopic variables,” as discussed by von Neu- 
mann" and van Kampen." Then 


1 
h =—=— h “ pi cm (8,—B) TsLH,V,]X, 
wjy 7 ’ 
eb (8,M@r—By) TsLn,V, |X r}y 
or, putting f,= fo in lowest order, 


h= > (8—-B,)gr—- (8u—Brur) jr], 


where 


1 1 
qQ= -- Ts[H, V, |X, d= nes Ts[_n, VIX, 


th ih 
Finally consider the interaction with an external 
mechanical device. In the classical treatment we 
characterized such devices by the equation 
aF,/dp;=0, 


which, on taking into account Eq. (9), may be written 


fauxn =0, 


X= (1+ x)g. 
The quantum mechanical analog of Eq. (31) is 


Ts[X,V]=0. 


(31) 


where 


The contribution to / of a mechanical device is then 
h=—(1/th fo) Ts[ fo,V JX. 


Assuming » to be a constant of the motion in the 
presence of the mechanical device, and further assuming 
the flux [H,V ]/ih to commute with H, we find 


h=—BW, 


where W is the rate at which work is done by the 
system, 
W=—(1/ih) TslH,V JX. 

In the presence of both reservoirs and mechanical 

devices 
h=% 1 (8—B,)qr— (Bu—Brutr) jr |— BW. 

The derivation of the linear relations between fluxes 
and forces now proceeds as before. 

4 J. von Neumann, Mathematical Foundations of Quantum 
Mechanics (Princeton University Press, Princeton, New Jersey, 


1955). 
16N. G. van Kampen, Physica 20, 603 (1954). 
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Capture in Supernovae* 
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Calculations have been carried out, following the method of Burbidge, Burbidge, Fowler, and Hoyle, 
for the abundances of nuclei in the rare-earth region which are produced in the rapid neutron-capture 
process thought to occur in supernovae. The recently available rare-earth mass differences of Johnson and 
Bhanot were employed. The calculated abundances agree, in general, with those given by Suess and Urey. 
The results of the computations support the work of Burbidge, Burbidge, Fowler, and Hoyle which showed 
the effect of spheroidal deformation above the closed shell at V =126 in enhancing the production of Th”, 
U5, U8, Cf, etc., in supernovae. The effect of different combinations of temperature and neutron density 
in enhancing certain relative abundances is discussed briefly. 


INTRODUCTION 


URBIDGE, Burbidge, Fowler, and Hoyle! (BBFH) 

have given a detailed analysis of element synthesis 
involving a variety of processes believed to take place 
in stars. General agreement was found between com- 
puted abundances and those given in the survey of 
Suess and Urey? (SU). According to BBFH, the heavy 
nuclei beyond the iron group are thought to be produced 
mainly by neutron capture, two distinct processes being 
involved which differ widely in respect to the time scale 
and neutron flux involved. The slow (s) process, 
characterized by a low neutron flux, is believed to take 
place in second-generation red giant stars on a time 
scale such that 10? to 10‘ years may elapse between 
successive neutron captures in a single chain. (By 
“second generation” we imply only that the star 
already contains elements up to the iron group which 
were produced in the first-generation stars of the 
galaxy.) The rapid (7) process, on the other hand, is 
thought to take place during the initial phases of the 
supernova-type explosion of a second-generation star. 
In the first 10? to 10° seconds of a supernova event, an 
intense flux of neutrons is assumed to be present. 
BBFH estimate that a flux of ~ 10” neutrons per cm? 
per second at a temperature of ~10° degrees Kelvin 
occurred in the supernova event which gave birth to 
the solar system abundances. The origin of the neutrons 
is considered to be a-particle capture by certain nuclei, 
such as Ne”, which had previously been produced in 
the NeNa-cycle in the hydrogen-burning core of the 
second-generation star while it was still on the main 
sequence, or alternatively in the hydrogen-burning shell 
during the red-giant stage. The r process is pictured as 
a competition between two types of event: on the one 
hand, rapidly occurring successive captures of neutrons, 


* Supported in part by the joint program of the Office of Naval 
Research and the U. S. Atomic Energy Commission. 

t On leave from the University of Illinois. Partially supported 
by a grant from the Guggenheim Foundation. 

! Burbidge, Burbidge, Fowler, and Hoyle, Revs. Modern Phys. 
29, 547 (1957). This paper is referred to hereinafter as BBFH. 

2H. E. Suess and H. C. Urey, Revs. Modern Phys. 28, 53 
(1956). This paper is referred to hereinafter as SU. 


and, on the other hand, photoejection of the neutrons 
in the intense bath of heat radiation present during the 
explosion. Under these conditions, successive neutron 
captures produce neutron-rich nuclei far to the electron- 
unstable side of the stable isobars. As the neutron flux 
dies down, the neutron-rich nuclei decay back to the 
stability region through repeated negative beta decays. 

The rapid process, together with other nuclear 
reactions involved in stellar nuclear synthesis, is 
described in detail in the review article by BBFH. 
Since the writing of that paper, accurate neutron 
binding energies have become available for the stable 
nuclei in the range of atomic weights from 140 to 200, 
chiefly through mass-spectroscopic measurements by 
Johnson and Bhanot.’ Accordingly, the results of these 
two authors have been employed in the present work 
in order to determine the r-process progenitors of the 
elements in the rare-earth region and their abundances. 


DETAILS OF THE CALCULATION 


BBFH have given the critical neutron-binding energy 
in Mev, above which neutron capture will occur on the 
average, as 

Ts 
Cee (34.07+-3 logTy—logn,), (1) 
5.04 


where 7 is the temperature in units of 10° degrees 
Kelvin, 7, is the density of neutrons, and the logarithms 
are to the base 10. For Ty=1 and n,=10*/cm', 
QO, 22.0 Mev. In the present instance, a better fit with 
the rare-earth abundances of SU is obtained with the 
choice Q,21.6 Mev. This entails a slightly altered 
combination for the adjustable parameters Ty and ,, 
€.g., M,=10%, T,=0.8. The determination of the 
neutron-capture path was carried out in the manner 
outlined by BBFH. In this treatment, a modified form 
of the Weizsicker mass formula is used which takes 
into account such effects as those arising from closure 
of nuclear shells and from spheroidal deformation of 

3W. H. Johnson, Jr., and V. B. Bhanot, Phys. Rev. 107, 1669 
(1957). 
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RAPID 


the nuclear core between neutron shell closure at 82 
and 126 neutrons, and beyond 126 neutrons. For a 
given neutron-rich nucleus, the neutron-binding energy 
is obtained from those known for the stable nuclei 
having the same number of neutrons. The assumption 
is made that neutron shell effects are a function only of 
the neutron number .V, and that proton shell effects are 
a function only of the proton number Z. On this basis, 
it can be shown that the binding energy for the next 
added neutron to a nucleus (A,Z) is given by 


On(A,Z)~LOn(A £0) —48Z2/A 2] 
+482?/A?—0.41, (2) 


where the subscript e pertains to a nucleus with known 
Q,, and @ is the coefficient of the isotopic asymmetry 
term in the Weizsicker formula. The relation .V =A —Z 
=A,—Z, must hold. The last term in (2) is an approxi- 
mation to various small terms which have been evalu- 
ated numerically. A knowledge of Q,,(A,Z) permits the 
determination of the rapid-capture path once a critical 
Q, for capture has been determined through the 
application of Eq. (1). 

When equilibrium between the (n,y) and (y,n) 
processes is reached, further progress along the r-process 
path is made only through negative beta decay. Thus, 
the abundance of a nucleus produced by the r process 
will depend linearly on the beta-decay lifetime of its 
progenitor situated on the r-process path. This lifetime 
depends in turn on the energy available for the decay 
event. The beta-decay energies of the highly unstable 
progenitors are found by considering the mass parabolas 


NEUTRON CAPTURE 


IN SUPERNOVAE 


of isobaric sequences, according to the formula 


A odd, Z odd 


(+9, 
bs—2—-2.5)F05 


| —364, A even, Z even 


(3) 


with all energies in Mev. Here Z, is the charge of the 
hypothetically most stable nucleus in the isobaric 
sequence, and By, is the curvature of the parabola at 
the vertex. Values for Z4 were determined empirically 
from known beta-decay energies. B4 values given by 
BBFH were employed. The term 364 is the pairing 
energy described in the same paper. The term 2.5B, 
replaces the customary term 0.584 because the allowed 
beta-decay transitions will occur in general to a low 
excited state with excitation of the order of 2B,4, rather 
than to the ground state of the residual nucleus. As 
before, the mean lifetime 7s for beta decay, and the 
atomic abundance n(A) are given by 


ra= (10'/W°) sec, (4) 
and 
n(A)=const(rs/AA), (5) 


where AA is the number of neutron captures made by 
the isobar of weight A before the next beta decay occurs. 
Expression (5) assumes that all of the material produced 
at a given mass number A at which beta decay occurs 
is spread out, at the termination of the supernova event, 
over an interval up to the next mass number, A+A4A, 
at which beta decay occurs. The numerical quantity 
in (4) is a very rough approximation for allowed beta- 
transitions over an extended range of mass numbers. 
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Fic. 1. The average excess neutron binding energy to nuclei with neutron number N, over that given by the smooth Weizsicker atomic 
mass formula. Each value has been normalized to the value at the beginning of the shell in which WN lies. 
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Relative abundances do not depend on the exact value 
of this constant. 

As in the original article of BBFH, the atomic 
abundances (A) have been normalized to those of SU, 
taking the number of Te'®* atoms as 1.48, which is in 
turn normalized to a standard value of 10° for silicon. 
However, we have multiplied the computed n(A) by a 
factor of 2 in order to take into account the effect of 
cycling after spontaneous fission above A = 260. Because 
the fission fragments peak at A~140, the effect of 
cycling is such that the flow of nuclei above A = 140 is 
twice as great as in the range of A from 110 to 140. 
This was not done in BBFH. 

Figure 1 shows, for the next added neutron, the 
manner in which the binding energy excess over that 
given by the usual Weizsicker formula increases 
experimentally with neutron number after shell closure 
at 20, 50, 82, and 126 neutrons for known nuclei. The 
curve for the region between 82 and 126 neutrons is 
deduced from the recent measurements of Johnson and 
Bhanot.’ In the other regions, the masses of Huizenga* 
and Wapstra® have been used. The N= 20 to 50 shell 
also exhibits the stability associated with the subshell 
at N=28. The N=50 to 82 shell shows the normal 
increase in stability within the shell as N increases, 
unperturbed by quadrupole spheroidal deformations. 
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Fic. 2. Abundances of nuclei in the region of the rare earths 
which are produced in the r process. The empirical points are 
taken from SU. The histogram is a curve calculated by the 
method more fully described in the work of BBFH.' The abun- 
dances are normalized to silicon taken as 10° atoms. The heavy 
line depicts the conditions believed to be responsible for the solar 
system abundances of these nuclei (Q, critical=1.6 Mev).The 
light line shows the effect of increasing the temperature to 1.0X 10° 
degrees (Q, critical=2.0 Mev). Both situations are normalized 
to show a common abundance for an element like tellurium (not 
included in the diagram). 

4 J. R. Huizenga, Physica 21, 410 (1955). 

5 A. H. Wapstra, Physica 21, 367, 387 (1955). 
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Such deformations are known to be quite small in this 
shell. On the other hand, both the V=82 to 126 and 
the N=126 to 184 shells exhibit marked effects due to 
the large spheroidal deformations which are known to 
occur approximately midway in these shells. Large 
quadrupole moments are observed for nuclei in these 
regions. 

It should be mentioned that the exact shapes of 
curves such as are given in Fig. 1 depend somewhat on 
the choice of the parameter 8 in the Weizsicker formula. 
It was found, however, that the general character of 
the curves is preserved for not too large changes in 8 
from the value of 22.6 Mev, given in BBFH. 


RESULTS 
Rare-Earth Abundances 


The results of the calculations for the rare-earth 
region are shown in Table I. The critical value 0,= 1.6 
Mev was chosen as the best fit to the SU abundance 
distribution. It is of interest to note that the sum of the 
decay times over the entire region of the rare-earth 
progenitors (A~141 to 178) is about 1 second as 
compared with a total of about 80 seconds for one 
cycle of the entire r path.' Thus, the progenitors of the 
rare earths in one cycle build up a population of roughly 
1% of all the r-process nuclei. Figure 2 displays a 
histogram of the calculated abundance distribution in 
the region of interest. Following the practice of BBFH, 
we present the calculated results in the form of a 
histogram rather than a smooth curve. In this way, the 
order of magnitude, but not the exact nature of fluctu- 
ations due to odd-even effects, etc., is indicated. SU 
abundances are indicated by the plotted points. The 
solid circles (even A) and crosses (odd A) represent 
neutron-rich s/adle nuclei, that is, they exist as stable 


TABLE I. The path of the r-process and abundances 
in the rare-earth region.* 








; : Ba 454 Ws 7B Atomic 
N A Z ZA Mev Mev’ Mev sec AA abundance 
92 141 49 60.0 1.29 --- 11.47 0.0504 5 0.056 
96 146 SO 60.8 1.28 0.98 10.14 0.0933 3 0.172 
98 149 51 61.9 1.27 +--+ 11.17 0.0575 3 0.106 
100 152 52 62.9 1.26 0.93 10.15 0.0928 3 0.172 
102 155 53 63.9 1.25 --- 11.00 0.0621 3 0.114 
104 158 54 64.9 1.24 0.89 10.03 0.0985 3 0.182 
106 161 55 65.9 1.23 --- 1083 0.0671 3 0.124 
108 164 56 67.0 1.22 0.86 10.01 0.0995 3 0.184 
110 167 57 68.1 1.22 +--+ 10.99 0.0624 3 0.115 
112 170 58 69.2 1.21 0.83 10.20 0.0906 3 0.164 
114 173 59 70.6 1.21 +--+ 11.51 0.0495 5 0.0686 
118 178 60 72.2 1.19 0.80 11.24 0.0557 9 0.0340 
126 187 61 75.6 1.18 +--+ 14.78 0.0142 1 0.0786 
188 62 75.8 1.17 0.78 12.94 0.0276 1 0.153 
189 63 763 1.17 +--+ 13.14 0.0255 1 0.141 
190 64 76.7 1.17 0.78 11.65 0.0466 1 0.258 
191 65 77.0 1.16 +++ 11.52 0.0493 1 0.274 
192 66 77.3 1.16 0.77 9.94 0.103 1 0.570 
193 67 77.7 1.16 +++ 10.01 0.0995 1 0.552 


* Silicon taken as 10* atoms. 
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Fic. 3. Capture paths in the r process for different choices of critical neutron-binding energy. 
The symbol L denotes a lifetime longer than the time scale of the s process. 


nuclei in nature, but are off the s-capture “path,” and 
so are produced only in the r process. The open circles 
(even A) and plus-signs (odd A) represent nuclei 
produced predominantly by the r process. The s process 
contributes very little in this range of elements owing 
to the large size of the neutron capture cross sections of 
these nuclei. The most probable path of the r process 
for 0,21.6 Mev is shown, along with that for the s 
process, in Fig. 3. 

The effect of shell closure and spheroidal deformation 
on the abundance distribution may be seen by com- 
paring Figs. 1 and 2. Large abundance will be associated 
with stability against neutron capture, that is, with 
low neutron capture energies; similarly, small abun- 
dances will be associated with high neutron capture 
energies. Immediately following shell closure at V=82, 
there is a rapid rise in neutron binding energy until 
N~100. This produces the minimum in n(A) near 


A=145. From N=100 to 114 there is a minimum in 
neutron binding energy, producing the maximum in 


abundances between A = 146 and 174. From V=114 to 
126, there is again a rapid rise in binding energy, 
resulting in another abundance minimum between 
A=174 and 186. The rise in (A) beyond this point is 
because of shell closure at 126 neutrons. The behavior 
between 82 and 126 neutrons is quite similar to that 
found in the work of BBFH for nuclei containing more 


than 126 neutrons, and supports their view of the 
enhanced production of Th, U*, U8, Cf, etc., 
because of such deformation beyond N= 126. 


Effect of T, and n, on the Capture Path 


Equation (1) is illustrated in Fig. 4 with Ty plotted 
against the critical neutron-binding energy Q, for 
several values of the neutron density ,. Clearly, the 
effect on Q, of elevating the temperature, meanwhile 
holding the neutron density fixed, is similar to that 
obtained by decreasing , but keeping 7» constant. 
The light line in Fig. 2 is the result of a choice of 2.0 
Mev for Q,. If m, is taken as 10% neutrons/cm’, it 
represents a temperature of 1.0X10° degrees Kelvin, 
as compared with 0.8 X 10° for the heavy line (for which 
Q,=1.6 Mev). Alternatively, both abundance curves 
could represent a common temperature but differing 
conditions of neutron density. Physically, this is not 
difficult to understand. An increase in the temperature 
increases the number of high-energy quanta, thereby 
aiding photoejection of neutrons. Similarly, decreasing 
the neutron density, meanwhile holding the temperature 
fixed, decreases the rate of neutron capture, thus 
providing less competition with photoejection. The 
critical value of Q, above which neutron capture 
proceeds would be elevated in both cases. 
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Fic. 4. Temperature against critical neutron-binding energy for 
several values of the neutron density. 


Referring again to Fig. 3, the rapid-capture paths for 
Q,=1.6 Mev and for Q,=2.0 Mev may be compared. 
On the average, the path Q,,=2.0 Mev passes through 
neutron-rich nuclei for which Z—Zz, is less than for 
those along the path Q,=1.6 Mev. Assuming that the 
decay energy Ws is linearly proportional to Z—Z., and 
that the mean lifetime varies as Wg~*, an appreciable 
differerice in abundances can be expected between the 
two cases. For further comparison, the path appropriate 
to 0,=3.0 Mev (Ty~1.5 at n,=10") is also shown 
dashed in the figure. 
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Points Requiring Further Investigation 


An abstract of this material was presented at the 
Los Angeles meeting of the American Physical Society 
in December, 1958. At that time, attention was called 
to the possible disagreement for the rare-earth abun- 
dances in the region of A=165 to 175. We still find 
this effect in the present calculations using the best 
choice of Q,, namely, 1.6 Mev. This is seen in Fig. 2, 
where the calculated histogram lies above the SU 
points in the range A4=165 to 175. However, we 
hesitate to regard the deviation as significant since the 
shape of the histogram in this region depends on the 
exact location of the valley in the excess neutron 
binding energy near .V=112 (see Fig. 1). This depends 
both on the accuracy of the empirical mass differences 
and on the choice of the constant 8 employed in the 
calculations. Further, the exact location of the abun- 
dance minimum near A = 185 of the histogram in Fig. 2 
depends to some extent on the choice of the critical 
neutron binding energy. Finally, a better fit in the region 
A= 140 to 185 could probably be obtained by lowering 
the calculated curve, i.e., by changing the normalization 
procedure or by ignoring the factor of two arising from 
cycling after fission. All of these details are being 
investigated further. 

Other details deserving of additional study are 
improved modification of the mass formula and possible 
inaccuracies which become pronounced for large dis- 
placements from the stability region. In particular, it 
will be important to investigate the effect of retaining 
a surfaceXisotopic asymmetry term in the smooth 
Weizsicker formula. 
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The nuclear spin thermal relaxation time, 7), of He*® nuclei has been measured between 1.2° and 3.0° in 
pure liquid He’ under its saturated vapor pressure. The measured times are those characteristic of the bulk 
liquid. 7; is 300 seconds at 1.2° and increases to 550 seconds at 3.0°. These results are in fair agreement with 
the Bloembergen, Purcell, and Pound theory. For a 12% solution of He* in He‘ under its saturated vapor 
pressure, 7; is about 2000 seconds and does not show any sharp change at the lambda point. In pure He’ gas 
at 4.2°K and 1000-mm pressure, 7; is at least 1000 seconds. In dilute solutions at 1.25°K, 7, is at least 
90 minutes for a 3.5% solution and at least 120 minutes for a 1.7°% solution. 


HE nuclear spin thermal relaxation time, denoted 

by 7), has been measured in pure liquid He? 
under its saturated vapor pressure, between 1.2° and 
3.0°K. The measured relaxation times in this experi- 
ment appear to be approximately those characteristic 
of the bulk liquid; these measurements do not appear 
to have been significantly affected by relaxation occur- 
ring at the walls. 

Early observations made in this laboratory!” on the 
nuclear resonance in liquid He* were carried out with 
the primary goal of measuring the nuclear magnetic 
susceptibility. Attempts to measure simultaneously the 
value of 7; were frustrated by the discovery that the 
measured 7°; depended markedly on the container used. 
The conclusion was drawn’ that in at least some of the 
containers the relaxation process was being artificially 
enhanced by processes occurring at the walls, and that 
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Fic. 1. Spin relaxation time, 7), in pure liquid He* under its 
saturated vapor pressure. Circles represent measurements in small 
bulb (bulb A). Triangles represent measurements in large bulb 
(bulb B). Dashed line is the BPP expression, calculated from 
density and viscosity data. 
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1 Fairbank, Ard, Dehmelt, Gordy, and Williams, Phys. Rev. 92, 
208 (1953). 

? Fairbank, Ard, and Walters, Phys. Rev. 95, 566 (1954). 

3 Fairbank, Ard, and Walters, Conférence de Physique des Basses 
Témperatures, Paris, 1955 (Centre National de la Recherche 
Scientifique and UNESCO, Paris, 1956), p. 14. 


the true 7), which would be measured in a very large 
sample, was greater than or equal to the longest ob- 
served time. It has been realized since the work of 
Bloembergen, Purcell, and Pound‘ that 7, measure- 
ments can give important information about atomic 
motions in liquids. Because of the great interest in the 
properties of liquid He’, the experiments reported here 
were begun with the specific goal of determining the 
value of 7, characteristic of the bulk liquid. 

One approach to the problem of measuring 7; without 
complicating wall effects would be to use a container 
which is so large that in a time comparable to 7; most 
atoms do not have time to diffuse to the walls, and thus 
will not be relaxed by the walls even if the walls can 
relax He*® atoms with 100% efficiency. However, the 
diffusion data of Garwin and Reich®:* show that unless 
a sample volume greater than several hundred mm* is 
used, most of the atoms will diffuse to the walls within 
a few hundred seconds, and it has previously been 
shown! that 7; is at least 200 seconds at 1.2°. The 
alternative approach used here was to make 7) meas- 
urements in containers of various geometries but made 
of the same material. In this case, if approximately the 
same 7 is observed in all containers, then one can be 
reasonably confident that 7, has not been artificially 
shortened by wall relaxation processes. Pyrex containers 
were used since the longest previously measured relaxa- 
tion times! had been obtained in Pyrex bulbs. 

In each run the sample, stored at room temperature, 
was passed through a purifying trap cooled by liquid 
helium and condensed into a bulb blown in the end of 
a long Pyrex capillary which extended down into the 
helium Dewar. The nuclear magnetic resonance absorp- 
tion signals were observed in a field of about 9000 gauss. 
The observed line widths were in all cases determined 
by the inhomogeneity (approximately 0.1 gauss) of the 
external magnetic field. 7, was measured by the 
“direct” method of saturating the nuclear spin system 
with a strong rf field, then decreasing the rf field 
strength to a value low enough to cause no appreciable 

‘ Bloembergen, Purcell, and Pound, Phys. Rev. 73, 679 (1948) 
(hereafter referred to as BPP). 

5R. L. Garwin and H. A. Reich, Bull. Am. Phys. Soc. 3, 133 
(1958). 


6 R. L. Garwin and H. A. Reich, Phys. Rev. (to be published). 
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Fic. 2. Spin relaxation time, 7), in a 12.3% solution of He’ in 
He‘, liquid under its saturated vapor pressure. Circles represent 
measurements in small bulb (bulb A). Triangles represent meas- 
urements in large bulb (bulb B). The arrow indicates the \ point 
of the solution. 


saturation, and then watching the amplitude of the 
signal recover toward its equilibrium value. The meas- 
ured 7; is the time constant characteristic of this 
exponential recovery. 

Figure 1 shows the experimental results for pure He’ 
in two different containers. Both containers were made 
of Pyrex as described. Bulb A was approximately 
spherical, with inside diameter approximately 5 mm. 
The measurements on bulb A were made with the bulb 
full of liquid. Bulb B was approximately spherical, with 
inside diameter approximately 10 mm. The measure- 
ments on bulb B were made with the bulb approxi- 
mately % full of liquid. Each point in Fig. 1 represents 
one observation of the recovery of the resonance signal. 
The error flags indicate roughly the uncertainty re- 
sulting from trying to fit exponentials with various 
time constants to the experimental plots of signal 
voltage vs time. 

Figure 1 also shows the results of applying to He’ the 
BPP theory. The expression used was 


T\=SkT/(6n*y'h*nN), 


which is one form of the BPP expression for ordinary 
liquids,’ with the coefficient slightly modified according 
to the work of Kubo and Tomita® and Torrey.*:” Values 
of N, the number of atoms per cm’, were taken from 
the work of Kerr,'' and values of 9, the viscosity coeffi- 
cient, from Zinov’eva.'* The experimental results are 
clearly in good agreement with the BPP result, as far 
as the order of magnitude of 7; and the sign of the tem- 
perature dependence are concerned. 


7E.g., E. R. Andrew, Nuclear Magnetic Resonance (Cambridge 
University Press, Cambridge, 1955), p. 117. 

® R. Kubo and K. Tomita, J. Phys. Soc. Japan 9, 888 (1954). 

9H. C. Torrey, Phys. Rev. 92, 962 (1953). 

1H. C. Torrey, Suppl. Nuovo cimento 9, 95 (1958). 

1 E. C. Kerr, Phys. Rev. 96, 551 (1954). 

2K. N. Zinov’eva, J. Exptl. Theoret. Phys. (U.S.S.R.) 34, 609 
(1958) [translation: Soviet Phys. JETP 34 (7), 421 (1958) ]. 
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T; measurements on pure liquid He® have also been 
made by Garwin, Kan, and Reich,*" and also by 
Careri, Modena, and Santini."* The results obtained by 
both of these groups disagree with those reported here. 

A preliminary measurement in He’ gas at 4.2°K and 
1000-mm pressure yielded a 7; of about 1000 seconds. 
This measurement has not yet been repeated in a larger 
container to see if wall effects influenced the result. 
Bloembergen’® has given a theoretical expression for 7; 
which should be applicable to a monatomic gas such as 
He’. For the stated experimental conditions, Bloem- 
bergen’s formula yields 7;5{5000 seconds, if the dis- 
tance over which the interaction between two atoms is 
approximately constant is taken as 2.5 A. Theagreement 
with the experimental result is reasonably satisfactory, 
since the theoretical formula is sensitive to the value 
chosen for this distance. 

Some measurements on the relaxation time of He? 
nuclei in He*-He* solutions have also been made, in 
addition to those previously reported'® which showed 
that in a 3.5 mole % solution, 7; was at least 90 minutes 
at 1.25°K and in a 1.7% solution at least 120 minutes. 
These measurements on the 3.5% and 1.7% solutions 
were made in a Pyrex bulb of approximately 7-mm in- 
side diameter, and have not been repeated in larger con- 
tainers. Figure 2 gives results for a 12.3 mole % solution 
of He*® as a function of temperature. Measurements 
were made in bulbs A and B, with the bulb full of 
solution in each case. If wall relaxation processes 
dominated, one would expect the markedly different 
geometries to give very different 7;’s. Hence, although 
the relaxation times measured in bulb B are consistently 
longer than those measured in A, the bulk relaxation 
times for this solution are probably not much longer 
than those shown. The striking feature of these results 
is the absence of any marked change as the temperature 
is changed through the \ point (approximately 2.0°). 
The diffusion coefficient of the He’ nuclei, to which 7; 
is closely related in the BPP theory, is known®® to 
change by several orders of magnitude as the tempera- 
ture of a solution is changed through the A point. 
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The fourth virial coefficient D and its components D;, D2, D; for molecules interacting with a square-well 
potential have been obtained as functions of the temperature by the use of Fourier transformation and the 
addition theorem of Bessel functions. Each component and their sum are nearly constant above the Boyle 
temperature and tend to negative infinity as temperature tend to zero. The total D is expressed by 

D/b® =0.28642 + 1.5397f—23.554f?+53.645/3 +-69.859/4— 170.01/5— 14.777/8, 


where f=e**?—1, and well width is equal to the hard core diameter. Corrections of the critical data and 


some related problems are discussed. 


1. INTRODUCTION 


HE isotherm of the imperfect gas is expressed by 
the virial series 


pvo/kT =1+ Bu!+Cv?+ Dv + ---, (1.1) 


where p, v, and T are the pressure, the volume, and the 
temperature, respectively. B, C, and D--- are the 
second, the third, and the fourth virial coefficients, 
respectively. Mayer’s theory! of the imperfect gas shows 
that the &th virial coefficient can be obtained from the 
intermolecular potential by evaluating the 3-dimen- 
sional integral whose integrand is composed of irre- 
ducible cluster graphs. 

Virial coefficients already calculated are as follows: 
Hard sphere up to the fourth (Boltzmann,’? Happel,’ 
Majumdar‘), square well potential up to the third 
(Kihara®), Lennard-Jones potential up to the third 
(Kihara®), etc. These are discussed in detail by Kihara® 
and by Hirschfelder.’ 

The fourth virial coefficient is often used as a criterion 
for the validity and the self-consistency of an approxi- 
mation used in the distribution-function method.*-” 
The only exact value of D known is that of the hard 
sphere, and we do not know how D depends on the 
temperature when attractive forces exist. The reason 
for this difficulty is that the method of carrying out 
the multiple integration for the configuration in Fig. 
1(c) has not yet been found. This paper reports a 
method of calculation of the integration in Fig. 1(c) 
and the values of the fourth virial coefficient and its 
components D,, D2, and D; in the case of the square 
well potential. The method of the integration for D; is 


1J. E. Mayer and M. G. Mayer, Statistical Mechanics (John 
Wiley & Sons, Inc., New York, 1940). 

?L. Boltzmann, Amsterdam Ber. 477 (1899); Wiss. Abhandl. 
von L. Boltzmann IIT, 658 (1909). 

3H. Happel, Ann. Physik 30, 246 (1906). 

4R. Majumdar, Bull. Calcutta Math. Soc. 21, 107 (1929). 

’ T. Kihara, Nippon Sugaku-Buturigakukaisi 17, 11 (1943). 

6 T. Kihara, Revs. Modern Phys. 25, 831 (1953). 

7 Hirschfelder, Curtiss, and Bird, Molecular Theory of Gases 
and Liquids (John Wiley & Sons, Inc., New York, 1954). 

8 G. S. Rushbrooke and H. I. Scoins, Phil. Mag. 42, 582 (1951); 
Proc. Roy. Soc. (London) A216, 203 (1953). 

°K. Hiroike, J. Phys. Soc. Japan 12, 326, 864 (1957). 

0 A. E. Rodriguez, Proc. Roy. Soc. (London) A239, 373 (1957). 


as follows: The multiple integral in the configuration 
space is transformed to that in t space by the Fourier 
transformation. The factors of the integrand in this 
space are decomposed by the use of the addition 
theorem of the Bessel functions. Then the angular part 
of the integrals can be evaluated separately and the 
whole integral is expressed as a series, each term of 
which consists of a product of three infinite integrals of 
one variable. These integrals are calculated in the 
Appendix. D,; and Dz, are calculated in a simple way. 

The results of the present paper show that above the 
Boyle temperature the magnitudes of Di, D2, and D; 
are approximately constant of comparable order and 
approaches the value for hard spheres, respectively. At 
low temperatures D; contributes mainly to D, and 
becomes negative infinity of the order of O(e®*/*”) as T 
tends to zero. This makes us suggest that the ring 
approximation for the virial coefficients might not be 
good for both the high- and low-temperature regions. 

The present method can be applied also to the higher 
virial coefficients, but the applications are postponed 
to another occasion. 


2. CALCULATION OF D; 


We consider a system of molecules whose interaction 
is given by the potential (Fig. 2) 
U (7) = 20. r<o 
=-—¢ o<r<go 
=(), go<r 


(2.1) 


where o is twice the diameter of the hard sphere, and g 
is assumed to be 2, for the convenience of the calcu- 
lation, since it is known that g=1.6—2.0 fits well with 


D-DD + D+ D 
“CNB 
(a) (b) (c) 


Fic. 1. The fourth virial coefficient D is composed of three 
parts, each of which consists of different multiple integrations. 
The integrands of these multiple integrations are expressed in Fig. 
1 (a), (b), and (c), and the contributions of these integrals to the 
total D are denoted D,, Ds, and Dy; hereafter. 
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Fic. 2, Square well potential. 














experiments.°'® We denote the position vector of the 
ith molecule by r;, and define the function f(r) by 
U(r)/kT _ J r<o 
o<r<go 


go<r. 


Ilr) e 


r;—-rh= 
P (2.3) 
f(\r|)=h(\ 0) )=A(p). 


As is well known, the fourth virial coefficient D is 
given by 


D=D,+D.4+ D;, (2.4) 


where 


1 1 
D;=— -- f--fs re—r,))f()rs—m| ) fi m—r| ) 
8 V 


fC rs—1a)) fC rare!) fC) rg—re| )dridrodrsdry, (2.5) 
and V is the volume of the total system. The integrands 
of D,, D2, and Ds are schematically expressed in Figs. 
1 (a), (b), and (c). The exact definitions of D; and Dz» 
are given by in (3.1) and (4.1), respectively. 

(2.5) can be integrated with respect to the coordinate 
of center of gravity and is expressed by 


D;= — 40 f . +f Hosdh(osdh( oh | O3— 04| ) 


x hI 04— 02 hi 02— 03 )doodo;d 04. (2.6) 


Let the Fourier transform of /(| |) be y(|t)): 


1\! 
y()=y(\t )=( ) fu o|) exp(—it-o)do 
2r 
2\! #*% sinip 
-( yf ph(p)——-dp, 
Tv 0 t 


1\} 
h(p)=h(\e}) ( ) fx t|) exp(it-o)dt 
2r/] + 
2\! 7” sinpl 
( ) ty (1) dl. 
_ “® p 


KATSURA 
For the square well potential given by (2.1), 


VO) = (14+ f)yvol() —8fr0(20), 


2\'/cost — sint Jib) 
oie (ae Me 
T hy P ti 


where o(t) is the corresponding Fourier transform for 
the hard sphere. 

When we insert (2.8) for the fourth, fifth, and sixth 
factors in the integrand of (2.6) and integrate over 9», 
0;, and o4, and use (2.9) and (2.10), then 


1 \92 
D;= -10r(, ) f. . » f Hos)h( hoy Oreyee" 
2r 


Xexp{iLt-(o;— 014) +t’ (041— 02) 


+t’ (e2—;) }}do.do:do.dtdt'dt” 


=—40 f- Provera ’—t"|) 


Ky(\t’—t!))y()t—t’|)dtdt’dt’” (2.11) 


—fo? S (1t-f)"(-8f)" 


x I (a1,d2,a3 . 14,05,06), 
where 


T(a},d2,43; @4,d5,46) 


= f. : +f rolas)vo(as’yro(ast”)vo(as| tt" 


Xvo(as| t’—t! yo(as|t—t’| )dtdt’dt’”’. (2.13) 


The summation in (2.12) means the summation for all 
64 cases where all a; take the values 1 and 2; and m 
and ms are the number of a; which take the values 1 
and 2, respectively (#:+22=6). (2.11) is reduced to 
D;/o°= —3{ (1+ f)97(1,1,1; 1,1,1) 

—3(1+/)9(8f)[7(2,1,1; 1,1,1)+/(1,1,1; 2,1,1) J 

+3(1+ f)*(8/)?[7(1,2,2; 1,1,1)+7(1,1,1; 1,2,2) 

+27(2,1,1; 1,2,1)+/(2,1,1; 2,1,1) ] 

— (1+ f)®(8f)'L7(1,1,1; 2,2,2) +37 (1,2,2; 2,1,1) 

+61 (2,1,1; 2,1,2)+6/(1,2,2; 1,2,1) 

+31 (2,1,1; 1,2,2)+/(2,2,2; 1,1,1) ] 

+3(1+ f)?(8f)*{7(1,2,2; 1,2,2)+27(1,2,2; 2,1,2) 

+1(2,2,2; 2,1,1)+/(2,1,1; 2,2,2) ] 

—3(1+ f)(8f)®L7(2,2,2; 1,2,2)+/(1,2,2; 2,2,2) ] 

? 


+ (8f)®1 (2,2, (2.14) 


; 

The polar coordinates of t, t’, t’’ are denoted by 
t, a, B; t’, a’, B’; t”’, a’, B”, respectively, and their solid 
angle elements are denoted by dQ, dQ’, and dQ”, 
respectively (dQ=sinadadB). Now we use the addition 





FOURTH VIRIAL COEPFICIENT 


theorem of the Bessel function to 


yo(a| t’|), 


decompose 


—70(a|t—t’|) 
_JSialt-t |) 


(a|\t— t'|)! 


egg Ftnlal) Srpn(at) 
r(3) > (+n) C,3 (cosy), (2.15) 
n=) 


(at)? (at "3 


where y, y’, y’” are the angles between t and t’, t’ and 
t’’, t’ and t, respectively, i.e., 
cosy = cosa cosa’+sina sina’ cos(B—f’), etc. 
C,’(cosy) is the Gegenbauer polynomial and is ex- 
pressed in terms of Legendre polynomials by 
os dP n41(cosy) 
C,3(cosy) =——— 
d cosy 
u =even or odd 
u20 


(2u+1)P(cosy). (2.16) 


Then 


=[2! r( 2) - (@1d2d3) (40506) $ 


if fran 


XI y(aal)Iy(asl)I yal Ss i1(a lV 3, m( dsl’) 


2 


T(a),d2,03; 4,05,0¢) = 


XL LL Gt)G+m) (2+ 


l=0 m=0 n=0 


XI 44m (asl) I 34 n( del) J 94-n(del’)Ci (cosy) Cm? (cosy’) 


XC. (cosy) t-tdtt!’—idt' t'—'dt""dNdQ'dQ". (2.17) 


With use of (2.16), the angular part of (2.17) becomes 


f. . » fc cosry ‘m’(cosy )C,2(cosy””)dQdQ’dQn” 
f u=l v=m 1 
J u =even or odd v=even or odd w=even or odd 


X (2u+1)(20+1)(2w+ 1) Pu (cosy) 


X P, (cosy’) Pw(cosy””)dQdQ/dQ". (2.18) 


First we consider 


fe. (cosy) P,(cosy’)dQ’. 


vanishes for uv. 


(2.19) 


(2.19) When u=z2, it becomes 
dor 
P,,(cosy’”’). 
2u+1 
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We substitute this into (2.18): 


(2.18) =1rf > d (2u+1)(2w+1) 


x P,, (cosy””) P»(cosy’”)dQdQ”, 


After the integration over dQ, only the term for u=w 
does not vanish. Hence 


(2.18) = (4m)? ZQu+1) f ae” 


(4m)* 5 (2u-+1). (2.21) 


Since the nonvanishing terms in the right-hand side 
of (2.18) are those for w=v=w, the nonvanishing terms 
in >>; dom dn in (2.17) are those for which /, m, and n 
are all even or all odd. Denoting Min(l,m,n) by M, 
the upper limit in the summation of (2.21) is M, and 
hence we have 


(2.18) = (2.21) = (44)*X 3 (M+1)(M-+2). 


We use the result of the integration of the angular 
part in (2.15), 


I (a),@2,a3; 4,05,d6) 


3 v’ Y! 4(M+1)(M+2) 


m n 


mn ( 1,05,d¢6) 
XN nt( @2,4¢,04).V Im(@3,04,45), 
where 


br} (3+m) 
xW 


)(3+n) }*(a,a jax) 


2,44 m+n! (@i,0j,Qk). 


Ninn(@i,0;,04) =2 


Here we use the notation 


War u2,-- 


“un? (G1,02,** +n) 


ff sestast Foal) Sun aal dt, (2.23) 


the generalization of Weber-Schafheitlin’s 
discontinuous integral, and appears frequently in the 
following sections. >>)’ >>,’ >-,’ in (2.22) means the 
summation taken for the points where /, m, n are zero, 
all positive even integers, and all positive odd integers. 
The necessary W) 34m,a4n2(@i,@j,0x) _ be calculated 
from W41,34m,44n'(@,0,b) for a, b=1, 2. These integrals 
are calculated in Appendix I. In the whore calcu- 
—- we truncate the triple infinite summation 
dn’ >,’ by the finite summation in which (/,m,n) 
is pone to be (0,0,0), (0,0,2), (0,2,2), (0,0,4), (2,2,2), 
(1,1,1), (1,1,3), (1,3,3), and their permutations. These 
values are listed in Table Il of Appendix I. Then we 
have 


D3, ‘b' 


which is 


— }{ 1.2713— 2.9763 a 17.182 /?— 8.4200 f* 
+ 20.190 f'—35.434/°+ 14.640/°], (2.24) 
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where b= }ro* is four times the volume of the hard 
sphere. The first term inside the square bracket is the 
value for the hard sphere and its value cited by Riddell 
and Uhlenbeck" is 1.2728. The accuracy of each term 
in (2.24) can be estimated from Table II in the Appen- 
dix. [By adding more terms for (l,m,n) to Table II, 
we can make the coefficient of (2.24) as accurate as 
we desire. ] 
3. CALCULATION OF D, 


D,= _ ior f os fio moan( | 03— 04!) 


Xh(| o2— 03] )\dosdesdo,. (3.1) 


Applying the Fourier transformation to the third and 
fourth factors in the integrand, and integrating over 03, 
we have the 6-function and (3.1) is transformed to 


1\! 
D,=-10'(—) f-- frerivoornre) 
2a 


XexpLit- (es— 4) +it'- (o2— 03) |\doedo;do.dtdt’ 


== 1028) f -- froyeneyyo(|t—e paae 


- 4(32nt)o? f {y())} 4d. (3.2) 
0 

This is the result obtained by Montroll and Mayer.” 

Inserting (2.9) for y(t), we have 

D,= —3(32r*)o9 Zz (1+ f)™(—8/)"(aia2asa4)—! 


Qy-- M% 


XWaas }'(@,2,03,04), (3.3) 


> means the summation for all 16 cases where a), a, 

a3, and a, take the values 1 and 2, ie., 

Dy = — 3 (32m*)o®[ (1+-f)*W 4, 4,4, 9°(1,1,1,1) 
—4(1+ /)°8/2-W,, 3.4.9°(1,1,1,2) 
+6(1+ f)?(8f)?2-*W 4, 4, 4, 94(1,1,2,2) 
—4(1+ f) (8f)?2-°?W 4, 9, 94(1,2,2,2) 

+ (8f)*2-°W 4, 3, 3, 44(2,2,2,2) ]. 

The necessary W3,3,3,34(@1,@2,@3,@4) values are given in 

Appendixes II and III. Using these values, we have 

D,/b'= — (1/560)[544—4075/+35 007/2 

—99 687 f84+139 215f"]. (3.5) 
The first term, 544/560, is the value for hard spheres 
and has been given by Rushbrooke and Scoins.* 
4. CALCULATION OF D, 


D.= —10°f . » f Hosdi(os)h( nM | ese) 


Xh(|e2—03|)doedosdo,. (4.1) 


WR, J. Riddell and G. E. Uhlenbeck, J. Chem. Phys. 21, 2056 
(1953). 

2 FE. W. Montroll and J. E. Mayer, J. Chem. Phys. 9, 626 
(1941). 


(3.4) 
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Applying the Fourier transformation to the fourth and 
fifth factors in the integrand, we have 


1 3 
tips -10(—) f " f h(p:)h(ox)h(p,)v(v(0) 
Qn 


Xexp{i[t-(o;— 04) +t’: (o2— 03) }}doxdo;do.dtdt’. (4.2) 
Integrating over o2 and o4, we have 
D.=— 30° f ilpsddes f Cr OPEV? 


Xexp[io;:(t—t’) Jdtdt’, (4.3) 


- te f iloddor| f EOP exptitat}. (4.4) 


This is equivalent to the result of Rushbrooke and 
Scoins.® 

Inserting (2.9) for y(t) and using (2.8) for the 
integration over t, 


1 7) E i (t) 
D.=~1(86%)(2")| ~ fod f (-a+n— 


a) 
J, (0) 


—(1+f)— 
i} 


J; (21) \? sintp *| 
+8f- ) at (4.5) 
(21)3 p 
1 
~~ 13204 foie + Ny 8.40) 


16 
— —(1+f)fW4,3,35(1,2,0) 
2v2 


1 


where the content of the second square bracket is the 
same as that in the first. W4,4,4'(a1,¢2,0) is given in 
Appendix IV for the cases where a;=1 and 2. From 
this we have™ 


D./b*= — (1/4480) [— 6347427 369 f— 184 156 /? 

4594 272 f*—1 518 980 f*4+918 540/°]. 

The first term, 6347/4480= 1.41670, is the value for 

hard spheres and agrees with the value of Rushbrooke 
and Scoins.*® 


(4.7) 


18K, Hiroike has informed the author that he has obtained the 
same results as (3.5) and (4.7) for D, and De independently 
(private communication). 





FOURTH VIRIAL COEFFICIENT 


5. CONCLUSIONS AND DISCUSSIONS 


In the preceding sections the integrals D;, D2, and Ds 
have been obtained as functions of the temperature 
variable f=e**?—1 in (3.5), (4.7), and (2.26). The sum 
of them gives the total D. We summarize the second, 
the third®* and the fourth virial coefficients of the 
square well potential (g=2 in Fig. 1) and the fifth 
virial coefficient of the hard sphere. 


B/b=1-7f, 
C/b= (5—17 f+ 136 f2— 162f%)/8, 
D/b8=0.28642+ 1.5397 f— 23.554 f?-+53.645 f2 

+ 69.859 f*— 170.01 f>— 14.777 f*, 
E/b'=0.115+-+ +. 


The terms of the lowest order are the hard-sphere 
values. 

The value of D for the hard sphere obtained by 
Boltzmann? and by Majumdar‘ is" 


(5.1) 
(5.2) 


(5.3) 
(5.4) 


D 1283 3 73V2+81X17(arctanv2—7/4) 


=——_+ - 
b® §=8960 2 


32X35 
= (0.2869, (5.5) 


and that obtained by Happel® is 0.288. E/b‘ is the 








1.4167 




















-1 





Fic. 3. Temperature dependence of D,, D2, D3, and D. The unit 
of temperature is the Boyle temperature. 


iM Boltzmann and Majumdar used the result by Van Laar 
[Archives du Museé Joyler ser. 2, 6, p. 237 (1900) =Amsterdam 
Akad. Versl. Jan (1899), p. 350]. 
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py x2? 


x 








-20 





Fic. 4, Temperature dependence of Bx*, Cx®, Dx, Dox'!®, D3x"?. 


D= dD, +De2+D;; x=e* aT 


value for the hard sphere obtained by Rosenbluth and 
Rosenbluth’!® using the Monte Carlo method. 

The temperature dependence of D,, Do, D3, and D is 
shown in Fig. 3. As the unit of temperature we choose 
the Boyle temperature 7g, at which the second virial 
coefficient vanishes. Above the Boyle temperature D,, 
D:, and D; are approximately constant and approach 
the values for the hard sphere, and the first and the 
third are negative while the second is positive. Their 
absolute values are of the same order of magnitude and 
the contributions to D from these three cancel for the 
most part. Below the Boyle temperature each of them 
becomes negative and approaches negative infinity as 
T tends to zero, being of the order of e~**/*?, e~5e/F?, 
and e~*/*?, respectively. Such a feature of the temper- 
ature dependence of the fourth virial coefficient and its 
components will also occur in the case of the Lennard- 
Jones potential. 

In general, the mth virial coefficient tends to + 
like e"("—D<i2kT = y-2n(n-) Hence B2?, Cx®, Dix, Dox” 
and Dx" for x=e~¢*? (f=a-*—1) are shown in Fig. 
4. Figure 4 shows the contribution of each component 
of D to the total D, especially at low temperatures. 
The D; contribution is the main part at low tempera- 
tures. Such a contribution of D; to D makes us suggest 
that the summation of the ring approximation” of 
irreducible cluster integrals, which is equivalent to the 
solution of Green’s linearized integral equation,® is not 
a good approximation over the whole temperature 
region. For example, it can be proved that the isotherm 


18M. N. Rosenbluth and A. W. Rosenbluth, J. Chem. Phys. 
22, 881 (1954). 
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Fic. 5. Normalized 
second, third, and fourth 
cluster integrals for the 
square well potential. 








obtained from the ring approximation for the square 
well potential has a singularity similar to the one 
obtained by Rodriguez.'® The origin of such a singu- 
larity in this isotherm may be attributed to the neglect 
of other irreducible graphs. As a criterion of the validity 
of the ring approximation and similar ones, one should 
show that the isotherm behaves well at low tempera- 
tures. 

In the case of the lattice gas!’ the alternative signs 
of the virial coefficients appear. In the present results, 
B, C, and D are negative at low temperatures. It is an 
interesting question whether the higher virial coeffi- 
cients FE, F, --+ at low temperatures are all negative 
as B, C, and D, or not, since the negative signs of the 
virial coefficient below the critical temperature is 
assumed in Mayer’s theory of condensation. 

Kihara-Kaneko’s proposition'® that the contribution 
of irreducible cluster integrals 8,’s for the system of 
g=© is a good approximation to the calculation of the 
higher cluster integrals 6;’s for g~2, is valid at high 
temperatures, but this approximation is not sufficient 
at low temperatures. Figure 5 shows Kihara’s normal- 
ized cluster integrals.'® 

The critical data obtained from the second and the 
third virial coefficients for the square well potential? 
(g=2)(po,vo,7) and their corrected values under con- 
sideration of the fourth virial coefficient (p1,0:,7)) are 
shown in Table I. The critical temperature decreases 
9%, the critical volume increases 36%, while p1v1/k7; 
increases 9% from povo/kT». Such a detailed quantita- 

16 A. E. Rodriguez, Proc. Roy. Soc. (London) A196, 73 (1949). 


17S. Katsura, Progr. Theoret. Phys. (Kyoto) 20, 192 (1958). 
18 T. Kihara and S. Kaneko, J. Phys. Soc. Japan 12, 994 (1957). 


b,* was calculated using 
the value of D in the 
present result. This fig- 
ure was calculated by 
T. Kihara. 


~ 
g=00 ~ 


tive comparison, however, will depend on the precise 
form of the potential. 

The discussions of the distribution-function*” 
method often use the comparison of the true D and 
approximate D. One should, correctly, use the compari- 
son over the whole temperature range, rather than only 
the value for hard spheres. 

The method of Sec. 2 of the present paper can be 
applied to D2, giving the result 


D.= — 3(641"*)o® >. 
ai-**d5 
mit+ne=5 
a1+++a5=1 or 2 


(1+f)"(—8/)™ 


x 
X (d)20304)~ a5 ee ( 3+2n) 


n=O 


(5.6) 


2 


XW. 2, p42n?(G1,d2,d5) Ws, 3, 3420 3(@2,04,d5) 


The value of D./b* for hard spheres from the sum of 
the first three terms in }>,~0% is 1.41618, which agrees 
well with the exact value of (4.7), showing the good 
convergence of this series. 

Moreover, this method can also be applied to virial 
coefficients higher than the fourth. In particular, for 
some types of irreducible graphs, a certain simple rule 
has been found which enables one to write down the 
integrand in t space corresponding to (2.13). The 
details will be discussed at another time. 
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TaBLE I. Correction of the critical data of zeroth approximation by Kihara* due to the fourth virial coefficient D. 
The third line indicates the comparison of the various temperature variables, «/k7, T/T p, f, and x. 


Pb/kTs 


0.06656 


Zeroth approx. (from B and C), Kihara 
0.04865 


First approx. (from B, C, and D), present results 
Boyle temp. 


® See reference 5. 


1.9184 
2.6086 


T/Te J x 


0.3485 0.3831 
0.3819 0.3497 0.4654 0.8261 
0.13353 - 0.14285 0.9351 


o/b po/kT /kT 


0.3333 
0.3629 


0.8403 


0.41688 
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APPENDIX I 
( a,b,b) 


-f J 54) S ¢4-m( OD) ST py n(ODE ‘dt. (A.1) 


0 


W 34.0, 34m, 340° 


We use the integral representation of the Bessel function 


1 r(—s) ah th 
Jy41(al) = =e ( ) ds, (A.2) 
2ri¥i, T (+141) 2 


)o 1 


ay" 
fo) 
pS 7 2b 


1 / 
r(—s)l (-+ -- 
2 


“ 
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to the first factor in the integrand of (A.1). Here L is 
the straight line parallel to the imaginary axis crossing 
at s=—4. We insert (A.2) into (A.1) and shift the 
integration path to L’, which is the straight line parallel 
to the imaginary axis crossing at s=sp=}—}/l—e, 
and change the order of the integrations fds and 
Sdt (—$-—3l—4}m—}n<s)<}—3l). The integration 


over dt is the Weber-Schafheitlin’s integral, 


and it 


gives 


l 3+m+ntl 
Yr (ett e(e tet) 
2 2 


(A.3) 


re, eee re m+n—I1+5 m+n—1+2 ; 
risti+pn( s+ : )r(-s+ )r(-s+ ) 
2 z 2 


Since a/2bS1 for a, b=1 or 2, the integration can be 
obtained from the residue of the poles in the right plane 
ore. 

The necessary integral can be confined to the case 
where m and n are both even or odd and nSm with 
/=0, and to the case where / and m are both even or 
odd and /Sm with n=0. 

First we consider the former case: The poles of '(—s) 
cancel those of the second factor in the denominator of 


m+n+3 
ror(= ) 
| 2 


4 
(A.1)= worn a 
4(2m)}| 


«Z 
“y gin’ b] ial —— 92) 
2 2 y 


“ 


m+n+3 m—-n —m—n—-3 —m+n 
XP er ) 


2 hen paranre” - 
ran (1 oS) 
Z 2 
0, 5 - (a 26) + 


the integrand, and only s=0 remains for m=n, and no 
pole remains for n<m. The poles of '(—s+ 4) cancel 
of the third factor of the denominator and 
s=}, 3, ---3+4m+4m remain. The poles of '(—s+1) 
cancel those of the fourth factor of the denominator and 
s=1, 2---3m—}n remain for m2n+2, and no pole 
remains for m=n. 

The sum of the residues can be expressed by the 
generalized hypergeometric function: 


those 


m—n 
+ 
2 


I'(—3) (a/2b) 


—m—n+1 
ria)r( -- -—-- 
Z 


r'(4) 


; m—n+1 
or) 


m+n+4 m—n+l —m—n—2 —m-+-n+1 
xaF(~ — i oes —o tO a 26) 
? ? ? ? 


ya ~ a 


mrn+s 
r(—1)r(—3 ee ) 
2b 


a pare mrnrs m—n 
ran re (75) 


m+n+s m—n+2 —m—n—1 
XK aPs (“ , 


—m+n+2 


’ , 


2 2 


:2,3,4; (a/20)) . (AA) 
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TaBLe II. The values of Knm (a,b,c) = 


(122) 
185/288 
2649/57344 
117/1024 
K2(122) 
549/40960 
K4o(122) 


(i111) (211) 
5/12 22/9 
25/1344 0 

7/96 0 

K(111) Koo(211)=0 
11/960 0 

K(111) K(211)=0 


53/480 

— 35/6912 
17/960 
K,;(111) 


1503/5120 
10251/131072 
1257/40960 


K,3(211) =0 K,3(111) 


In the above 4F3(a1,a 2,003,044 5 Bi,B2,83 ; ), when the 
number of a; which are equal to zero is lt than that 
of 8; which are equal to zero, 4/;=1. When there exists 
1/I’ (zero or negative integer) in the constant factor, 
this term is put to be equal to zero. 

The integral in the latter case (n=0, / and m are 
both even or odd) can be obtained in the similar way. 

The results are summarized in Table II. 


1 
[Ware tlladdm- ne Kux(as)- 
4(2m)! 


(2x 
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Ws, 13. ;*(a,a,b,b) 


D 


=f rapa, Ty (bt) J y( bi)t-*dt. (A.5) 


0 


We use the integral representation of the product of 
the Bessel functions to the first and the second factors 
in the integrand; then 


© op “A (at/2)""™( m4) m 
f =! ——".F4(b1) Jy (bird 


m {1 ( (m+3 42 


go m=O 


1 af o (—)*P (m+2)(al)?#? 
Vado m= mI (m4+8)r (m+4) 


XI; (bt), (bi)t-dt. (A.6) 


The summation for m is replaced by the integral 
representation with the residue of '(—s), 


Te 
=— - fas 
Qari \/r4 1, 


I'(—s)0'(s+2)a5+** 
I'(s+$)P(s+4) 


xf Jy(bt)Jy(bt)t-*dt,  (A.7) 
0 


where L is the straight line parallel to the imaginary 
axis crossing at s=— 4. Carrying out the integration 


TOSHI 


4(27)4W 4 g4m,44n4 (a,b,c) 


21K yo(111) 

21K (111) 
241K (111) 
24K 99(111) 
25K (111) 
2!K 4o(111) 


21K, (111) 
21K 33(111) 
2§K,3(111) 
2!K1;(111) 


KATSURA 
Wein viata dt. a, 6, c=1 or 


(121) 





Koo(211) Koo(122) 
2v2/84 —241/43008 
2v2/12 31/768 
K2o(112) =0 K2(122) 
2v2/96 31/7680 
K4o(112) =0 K4o(122) 


709/7680 
— 3883/442368 
— 333/20480 
K,3(221) 


Koo(211) 
K22(121) 
0 


K2(121) 
0 
K49(121) 


Koo(122) 
K22(212) 
K2(122) 
K2(212) 
K (122) 
K4o(212) 


Ky, (212) 
K33(212) 
77/15360 
Ki3(212) 


2v2/30 
2v2/216 
0 


Ky, (121) 
K33(121) 
2v2/30 


K,3(112) Ki3(121) = 











over di ‘(Weber- Schafheitlin’s integral), we have 


—a ey" 
=" fae’) 
ri L b 


x 
(s-+3)(s+2) 


1 


—. (A8) 
)(s+3)s(s—4)(s—3) 


According as a26 or a6, the integral is given by 
the sum of the residues of the poles in the right or left 
half-plane, respectively. 


a’ 4 4 7b 8 sb ‘| 
E2020) 
anl27  35N\a/ 567\a/ | 


a’ | 8 sa\® 47a\t 4/24 *| 
SOOO 
nl s67\b/ 35\e/ 27\5 
1 263 


* 2,1,1)= Sag SE 


Manan? Qe 2835° 
2 


1 136 
Woo09'(1,1,1,1)=-— —, 
2m 2835 


1 1088 
Waaas(2,2,2,2)=— —. 
2m 2835 


APPENDIX III 


Wa utt.20)= f Jy (QI ;(20F;(pt)t-idt, 
; (0<p<2). (A.11) 


We express the Bessel function J,(¢) by the product 
of the exponential function and the polynomial: 


i 
Wasi! i al it( 44-- 
4.4'(1,2,0) ~(- *) = (2p) fie ( +) 
1 1 1 
tex(1- -) | feu( +=) tese( 1 ~)| 
t 2t 2t 


X {eitp—e-it¥ tld? (A.12) 
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where P means the principal value. Let ii=s; then 


W,4,3°(1,2,0) 


if2\! 1 “a s 
--(-) —{f ds set n(a- —+ ) 
8X7 (2p)'L Y ix 2s 2s? 
iw 1 1 
faery) 
ye 


—io 


12 1 1 
+Pf ds recto 1 _—— —) 
Ya 


—1n 


snd 3 1 
+P f ds se-vnn(14—4—) | (A.13) 
2s 623 


—1D 


In each of the above four integrals the pole of the 
integrand is only s=0. The third integral for p21 and 
the first and the second integrals vanish when they are 
integrated over the left half-circle and imaginary axis 
indented the origin [Fig. 6(a) ]. The third integral for 
pS1 and the fourth integral vanish when they are 
integrated over the right half-circle and the imaginary 
axis indented the origin [ Fig. 6(b) ]. Hence the principal 
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values of the above integrals are given by the integrals 
over a semicircle of infinitely small radius, i.e., J¢)2**/*d0 
and f7;7-*d8, respectively. When we expand the 
integrand in Laurent series beginning from s~‘, the 


Fic. 6. Integration 
paths in (A.14). 


a 
U1 C 








(b) 


coefficients of the negative even powers of s do not 
appear and the integrals of the negative odd powers of 
§ vanish since 


Therefore the terms contributing to the integrals are 
only the terms of order s~!. We pick up these terms ; then 


1/2\! i dsj f1(3+p)* 3 (3+p)* (3+ )? 
init Seg OSE, Ct 
16\mr/ (2p)tLYe,s (X24! a) 2! 


dsf 1(1+p)* 1(1+p)* (1+p)2 dsf 1(—1+p)* 1(—1+p)® (—1+)? 
fap ee (ee) 
(1S 4! 2 31 21 Cifor p21 5 2 4! 2 3! 2! 


C2 for pS1 


ds 41 (—3+p)* 3(—3+p)* (—3+p)? 
f ( ; -+ —_ . 2 +- a 
4 


where C, and C2 are the semicircles around the origin 
with infinitely small radius, f7)°* and fi)2*?, 
respectively. 

Hence we have 


W,3,35(1,2,) 


f. & «fl 
= — —— —(p'—30p?+72p—27), 
96 \/1 \/p 


1<p<2, (A.15) 


1d +“, 
Pe Md O<pK<1. 


, (A.16) 
/r 6 

Ws33.23(1,1,0) and W3,3,;'(2,2,p) can be integrated by 

this method. The method of Appendix I, however, is 


es \2 4! oe a 


more suitable. The results are 
W3,3,4?(@,¢,p) 
Vp (4 
caine ee mee 3 
2(2r)i1N3 a 


=() 


b 
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7 


Was rat0= f Tat) J(OIy(OIy(Otr-dt, (A.18) 


A method similar to that of Appendix IIT can be 
applied and gives 
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1 1 (3+a)?1 (3+a)8 3 (3+-a)’ 3 (3+a)® 1 (3+<a)5 
(A.18)= — . i (-1- )+ (3+ )+ (-3- )+= | 
4ry al 9! a 8! a 7! a 6! a 5! 
(1+a)*1 (1+a)* 1 (1+<a)' 1 (1+<a)® 1 (1+a)* 
+3] - = (1+ )+ (-1+ )+ (-1- )+ — 
9! a 8! a 7! a 6! a 5! 
(a—1)*1 (a—1)* 1 (a—1) 1 (a—1)§ 1 (a—1)§ 
2 + (-14 )+ (-1- ye (1- )+ ; 
9! 8! a 7! a 6! a 5! 
(a—3)91 (a—3)* 3 (a—3)? 3 (a—3)® 1 (a—3)° 
-|- + (:- )+ (3- )+—— (3- )+ |} 
9! a 8! a 7! a 6! a 5! 


The double sign before the third [ ] should be taken as + for a21 and as — for aS. 


(A.19) 


11 6251 v2 92081 
HW’; 3 4(1,2,2,2)=8W3 3,3, ;*(3,1,1,1) =- - 


HW } 3,3,3°(2,1,1,1) = . 
mw 2835 2° 


V2 m 2° 2835 
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Aggregation of F Centers in Potassium Iodide 
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\s optical aggregation of F centers proceeds in KI the R, M, and N bands develop, but later lose their 
identity in a very broad band, which is absent in KCI and KBr. The shape and position of this band are 
temperature independent, and it is removed from the peak of the true colloid band. A new band on the 
short-wavelength side of the F band was observed in crystals containing high concentrations of F centers. 


A SERIES of measurements on the optical aggre- 
gation of F centers in KCI, KBr, and KI indicates 
that the final product of aggregation at room tempera- 
ture in KI is markedly different from those in KC] 
and KBr. 

Crystals of materials 
Kyropoulos technique in an inert nitrogen atmosphere 
from Analar reagents, and were sufficiently free of heavy 
ion impurities to possess no absorption bands in the 
range 2300-20 000 A. Optical samples cleaved from these 
crystals were additively colored with metallic potassium, 
quenched from 450°C to 20°C, and irradiated with 
light in the F band at 16°C. At various stages of this 
illumination the crystal was cooled to liquid air temper- 


these were grown by the 


ature and the absorption spectra measured. 

The results of measurements on a crystal of KI are 
shown in Fig. 1. The essential points in these curves 
were reproduced with several other crystals and F 
center concentrations. Freshly colored crystals exhibited 
an # band and small R, M, and \ bands superimposed 
on the general background of the pure crystal. When 
the crystals were irradiated a very broad band appeared 

* Now at Division of Applied Chemistry, National Research 
Council Laboratories, Ottawa, Canada. 


with the R, M, and N bands superimposed upon it. 
As the F-band irradiation continued the R, M, and .V 
bands increased to a maximum and then decreased, 
until finally only the broad band remained. KBr and 
KCI crystals do not exhibit this broad band when 
treated in a similar way, but it is possible that this 
band is similar to the R’ band previously formed in 
KCI by Scott and Bupp! by thermal treatment. The 
shape and position of the broad band is scarcely affected 
by the temperature of measurement, and its peak 


TABLE I. Positions and half-widths of bands in ev at 77°K. 


Band J Ri R2 M 


KC] 
Energy 2: 1.89 
Half-width 23 0.11 
KBr 
Energy 205 109 1.54 137 1.13 
Half-width 0.14 0.95 0.95 0.145 


1.70 1.53 


Je 1.28 
0.08 0.08 


0.12 


1.04+0.02 


Energy 149 1.36 1.21 
0.09+0.02 


Half-width 0.21 0.12 0.08 0.08 
Ivey’s predicted energy 1.73 1.49 1.37 1.24 


1 A. B, Scott and L. P. Bupp, Phys. Rev. 79, 341 (1950). 
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position varies considerably from crystal to crystal, 
although it is always considerably removed from the 
peak of the true colloid band, which in KI occurs at 
approximately 1.46 ev. Unlike Scott and Bupp’s R’ 
band, the band is not formed in the absence of light. 
No confusion with the F’ band was possible since 
experiments indicated that the lifetime of the F’ band 
was of the order of minutes or possibly seconds in KI. 
The R, M, and NV bands were separated from the F 
and R’ bands by subtracting the background dotted in 
Fig. 1. It is assumed that the only overlap of importance 
is between the two R bands. The background curve is 
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Fic. 1. The absorption of an additively colored KI crystal 
after various times of irradiation with light in the / band. The 
sample thickness is 0.19 cm. Optical density is logio(Jo/Z), where 
J is the radiation entering and J the radiation leaving the sample. 


therefore drawn through all the minima except that 
between the R bands. The resulting analysis of these 
bands is shown in Fig. 2. 

A new band, easily resolvable at liquid air tempera- 
ture, was observed on the short wavelength side of the 
F band in crystals containing a high concentration of 
F centers. This was slowly bleached by F light at room 
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Fic. 2. Variation of the absorption due to F, MV, R, 
and N centers with F irradiation. 
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temperature. The position and half-width of this band 
were 2.47 ev and 0.12 ev respectively at 77°K, and 
2.42 ev and 0.20 at 300°K. Bands in similar positions 
but produced by pressure and plastic deformation have 
been reported by Maisch and Drickamer*and by Kiyama 
et al.’ 

The measurements on KC] closely resemble those of 
Herman, Wallis, and Wallis‘ and are in good agreement 
with the results of van Doorn and Haven.® An extra 
band was observed between the F and Rk; bands, 
corresponding to the A band of Petroff,® in both KCl 
and KBr. It is of interest to note that in all three 
materials the ratio of the absorptions of the R; and Re 
bands remains constant throughout the irradiation.‘ 

The main points of the measurements are outlined 
in Table I in which all values are in electron volts and 
are reproducible to +0.01 ev unless otherwise stated. 
The positions of the F, R, and M bands in KI show 
good agreement with the values calculated by Ivey? 
from empirical formulas. 


2W. G. Maisch and H. G. 
5, 328 (1958). 

3R. Kiyama and F. Okamoto, Rev. Phys. Chem. Japan 25, 
1, 6, 49 (1955); R. Kiyama and K. Schimuyu, Rev. Phys. Chem. 
Japan 25, 101 (1955). 

4 Herman, Wallis, and Wallis, Phys. Rev. 100, 1267 (1955). 

5C. Z. van Doorn and Y. Haven, Philips Research Repts. 11, 
479 (1956). 

6S. Petroff, Z. Physik 127, 443 (1950). 

7H. J. Ivey, Phys. Rev. 72, 341 (1947). 
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The Dirac-Van Vleck-Serber spin-operator expansion, first ap- 
plied by Anderson to the Kramers superexchange problem, is ex- 
tended, simplified, and systematized in order to handle all overlap 
contributions arising from a number of interacting configurations. 
The linear cation-anion-cation (e.g., Mn*tt-O~--Mn**) four 
electron problem is worked out in detail, taking account of all 
contributions from configurations (A) ionic, (B) electron trans- 
ferred to right, (C) electron transferred to left. Group symmetry 
requirements are invoked; and these, together with a simple ap- 
proximation equivalent to perturbation theory, are shown to 
reduce the complicated matrix formulation to a single linear equa 
tion. The solution contains terms previously obtained by Ander- 
son, by Anderson and Hasegawa, and by Yamashita, and a number 
of important extra terms. All superexchange terms are fourth 


I. INTRODUCTION 

N this paper we demonstrate a relatively simple 

procedure for handling the formal portion of the 
theory of superexchange. We confine ourselves to the 
problem of three nuclei and four electrons (such as 
Mn-O-Mn). The extension of the method to an anion 
surrounded octahedrally by six paramagnetic cations 
will be discussed in a subsequent paper. 

We consider three configurations and their inter- 
actions: 


A. The ionic configuration. 

B. The configuration in which an electron is trans- 
ferred from the anion to the cation on the right. 

C. The configuration in which an electron is trans- 
ferred from the anion to the cation on the left. 


We use electronic orbitals appropriate to the free 
ions, but we take account of overlap between anion and 
cation orbitals. In the absence of this overlap, there can 
be no superexchange effects of any kind. Thus the 
overlap is fundamental to the theory, and one must be 
careful to include all terms arising from it. Our results 
contain superexchange effects which may be traced to 


(1) overlap in the ionic configuration (Yamashita 
mechanism!) ; 

(2) transfer of an electron to right or to left (Anderson 
mechanism*—also treated by Van Vleck*) ; 

(3) interaction between configurations B and C 
(Anderson-Hasegawa mechanism‘) ; 


* This work was done in the Sarah Mellon Scaife Radiation 
Laboratory and was supported by the U. S. Air Force through the 
Air Force Office of Scientific Research of the Air Research and 
Development Command. 

t On leave from Tokyo University of Education, Tokyo, Japan. 

1 J. Yamashita and J. Kondo, Phys. Rev. 109, 730 (1958). 

2 P. W. Anderson, Phys. Rev. 79, 350 (1950). 

3 J. H. Van Vleck, J. phys. radium 12, 262 (1951). 

‘ Nagamiya, Yosida, and Kubo, in Advances in Physics, edited 
by N. F. Mott (Taylor and Francis, Ltd., London, 1955), Vol. 4, 
p. 1. 


order or higher in the overlap S. A rough numerical evaluation 
with modified Slater wave functions appropriate to MnO-type 
crystals yields an effective superexchange integral of the required 
size. Brief consideration is given to configurations in which two 
electrons are transferred, in particular (D) simultaneous transfer 
of electrons to right and to left (Slater mechanism). Unless the 
energy required to form this configuration is surprisingly small, its 
contribution is probably not so important, although the problem 
needs to be investigated in detail. Some consideration is also given 
to the linear cation-anion-anion-cation (e.g., Mn*+-Br--Br--Mn**) 
problem; the formal solution for the ionic configuration is worked 
out; and it is shown that superexchange terms first appear in the 
order S*7?, where S is the anion-cation overlap and T is the anion- 
anion overlap. 


(4) additional overlap terms within and between the 
configurations, and not previously considered. 


Thus we confine ourselves to the ground state and to 
states in which a single electron has been transferred. 
One may also distinguish the following configurations: 


D. The configuration in which one electron is trans- 
ferred from the anion to the cation on the left, and at the 
same time another electron is transferred from the anion 
to the cation on the right. This has been considered by 
Slater,’ Pratt,® Goodenough and Loeb,’ and Nesbet.? It 
should be unimportant in halides, but it may be of 
significance in oxides and sulfides. In the latter, how- 
ever, the large crystalline Coulomb fields will cause the 
energy of the configuration D to lie well above the 
energy of configurations B and C. We believe, in particu- 
lar, that Nesbet has underestimated the energy of D. 
However, D may still be important. Pratt® has shown 
that if orthogonal orbitals are used, and regardless of 
the nature of these orbitals, configuration D must be 
included; otherwise the ferromagnetic state is lowest. 
This important work first drew attention to the very 
serious nature of the orthogonalization problem in 
antiferromagnetism. 

FE. The configuration in which one electron is trans- 
ferred from the anion to the cation on the left, and at the 
same time an electron is transferred from the cation on 
the right to the vacated anion orbital. This has been 
considered by Yamashita and Kondo.' Its energy is 
fairly high due to the large ionization potential of the 
cation ; on the other hand, it is favored by the crystalline 
Coulomb fields, i.e., it actually lowers the Madelung 
energy. 

F. Same as E£, except all transfers to the right. 


5 J. C. Slater, Quarterly Progress Reports, Massachusetts Insti- 
ute of Technology, July 15 and October 15, 1953 (unpublished). 

®G. W. Pratt, Phys. Rev. 97, 926 (1955). 

7 J. B. Goodenough and A. L. Loeb, Phys. Rev. 98, 391 (1955). 

§R. K. Nesbet, Ann. Phys. (N. Y.) 4, 87 (1958). 
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G. Configurations in which one of the anion electrons 
is excited to a higher orbital. These have been considered 
by Slater®; Pratt® has shown that they are probably 
unimportant. 


A general discussion of the various proposed con- 
figurations has been given by Yamashita and Kondo.! 
It should be noted, however, that whereas we shall use 
configuration interaction of a number of Slater determi- 
nants, Yamashita and Kondo confine their analysis, for 
the most part, to a single Slater determinant composed 
of molecular orbitals. 

Although the contributions of configurations D 
through F should not be ignored, we believe that our 
analysis of configurations A, B, and C presents the 
salient features of configuration interaction, and in par- 
ticular demonstrates the power of our method. The 
extension to the other configurations, and to more di- 
mensions, will be considered in a subsequent paper. 

Our method is an extension of the Dirac-Van Vleck® 
vector model in the manner introduced by Serber" for 
handling configuration interaction and nonorthogonality 
effects. However, we do not explicitly employ the 
permutation matrices, or their spurs, as does Serber. 
Since we are interested only in the dependence of the 
energy upon spin directions, the spin-permutation opera- 
tors are retained throughout the calculation, and at the 
end are simply replaced by the equivalent product of 
spin operators. Somewhat this sort of scheme was used 
by Anderson,’ but not to the fullest extent and sim- 
plicity, and also not to include nonorthogonality of the 
orbitals. Furthermore, we introduce the essential sim- 
plification—making the three-dimensional problem 
tractable—of properly combining the excited configura- 
tions so as to satisfy group-symmetry requirements. 


II. THE IONIC CONFIGURATION 


To illustrate the method we first work out the effects 
of overlap in the ionic configuration. To fix ideas, let this 
configuration be Mn*++-O-~-Mnt*. Let the basis func- 
tion of this configuration be, in Serber’s notation, 


v4 = (a1|171) (a2! 72) (as! 173) (arg! 14). (1) 


We will call this function @ for short, and we will 
designate the corresponding functions with orbitals 
permuted by Pa. It is to be noted that the a; are the 
space orbitals only. 

Let a1=az be the orbitals for a pair of O-~ electrons, 
which may be either 2s or 2p,. Let a3 be the left-side 
Mn++ 3d orbital with maximum directed towards the 
oxygen (i.e., of the form Y,° with respect to the 
Mn-O-Mn, or x, axis), and let a be the corresponding 3d 
function of the right-side Mn**. The overlap integrals 


9 J. H. Van Vleck, Phys. Rev. 45, 405 (1934). 
1 R, Serber, Phys. Rev. 45, 461 (1934); J. Chem. Phys. 2, 697 
(1934). 


will be given by 
(ay | avg) = S, 


2 
(a1 | a3) = +S, ( ) 


Here, and in the future, the top of the + sign is to be 
used if ai=2s function, the bottom if a:= 2, function. 

Serber’s method starts with the Dirac expansion" for 
the Hamiltonian: 


H44=) p Hp“4P, (3) 
where 


Hp44= (Pa|H|a). (4) 


Here H44 and P are ordinarily to be considered as 
matrices, the latter being the so-called ‘‘regular repre- 
sentation” of the permutation group operating on the 
space orbitals. Dirac shows that for basis functions of 
the form (1) one may replace the matrices P by the spin- 
permutation matrices P’ as follows: 


P— (—)?P*, (5) 


where (—)” is — or + for odd or even number of equiv- 
alent two-function permutations. Furthermore, Dirac 
shows that one may think of the P’ as no longer matrices, 
but as operators, and in particular the two-function 
permutations may be replaced by 


P,;°=434+28,-S,, (6) 


where S; and §S; are the one-electron vector spin 
operators. Higher permutations may be replaced by 
products of P;;’; for example, P i;.°= P.;’Pjx’. 


If, now, the orbitals are nonorthogonal, Eq. (3) is to 
be replaced by 


(H—W)44=>° p(H—W) p44P, (7) 
where 
W p44= (Pa|W|\a)=W (Pala). (8) 


Here W is the unknown energy appearing in the secular 
equation. 

One more important point needs to be considered be- 
fore Eq. (7) may be applied to our problem, namely, the 
effect of identical orbitals, such as aj=a2. As Serber 
points out, in this case P}:a=a, or P. can only have the 
eigenvalue +1. To take proper account of this, Serber 
divides the entire group of 7! permutations into one set 
of n!/2 “independent” permutations, and a second set 
of n!/2 permutations which can be obtained by multi- 
plying each of the first set on the left by Piz. We label 
the first set by P, the second set by P12P. For example, 
in the case of three orbitals, P contains P;, P13, P23; and 
Py2P contains Py2, P32, Piss. (Here Py is the identity 
permutation.) In the case of our four orbitals, P contains 
Pr, Ps, P14, Po3, Po, P34, P38, P43, P34, P43, P43, 24, 
P 4,23; and P}.P contains P 2, P132, Pr42, P1023, Pies, P12,34, 
P 342, P 1439, Py 234, P2043, Py 324, P1423. 

If one wishes to handle Eq. (7) as a matrix equation, 


up, A. M. Dirac, The Principles of Quantum Mechanics 
(Clarendon Press, Oxford, 1947), third edition, p. 218. 
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one must be extremely careful, as Serber points out, to 
eliminate from the matrix those rows and columns 
corresponding to the impossible eigenvalue —1 for Py». 
Anderson has shown that, if one wishes to handle (7) as 
an operator equation, the impossible eigenvalues may be 
eliminated by use of projection operators. We discuss 
this in the Appendix. However, one need not use the 
projection-operator formalism providing one is careful 
to evaluate all operators in accordance with the re- 
stricting condition, Pj2=1. With this condition, the set 
P\.P simply repeats all the P terms in (7), and we have 


(H—W)44=2 > p(—)?(H—W) p44P*. (9) 


As Serber has shown, this expression is incorrectly 
normalized. In going from nonequivalent to equivalent 
orbitals the wave function (1) should be divided by v2, 
and hence the right side of (9) should be divided by 2. It 
makes no difference, however, since we are going to 
formally solve for W’, in terms of the operators (6), by 
setting the right side of (9) equal to zero. Thus our 
problem is reduced to simply finding the solution W’ to 
the linear equation 


> p(—)P(H—W) p44P*=0, (10) 


under the restriction Py.7=—1. 

To a high approximation we may neglect any overlap 
or matrix elements between the cation functions a3 and 
ay. The anion-cation overlaps are given by Eq. (2). 
Thus Eq. (10) becomes 
(H,;44—W)— (H1344—WS*) (P137+ P23”) 

— (Hy444—WS?) (Py4y°+ Poy’) 


+ (43.94 4A — WS*)(P13.04°+ Pig 23°) = (). (11 ) 


In this equation H ;; is shorthand for Hp;;. 
Under the restriction Pj27=—1, we obtain from (6) 


P43°+ Pos? = Puy? + Poy’ =1. (12) 


From this we also obtain 
Py3,04°+ Ps.o37= (P13? + P23”) (Pia? + P24’) 
_ Py3° Py? — P37 Pog? 
1 aa Py P 34° Pog? P34? 
1— P34’. (13) 


With the use of (12) and (13) one readily reduces (11) to 


Hy44 — (Hy344+ A444) + M13, 2444 (1— P34”) 


W = (14) 
1—2S82+.84(1— P44”) 


To order S* the spin-dependent part of this energy is 


AW= — 28 3-S4(Ai3.0544—S4H, . a). (15) 


Here H,3‘4 and H,444 have been dropped as being of 
order S* compared to H;4*, We consider the order of a 
quantity involving two overlapping functions to be 


AND T. 


OGUCHI 


given by 


(Pa| H\a)=order of (a! H\a)(Pala). (16) 


The Hamiltonian to be used in Eq. (4) is 
A=) Nitin Gis, 
H;=Hoit Vi, 
Ho; 
V=—2 >’ (2, /rig), 


OF Gi 


(17) 
where 

(18) 
(19) 


(20) 


po (pi?/2m) —e? (Za; ‘vig i), 


Gj;=e/2r;;. (21) 


Here Z, is the atomic number of the nucleus at the point 
£, Tig is the distance between the electron 7 and the point 
g, and r;; is the distance between electrons 7 and 7. The 
Hamiltonian has been split in the form (18) since 


(a;| Ho;|\a;)= (a;j\a,) Ej, (22) 
and hence the Ho terms in S*H,44 will just equal those 
in 43,24 , a 

H144=2(ay! Hy) a1) +2(as3| 13) a3)+8(a1a3|G! aia3) 
+2(a3a4|/G)a3a4)+2(aya2!Glayas); (23) 
S*[2 (az H4\a3)+2 (a4, He a2) | 
+S 4asay G ) ajya3)+8(aja2!|G a3av4) |. 


Hy3.24 ce 
(24) 


Here, as in (2), the + sign is to be used for 2s anion 
functions, the — for 2 functions. 

When (23) and (24) are inserted into (15) the terms in 
Hy cancel, and we obtain the result given by Eq. (15) of 
Yamashita and Kondo! (YK), except for two small 
errors in the latter. Their term (BB|G! BB), which is 
equivalent to our (a;a@2\G\aja2), should be positive, 
rather than negative; and their term (4B/G| BB) 
should not be present. It is important to note that YK 
calculate the difference in energy between a ferromag- 
netic TT state and an antiferromagnetic {| state, rather 
than between a triplet TT anda singlet {| - |? . Thus for 
our result to correspond to theirs we must take §;-S, 

=+4 for ferromagnet, —} for antiferromagnet. 

It is also possible to get an exact solution to the 
problem, analagous to Eq. (16) of YK, by inserting the 
eigenvalues of S;-S, directly into our Eq. (14). 

The method used by YK is an expansion in orthogonal 
Léwdin functions. This is extremely tedious, involving 
long algebraic expressions. We have checked the YK 
result by their method, and have obtained our answer, 
but not without first making a number of algebraic 
errors of our own. 

In contrast, the operator expansion, although it re- 
quires careful study of the machinery, produces a result 
with a bare minimum of algebra. Furthermore, it has the 
considerable advantage of a compact notation, as 
witness the simple form of Eq. (15). 

The overwhelming advantage of the operator ex- 
pansion becomes apparent when more complicated 
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problems are tackled. For example, we consider the 
problem of superexchange along the row cation-anion- 
anion-cation. We number the wave functions from left 
to right a5, a1=a2, a3=a4, as. Let the anion-cation 
overlaps, as before, be designated by S; the anion-anion 
overlap by 7. Then an elementary extension of the 
arguments leading to Eq. (15) yields 


AW=- 28;- S6(H51,23,464 - — S‘7°?H 4 ai (25) 


If the anions are identical, 7? may be fairly large, and 
this form of superexchange may be appreciable. This 
could be an important effect in structures such as MnBre 
which have double layers of anions between cation 
layers. Although the ordering pattern of MnBry is quite 
complicated, there is a definite antiferromagnetic cou- 
pling along nearly linear Mn-Br-Br-Mn bond linkages.” 
Of course, Eq. (25) contains only the terms from the 
ionic configuration; but, as will be seen in the next 
section, the other configurations contribute terms of the 
same order of magnitude. 

Superexchange across two anions may also be im- 
portant in §-cubic and $-hexagonal MnS which have 
ordering of the third kind.’ Here the near-neighbor 
cations connect through a single anion, and this is 
probably the predominant exchange effect. But the fact 
that next-neighbor cations also try to order antiferro- 
magnetically indicates that the more complicated kind 
of exchange represented by Eq. (25) is probably also 
present. 

By a simple extension, we see that superexchange 
across three identical anions will be proportional to 
S*T, etc. 

We next demonstrate the simplicity of the operator 
expansion when applied to configuration interaction. 


III. CONFIGURATION INTERACTION 


The three configurations A, B, and C, defined in 
Sec. I, will now be considered. The orbitals will be 
labeled according to the following scheme: 

Cation 

(c) 

a4 


Bi=P2 


Cation Anion 


(a) (b) 
y : a3 =a 
y?: Bs By 
v°: 1=Y2 ¥3 7 


a1=—a2 


Actually, all the orbitals in any column are identical. 
They are labeled differently in order to facilitate the 
permutation problem, as shall be seen. 

We note that our calculation is restricted to the case 
of identical cation orbitals in the excited configurations. 
Thus it may be applied to MnO, but not to an anti- 
ferromagnet in which the 3d shell is less than half-filled, 
so that the cation electrons can be in different orbitals. 
However, our general method can be extended without 


? Wollan, Koehler, and Wilkinson, Phys. Rev. 110, 638 (1958). 
'8 Corliss, Elliott, and Hastings, Phys. Rev. 104, 924 (1956). 


OF SUPEREXCHANGE 


1431 


too much trouble to this case, as will be shown in a 
subsequent paper. 

With the labelling as in (26), P12 will permute identical 
orbitals in all three configurations. Thus the entire 
secular equation will contain only the permutations P 
and will be of the form 


(H#—W)44 (H-W)4* 


(H—W)*°| 
\(H—-W)84 (H—W)88 | 


(H—W)®°¢| =0. 


(H-W)°4 (H-W)¢® (H-W)¢e| 


Here the correctly normalized matrix element (H—W)44 
is given by the left-hand side of (11). Let Hp®4 
=(P8|H\a), etc. Then the other matrix elements of 
(27) are 
(H—W)84= (H,;24-—WS*) 
— (Hy424—WS)(Pyy?+ Poy’) 
+ (H 13484 — WS) (Pisa? + Psa”) ; 


(H—W)°4=+ (H—-W)*4; 


(H—W)®8= (H—W)°°= (H,24—-W) 
—(H 428 —WS?) (Pi? + Pog’) 
—_ (173428 — WS?) 3473 


(H— W)c8= (H13, 04°32 WS?) (Pis.047+ P 14,23”). 


(30) 
(31) 


Here, as in Sec. II, the upper sign is to be used with 2s 
anion orbitals, the lower with 29; also it is understood 
that all operators are to be evaluated subject to the 
restriction Pj.’=—1. The matrix (27) is Hermitian, so 
all the elements are given by the above equations. These 
equations may be simplified by use of (12), (13), and the 
relation 


(Pisa? + P2347) = (P13°+ P23") P34? = Poa’. (32 
Let us introduce a notation to show explicitly the 
order and the sign of the matrix elements: 
h,BA= H,84/(P1B\a) = H,24/S3, 
hy3,24°8 = Hy3,24°8/ (Pis,2a7| 8) = + M13, 2468 /S*, 


etc. [see Eq. (16) ]. The order of all / will be S°. Let us 
further introduce 


(33) 


wy BA =h,BA—W, (34) 
etc. Then the matrix elements may be written 


(H—W)44= (H,44—W)—S?(wy344+wy444) 
+ S413, 2444 (1— Psa”); 


(H— W ) BA — Sw, BA — Swy,84 + 931013424 Pu! " 


(H—W)88=(H-W)°°=w,*8 
— S2(w 42+ wa? ® Py”); 


(35) 
(36) 


(37) 


(H—W)°8=+S'wys,24°8 (1— Psa’). (38) 

The A configuration is gerade (even), as can be seen 
from general considerations or from an examination of 
(35). This can combine only with the even combination 
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of excited configurations, 


Y= DWYPLy"). 


This means that the matrix (27) factors into a 1X1 
ungerade (odd) part and the 2X2 matrix of interest 


(39) 


\(H-W)44 (H—W)4e 
\(H-W)¢4 (H-W)96| 


(40) 


The matrix elements are given by (35) and by 

(H—W)?4= (2')(H—W)*4; 

(H— Ww)?¢e= wy PB — $? (wy? 8 4+-w548 8 P54”) 
+S*w13,24°8 (1— P47). 


(41) 


(42) 


We note that the matrix elements appear formally the 
same, when written in this notation, for both (2s)? and 
(2p)? anion orbitals. 

The exact solution of (40) involves nothing more than 
a quadratic equation in W, but the algebra is rather 
messy. Since we are interested only in the perturbed 
energy of the ionic configuration, and only in those 
terms involving S;-S,, we may make use of a simple 
approximation, equivalent to perturbation theory. We 
replace the unknown W everywhere but in the term 
H,44—W by its zeroth-order approximation 44. This 
amounts to a change of 


w,BA — g,FA=h,24—H,A4, 


(43) 
etc.4 We then have a simple linear equation in W, the 
solution of which is a term not involving §;-S, plus 
AW= — 2S4S3-S4{ 913,244 A 
+2[ (q1484)? (5488+ ¢13,24°8)/ (qr? 8)" ] 
—4 (gis? 4quss®4/qr®¥)}. 
This is our result. We repeat here the notation: 


gp =(H p**/(Pa|8) |—H1*4. 


(44) 


(45) 


The first term in (44) is the effect of overlap in the 
ionic configuration, and is the same as (15). The term in 
square brackets is a generalization of the sum of the 
Anderson and the Anderson-Hasegawa mechanisms, as 
we shall see in detail in the next section. The last term 
in (44) is completely new. 

As previously noted by Yamashita and Kondo,! all 
configurations first contribute to superexchange in the 
order S*. 


IV. MATRIX EXPRESSIONS 


In this section we exhibit the detailed form of the 
matrices entering into Eq. (44). We write all matrix 
elements in terms of the orbitals a, 6, and c, as given by 
the scheme of (26), and we use the simplest notation 
possible. To illustrate this notation, the matrix elements 

4“ This procedure can be shown to be correct, to order S* in 
S;-S,, by the more general method of expanding W in a power 
series in S everywhere in (40) and equating like powers of S?, S‘, 
and S‘S;-S,. 
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given in (23) and (24) are rewritten as follows: 
HH 744 = 2H o0°+2H cc°+ 8G bc, be 
+ 2Gac,act 2G ob, 03 
Hj3,2444 = S*(2H ».°+2H eo”) 
+.S?(4G ab, bat8G v5, 0c): 
The other matrix elements are: 
Hy? = Hev’+ 2G be, bb+2Gac,ast Ce 
+S(H oy°+ 2H aa°t+4Ga b,abT re ; 
13484 = S(2G ab, bat 4Gac, bb) 
+:9°(H.y°+ Hav’+ Hyat+ 2G ee, bot 4Gac, be) 
+SHee; (49) 
H;,2*= 2G ba,abS (8G be,ac tH ba°t+H ap”) 
+3°(2H ce°+2Gee,cc); (50) 
H33,24°8 = 2G ba,co+S(H av? +H ba°t+4Gaa,a bt+4G ac, be) 
+S$?(2H ce°+2Gac,ac); (51) 
a HH, wo +3H c+ 2Gc,00 7 4G at, 067 66 bc, be- (52) 


(46) 


(47) 


(48) 


In these equations the + signs are to be taken when 
considering (2s)? anion orbitals, the — signs when con- 
sidering (2p)*. From (22) it can be seen that the Ho 
terms will cancel in all the gp*® appearing in (44) except 
qr®*. This latter will contain, in addition to terms in V 
and G, a term 


Ho,aa*— Ho, v»°= €, (53) 


where ¢ is the transfer energy. 

From an examination of the matrix elements it can be 
seen that, if we neglected terms containing S explicitly, 
the ionic contribution (47) would disappear. Further- 
more, H,424 would become Anderson’s transfer integral 
b, except that we have added terms in G; 13428 would 
become Anderson’s exchange integral J’; and H13,24°? 
would become the Anderson-Hasegawa exchange inte- 
gral J”. Equation (44) would reduce to the familiar 
form!: 


AW — —28;-S,[20?(J’+J”)/(AE)*], (54) 


where AE is the energy difference between excited and 
ground configurations. The extra factor of 2 on the 
Anderson term comes from adding the contributions of 
our two configurations B and C. 


V. NUMERICAL CONSIDERATIONS 
A rough numerical evaluation of Eq. (44), appropriate 
to the MnO problem, will now be attempted. Probably 
most of the superexchange comes from the oxygen 2p 
orbitals. We express these very approximately as the 
Slater functions: 
b(r,0) = (6°/m)re—*" P1 (cos) ; (55) 
and we use for the manganese orbitals 


a(r) = (65/3) re—*", 
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with origins a distance + R from the oxygen. We take R, 
for convenience, as 4 atomic units; this is very nearly 
the observed separation. The value of 6 is given below. 

The functions (55) and (56) are, of course, extremely 
crude. They are, however, relatively simple to handle, 
and molecular integrals using these functions have been 
well tabulated.!® The chief difficulty is that the Slater 
functions drop off with distance much more slowly than 
the more accurate Hartree functions, and therefore will 
give an overestimate of all of our integrals. We may 
remedy this defect, however, by picking 6 so as to give a 
reasonable value of the overlap integral S. For ease in 
handling the tables, we choose 


5=9/4, 57) 
which gives 


S=0,04649. (58) 


This is reasonable. A determination of the overlap in 
Mnf», with use of Hartree functions, yields'® S=0.060; 
and it is probable that the overlap in MnO is close to 
this value. 

In evaluating the V integrals, we have used an 
effective nuclear charge of +2 on all ions. This amounts 
to treating the metal ions as Het-like, and the anions as 
neutral He-like. The three-center integrals have been 
approximated as best we could, assuming the overlap 
small. 

The values are 


913,2444 = — 1.50; gisB4 = —0.660; 
g3423 = —0.708; gis,24°? = — 1.16; 


giss24 = —1.35; q1®? =0.0089+- e. 


’ 


(59) 


Here e€ is given by (53). 

We now evaluate Eq. (44) as a sum of three terms: 

(I) the ionic term involving g13,2444; 

(II) the term in square brackets, which is a generaliza- 
tion of the Anderson and the Anderson-Hasegawa 
terms; 

(III) the remainder involving g13434. 


Thus we write Eq. (44) in the form 


W = —2S'S;-S,(I+1I+ 111) = —28;-SyJ. (60) 


In Table I we give the values of I, II, III, and J for 
three different values of «. Elsewhere'® we have esti- 
mated ¢ for MnF» to lie between 10 and 15 ev. 

Since the Néel point should be of the order of the 
number of linear couplings (in this case 3) times J, it is 
seen that the values are very good. It should be noted 
that all three terms contribute to an antiferromagnetic 
coupling and that terms II and III are most important. 
In term IT, according to (59), the generalized Anderson- 


'’ Kotani, Amemiya, Ishiguro, and Kimura, Table of Molecular 
Integrals (Maruzen Company, Ltd., Tokyo, 1955). 

'6 Keffer, Oguchi, O'Sullivan, and Yamashita, Phys. Rev. (to be 
published). 
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TaBLE I. Evaluation of the terms in Eq. (60) for different values 
of the transfer energy. All values, except where noted, are in 
atomic units. 


€ I II III 


~=13.0  1.69X10~ (53°K) 
—7.79 0.80X10~ (25°K) 
—433  0.38X10~ (12°K) 


0.184 (Sev) 1.5 
0.368 (10 ev) --1.! 
0.735 (20ev) —1.5 


0 21.7. 
—7.17 
—2.38 


Hasegawa exchange q13, 247 is larger than the generalized 
Anderson exchange g34?. 

We have made the evaluation using only the oxygen 
(2)? orbitals. To this should be added the contribution 
from the (2s)? orbitals. The latter orbitals will have an 
overlap integral which is only slightly smaller and a 
transfer energy € which is very much larger than the 
former, at least if MnO is similar to MnF», for which 
careful calculations have been made.'* This means that 
the contributions to the three terms from the (2s)? 
orbitals, as compared to that from the (2p)? orbitals, 
will probably be as follows: about the same or slightly 
smaller from term I, very much smaller from term II, 
much smaller from term III. The total contribution 
from the (2s)? orbitals may be of the order of one-fourth 
that from the (29). 

In a subsequent paper we will consider in detail the 
contributions from the other configurations mentioned 
in Sec. I. As an example of what happens, we now 
discuss briefly the contribution from the interaction be- 
tween configurations A and D. By second-order per- 
turbation, the exchange-like term will be 


AW = 2S48;-S4(M)?/q1””, (61) 
where M is a matrix element connecting the configura- 
tions. By an elementary consideration it is seen that M 
is identical to q13,24°8, the Anderson-Hasegawa term. 
[For orthogonal orbitals, from (51), this reduces to 
2G ba,cb, Which is the integral considered by Nesbet.® ] 
The contribution of (61) to J will thus be given by 


J4) = (1.16)?(0.0465)4, qr”? 


~ (0,063 10-4) /gy” a.u. (62) 


This will only be of importance compared to the terms 
in Table I if the double-electron transfer energy ¢, 
which is the principal part of g7?”, is less than ~0.37 
a.u., or 10 ev. Because of the very large crystalline 
electrostatic energy involved in this transfer, we doubt 
that €, can be this small. It should be pointed out that 
our orbitals are different from the orthogonal set used by 
Nesbet,* and no direct comparison of Eq. (61) should be 
made with his results. By an orthogonalization process 
one automatically mixes into the AD matrix elements 
some of the overlap terms contained in our matrix (40) ; 
and because of this Nesbet’s AD terms might well be 
more important than ours. Nesbet does not consider 
configuration interaction due to transfer of a single 
electron (B and C configurations) because the_effects 
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“vanish to second order in perturbation theory.’’ We 
believe, however, that the important criterion is not the 
order of perturbation theory, but the order of the 
overlap. It is true that our B and C contributions arise 
only in third-order perturbations, but they come in as 
S* and are therefore of the same order of magnitude as 
(61). Indeed, they are probably larger if our estimate of 
€2 is Correct. 

To the contribution (61) must, of course, be added the 
terms coming from the interaction of D with B and C. 
These contributions, together with those from the other 
configurations mentioned in Sec. I, must be considered 
in any complete theory. 


APPENDIX 


In this Appendix we demonstrate various numbering 
schemes for handling the configuration interaction. For 
simplicity we consider only the ground state ¥4 and the 
single excited configuration y¥, and we neglect overlap 
of wave functions except insofar as it is involved in 
transfer and exchange integrals. We will discuss three 
methods of numbering the orbitals. 

First Method.—This is the same as used in the text, 
that is: 

ad b 
y*: a3 Qi=a2 
y?: B3 Bs 
The secular matrix is 
(H,;44—W) (HA ®Pygt+Hoy4 8 P24) 
|. (A.1) 
(Hi48 'PistHoy? 'Po,) (H,®8+-H348?P3,— W) | 


We define the transfer integral 6, the exchange integral 
J’, and the energy difference between the ground and 
excited states AF as follows: 


v= fesnn 1 ) eel 1)dr: 


(A.2) 


J'= f eald)es()H BA C013) eo(Adratrs 


AE=H,88—-H, ‘ a 


Since Hy448=H»48=6, and we can use (12) of the 
text, we find by the method of the text that the super- 
exchange is given by 


AW = —2J'(b/AE)*S3:S,. (A.3) 
This is the Anderson term for a single excited con- 
figuration. 


Second Method.—The numbering scheme is the same 
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as the first method, except 
y?: Bs Bs 
The secular matrix now becomes 
(H13448 P34 t+ Ho3448 Pos) | 


(7? 8+ Hy348P 54 os W) | 
(A.4) 


(H;44—W) 
(14384 P 434+ Ho4374 P43) 


We note that Hy4324=Ho4384=6b, and we may use 
Eq. (32) since we still have P;2.=1 everywhere in the 
matrix. We obtain 


AW = (bP 347) (—2J'S3-S4) (bP 347)/(AE)?. (A.5) 


This readily reduces to (A.3). 
Third Method.—The numbering scheme is the same as 
the first method, except 
y?: B3 By Bo=B4. 
This is the method of Anderson. We note that Pj2=1 in 
the ground configuration, whereas P2,=1 in the excited 
configuration. We may impose these restrictions by 


introducing the projection operators 


Oy=43(1+P,;). (A.6) 


These operators have the eigenvalues 1 and 0 according 
to whether P,; has the eigenvalues 1 or —1; that is, the 
operators project out only the symmetric space func- 
tions, as is appropriate for identical orbitals. The secular 
matrix is 


Oj0( H,4 A W)O1. 
Oog(Hp2 4+ Hyo®4 Pys)Oy2 Oog( Hy ® 8+ Hy 32® Py; —W) Ova) 
(A.7) 


Oy2(H 148+ HogABPo4)Oo4 | 


We now use H,24=H,.24=H,48=HyA2=5 and we 
obtain 


AW =O 2(1+ P24)024024(—2J’S,- Ss) 
XK O24024(1+ P12)012(6/AE)?. 


This also reduces to the result (A.3). However, the 
method is more involved than that used in the text. We 
note that Anderson, in his paper? and in his lecture at 
the University of Tokyo (1953), took the nondiagonal 
element of (A.7) as b instead of b(1+ P12)=0b(1+ Pos) 
=2b (when properly projected). Thus his numerical 
factor is different from ours. 

The projection-operator method must be used, of 
course, if the cation orbitals c are not identical, as may 
be the case if the 3d shell is less than half-full. The 
simple method of the text is adequate, however, for Mn, 
Fe, Co, and Ni salts, 


(A.8) 
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The term splitting has been calculated for the lowest lying J state of the rare earth ions samarium through 
through ytterbium for the case in which the ions are simultaneously subjected to exchange and crystalline 
fields of arbitrary relative magnitude. The crystalline field is taken to be cubic and as representing the 
potential of eight negative ions situated at the corners of a cube. The calculation is carried out for the 
magnetic axis in the two major (local) crystalline directions, [100] and [111]. The relevance of these calcu- 


lations to the rare earth iron garnets is discussed. 


INTRODUCTION 


HE subject of term splitting of ions in crystals has 

long been of interest to physicists, and the in- 
creased attention paid it in recent years has been 
stimulated by the invention of the solid-state maser. 
Group-theoretical techniques built upon the pioneer 
work of Bethe! enable one to predict the character of 
the term splitting when only the symmetry properties 
of the perturbing fields are known; computational 
procedures have been developed by Van Vleck? and co- 
workers and Elliott and Stevens’ for calculating the 
energy matrix elements arising from the perturbing 
fields. The problem of calculating the actual splitting 
in some simple cases then reduces to diagonalizing a 
matrix of finite but awkwardly large dimension. 

In most physical systems hitherto of interest, the 
electric-field (Stark) splittings of the ions involved 
were orders of magnitude greater than the magnetic 
splittings, so the magnetic splittings of either dipolar 
or exchange origin have usually been calculated as 
additional small perturbations upon Stark-determined 
states. In a recently discovered crystal system of great 
physical interest—the ferrimagnetic rare earth iron 
garnets—the magnetic splittings are equivalent in size 
to the electric splittings; hence one is led to consider 
the diagonalizing of energy matrices containing crystal- 
line field and magnetic exchange elements of equivalent 
size. 


CHARACTERISTICS OF THE RARE EARTH 
IONS IN CRYSTALS 


The intriguing features of the rare earth ions are 
mostly associated with the partially filled and deeply 
buried 4f shells characteristic of these elements. The 
neutral atoms have an electronic configuration which 

1H. A. Bethe, Ann. Physik 3, 133 (1929). 

2 J. H. Van Vleck, Electric and Magnetic Susceptibilities (Oxford 
University Press, New York, 1932); Phys. Rev. 41, 208 (1932); 
W. G. Penney and R. Schlapp, Phys. Rev. 41, 195 (1932); R. 
Schlapp and W. G. Penney, Phys. Rev. 42, 666 (1932); W. G. 
Penney and G. J. Kynch, Proc. Roy. Soc. (London) A170, 112 
(1939) ; G. J. Kynch, Trans. Faraday Soc. 33, 1402 (1937). 

3R. J. Elliott and K. W. H. Stevens, Proc. Phys. Soc. (London) 
A64, 205 (1951); A64, 932 (1951); A65, 370 (1952). K. W. H. 
Stevens, Proc. Phys. Soc. (London) A65, 209 (1952); R. J. Elliott 
and K. W. H. Stevens, Proc. Roy. Soc. (London) A215, 437 (1952). 


can be written in the conventional notation as 
(1s)?(2s)?- + - (4f)7(5s)?(5p)®5d(6s)?, 


where x ranges from 0 to 14. The rare earths commonly 
enter ionic crystals in a trivalent state, having lost the 
one 5d and two 6s electrons. All net angular momentum 
then resides in the unfilled 4f shell, which is shielded 
from external fields by the eight 5s and 5p electrons. 
The shielding is sufficiently effective that the crystalline 
Stark splitting of the free ion terms totals only hundreds 
of reciprocal centimeters, as contrasted with the splitting 
of thousands of reciprocal centimeters typical, say, of 
the transition elements. The Stark splitting is an 
inverse function of x, the number of 4f/ electrons 
present, and diminishes from about 600 cm for Cet+++ 
(x=1) to about 100 cm™ for Yb*+**+(x=13). The spin- 
orbit interaction is sufficiently strong (except for Eu 
and Sm) that it separates the terms of different J by 
thousands of cm™'; therefore, even in the crystal, J 
usually remains a “good”? quantum number. The total 
term splitting of some 10? cm™ is further divided among 
the 2/+1 levels within a term, which gives rise to 
individual splittings on the order of tens of cm™ and 
explains why at room temperatures (k7= 200 cm™) the 
rare earth ions exhibit almost free-ion magnetic 
susceptibility. 

The rare earth ions with x>5 are of appropriate 
radius to enter the rare earth iron garnets (3R.O; 
-5Fe2O;).4 In this crystal the ions are not only acted 
upon by crystalline electric fields, but they also partici- 
pate in an exchange interaction with the iron ions in- 
volving energies of about 25 cm™ per unbalanced 4f 
electron spin. Since as many as seven unbalanced spins 
can occur, the term splitting due to exchange is, in 
general, of approximately the same magnitude as the 
Stark splitting. The following calculations were under- 
taken to examine the behavior of the energy levels of 
such a system. 


4F. Bertaut and F. Forrat, Compt. rend. 242, 382 (1956); S. 
Geller and M. A. Gilleo, Acta. Cryst. 10, 239 (1957); Aleonard, 
Barbier, and Pauthenet, Compt. rend. 242, 2531 (1956); R 
Pauthenet, Compt. rend. 242, 1859 (1956) ; 243, 1499 (1956) ; 243, 
1737 (1956). R. Pauthenet, thesis, University of Grenoble, 1958 
(unpublished). 
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FORMULATION OF THE PROBLEM 


We will take the perturbing potentials which the 
free ion encounters when placed in the crystal as 


H.=H (Stark)+H (exchange) 
= V (crystal) +24 50 .; 81i-82;, (1) 


where V (crystal) is the electric potential generated by 
nearest neighbors, electrons labeled with subscript 1 
reside on the perturbed ion, electrons with subscript 2 
reside on the exchange-coupled neighbors, and A is the 
exchange energy involved. We discuss now the explicit 
form we shall assume for these potentials. 

The rare earth ions enter the garnet lattice in a posi- 
tion of roughly cubic symmetry. The ion has eight 
(oxygen) nearest neighbors situated at the corners of a 
quite badly skewed cube, as illustrated in Fig. 1.° In 
the following calculation the crystalline potential will 
be taken to be purely cubic, and only the leading term 
of the potential, that which is biquadratic in the 
coordinates, will be included. The next higher term 
in the cubic potential, of sixth order in coordinates, is 
neglected as being (it is hoped) considerably smaller 
than the biquadratic terms. Effects due to all terms 
higher than sixth order will rigorously be zero for 4f 
electrons. The calculation must be regarded as treating 
a system similar to the garnet system but considerably 
simplified. Insofar as the cubic potential dominates the 
actual situation (as it very likely does), the simplified 
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Fic. 1. Coordina- 
tion polyhedron of 
rare earth ion in the 
rare earth iron gar- 
net.® 


Fic. 2. Term splitting, J =5/2, magnetic axis in 
local [100] direction, arbitrary energy units. 


® Figure 1 is based on data from S. Geller and M. A. Gilleo 
(reference 4). 
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Fic. 3. Term splitting, J=5/2, magnetic axis in 
local [111] direction, arbitrary energy units. 
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Fic. 4. Term splitting, J=7/2, magnetic axis in 
local [100] direction, arbitrary energy units. 


model should yield correctly the main features of the 
garnet system. 

The operator-equivalent approach of Elliott and 
Stevens*® was used in obtaining an explicit form for 
V (crystal). The treatment regards V (crystal) as an 
operator acting upon the 2/+1 dimensional space of 
states of fixed J. It is therefore very material to the 
calculation that J be a “good” quantum number. As 
observed earlier, the fact that configurations of different 
J are separated by energies from 10 to 100 times larger 
than the intraterm splittings (except for Eu and Sm) 
argues that J is indeed a good quantum number for all 
the rare earths with x>7. 

The explicit form of V (crystal) involves J, and hence 
depends on the direction of quantization of the free 
ion with respect to the crystalline axes. The energy 
levels are treated here for two simple cases: the mag- 
netic axis along the local [100] and [111] directions. 
The local [100] and [111] directions do not correspond 
to the crystal [100] and [111] directions; see Fig. 1. 
The crystalline potentials are (following Elliott and 
Stevens?) 

V[100]= Vol [3574-307 (J+1)J 2425/2 
—6) (J+1)4+-3P(J+1))—§LJ4'+-J*)}, 
V[111 = Vol —3[35/4—30J (J+1)J24+25J2 
— 6) (J41)43I2(I+1)2]+ (10V2/3) 
XL (I 2+-I*)+ J42+-J_)J,)}. 
In the ensuing calculation Vo has been chosen in a 


normalized fashion such that in each case the total 
Stark splitting is 100 scale units, and its sign has been 


(2) 
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taken to be that which would be given by eight negative 
charges at the corners of the coordination polyhedron. 

The exchange energy is much simpler in form. Since 
the 4f electrons of the rare earths with x>7 present a 
case of predominantly Russell-Saunders coupling, we 
may assume that the separaie spins assemble to form 
a fixed total S, accomplishing for us the summation 
over i of Eq. (1). In the garnet, each rare earth ion is 
weakly exchange-coupled to two ferric ions, which are 
in turn strongly exchange-coupled to other ferric ions. 
Below the Curie temperature the ferric ions of the 
crystal are bound into a state of very large effective 
spin §. At low temperatures the spins of the ferric ions 
are substantially aligned, and taking the direction of 
alignment to define the z axis, we have 


H (exchange) = 27,4 S12S22, (3) 


where 7; is the number of unpaired electron spins of the 
rare earth ion. Equation (3) assumes this simple form 
only because $>>1, a necessary condition if the trans- 


Fic. 5. Term splitting, J=7/2, magnetic axis in 
local [111] direction, arbitrary energy units. 


Fic. 6, Term splitting, J=6, magnetic axis in 
local [100] direction, arbitrary energy units. 
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Fic. 7. Term splitting, J =6, magnetic axis in 
local [111] direction, arbitrary energy units. 
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Fic, 8. Term splitting, J=15/2, magnetic axis in 
local [100] direction, arbitrary energy units. 


Fic. 9. Term splitting, J=15/2, magnetic axis in 
local [111] direction, arbitrary energy units. 
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Fic. 10. Term splitting, /=8, magnetic axis in 
local [100] direction, arbitrary energy units. 


Fic. 11. Term splitting, J=8, magnetic axis in 
local [111] direction, arbitrary energy units. 


verse components of the vector dot product are to be 
negligible. Keeping in mind that S; can have a nonzero 
time-averaged projection only along J;, we can redefine 
our energy constant and have simply 


H (exchange) = A’J,. (4) 


For the rare earth garnets the value of A’ is approxi- 
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TABLE I. Rare earth ion parameters. 
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mately known for each ion on the basis of saturation 
magnetization curves,‘ and in the special case of Gd*** 
on the basis of antiferromagnetic resonance data.® For 
the purpose of calculating more generally useful curves, 
in the following calculation A’ has also been normalized 
to give a total Zeeman’ splitting of 100 scale units. 

Inasmuch as we are interested in the energy term 
splittings as various proportions of Stark and Zeeman 
energies are included in the perturbing potential, the 
matrix elements, eigenvalues, and eigenvectors have 
been calculated on an IBM 704, for the perturbing 
Hamiltonian, 


H,=(1—a)H (Stark)+aH (exchange), (5) 
for values of a from 0 to 1 and for J’s corresponding to 
the ground states of all rare earth ions entering the 
garnet structure. The eigenvalues for these cases are 
presented in graphical form in Figs. 2 through 11.* 


INTERPRETATION AND USE OF RESULTS 


The application of one of the energy-level schemes 
(Figs. 2 through 11) to a specific physical system re- 
quires certain prior knowledge of the system, i.e., the 
total Stark splitting, AZ, and the total Zeeman splitting, 
AZ, which would be produced by the two kinds of 
perturbations acting separately. For the rare earth 
garnets, AF is not usually known, though the optical 
and infrared spectroscopy which will yield this informa- 
tion is under way in several laboratories. AE has been 
measured for several of the rare earths in other crystals, 
the fluorides, chlorides, and hydrated sulfates and 
nitrates.’ Since the total splitting does not seem to be 
a strong function of the host crystal, one can make an 
estimate, good to perhaps +50% of AF in the rare 


6 Geschwind, Walker, and Linn (to be published). 

7 All magnetic-like energies, including exchange, will be loosely 
termed Zeeman energies in the following discussion. 

8 Compare Fig. 4 with a similar plot in C. Kittel and J. M. 
Luttinger, Phys. Rev. 73, 162 (1948). 

® An inclusive list of references on the subject of optical spec- 
troscopy and low-temperature susceptibility measurements of the 
rare earth ions would be prohibitively long. The Stark splittings 
in Table I were based on several sources, largely publications 
bearing the names Spedding, Nutting, Mookherji, Hellwege, and 
Dieke. A recent review article with a comprehensive relevant 
bibliography is W. A. Runciman, Repts. Prog. in Phys. 21, 30 
(1958). 
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earth garnets. The values of AF listed in Table I were 
obtained by such an estimation process, interpolating 
or extrapolating from ion to ion when necessary, and 
must be considered to be correspondingly uncertain. 
There is a very good possibility that AZ in Table I may 
be systematically low, if higher levels of the term 
splitting are absent in the references on which the 
estimates are based. The value of AZ can be obtained 
from the static magnetization curves of the rare earth 
garnets, or, for the special case of the gadolinium 
garnet, from the antiferromagnetic resonance data.° 
The two methods give results that are in essential 
agreement. The resonance measurement is probably 
the most accurate and leads to a AZ of approximately 
25 cm per unbalanced spin. The total Zeeman splitting 
is here taken to be linear in the unbalanced spin; the 
slight change in exchange energy resulting from the 
variation in ion size within the pertinent group of rare 
earths is ignored. The static magnetization data of the 
garnets indicate that this linear approximation is 
essentially valid. 

Knowing AE and AZ, one can calculate a, the param- 
eter in Figs. 2 through 11 that specifies the relative 
importance of Stark and Zeeman splitting: 


a= AZ/(AZ+AE). (6) 


The ordinate scale in energy units at any a is given by 


AW /div= (AE+AZ)/100. (7) 
All these parameters are listed for each ion of interest 
in Table I together with values of M, of the lowest 
lying energy level for the magnetic axis in each of the 
two directions treated. M, is calculated from the eigen- 
vectors of the ground state, which are not tabulated 
explicitly in this article. Identification of the lowest 
lying energy level requires knowledge of which ordinate 
direction represents increasing energy; for a given 
crystal potential and J, the Stark splitting may still be 
of either sign, depending on the composition of the J 
state, and Figs. 2 through 11 could be either right side 
up or upsidedown. The figures have all been plotted 
right side up as referred to the rare earth garnet con- 
figuration on the basis of the calculations of Elliott and 
Stevens,* with the exceptions of Tb, Er, and Tm, for 
which the figures are inverted. 

Various macroscopic magnetic characteristics of the 
rare earth garnets can, in principle, be calculated or 
estimated from a knowledge of the rare earth ion levels: 
saturation magnetization at low temperatures, mag- 
netic anisotropy, effective g-factors and line widths in 
ferrimagnetic resonance. Some of these properties will 
be calculated in a subsequent article. However, it is 
appropriate here to point out some of the difficulties 
inherent in this subject for the benefit of those intending 
to make similar use of the individual-ion information. 

First, the crystal directions of the cubic garnet unit 
cell do not coincide with the directions of the local 





ENERGY LEVELS 
coordination cube. The oxygen cubes occur in the unit 
cell in six inequivalent positions, with one local (100) 
direction of each along one of six diferent (110) direc- 
tions of the unit cell. Figure 1 is drawn with the unit 
cell (100) directions used as coordinates and shows 
only one of the six possible local configurations. 
Second, the magnetic axis in Figs. 2 through 11 is 
determined by the direction of My,, not of the applied 
magnetic field. Since the absolute energy of the lowest- 
lying level is a strong function of the direction of 
magnetic axis (differing by 8 cm from [100] to [111 ] 
in Dy***+), there is a marked tendency for the rare 
earth ion to twist the polarization of the Fe ions to 
which it couples away from Hexternat. The macroscopic 
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magnetic properties will have the symmetry of the unit 
cell, but strong distortions of M within the unit cell 
must be expected. 

Finally, the idealizations of the local symmetry 
mentioned earlier are again reiterated to indicate the 
degree of uncertainty which must reside in any calcula- 
tion of properties of the total actual system. 
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Recent experimental results on low-temperature heat capacities and elastic constants of alkali halide 
crystals are examined for evidence of anharmonic contributions. It is concluded that if such effects are 
present their contribution to the characteristic temperature at 0°K is less than that of uncertainties in the 


experimental results. 


OME new experimental results on low-temperature 

properties of alkali halide crystals were recently 
obtained at the Rice Institute, and reported in this 
journal. Norwood and Briscoe! have measured elastic 
properties of KCI and KI over the temperature range 
4° to 300°K, and Scales? has measured heat capacities 
of LiF and KI over the range 2° to 7°K. One aim of 
this work was to compare the limiting characteristic 
temperatures, ©) and (elastic), given, respectively, by 
calorimetric and elastic measurements as 7 approaches 
O0°K. These should be equal provided that the lattice 
vibrations are purely harmonic; but recent work by 
Ludwig’ has suggested that a combination of zero-point 
energy and anharmonicity may cause differences of the 
order of 1%. The conclusion reached in the Rice 
Institute papers is that some of the experimental results 
may support Ludwig’s theory. 

It is clear that modern refinements in experimental 
technique are beginning to make it worth while to look 
for anharmonic effects in the heat capacity of crystals,* 
but in order to detect these it is important that the 
consequences of harmonic lattice theory be kept clearly 


* National Research Contract No. 5395. 

1M. H. Norwood and C. V. Briscoe, Phys. Rev. 112, 45 (1958). 
2 W. W. Scales, Phys. Rev. 112, 49 (1958). 

3W. Ludwig, J. Phys. Chem. Solids 4, 283 (1958). 

4 Flubacher, Leadbetter, and Morrison, Phil. Mag. 4, 273 (1959), 


in mind during the examination and interpretation of 
experimental results. We believe that the Rice Institute 
papers are open to criticism in this respect and that 
some of their arguments could be seriously misleading ; 
this applies particularly to Scales’s estimation of © 
from his experimental results. 


THE ESTIMATION OF ©, 


Scales starts by assuming that below a temperature 
of the order of © /50 a ‘‘true 7* region” is reached, in 
which C,=a7*; in this region ©» would be effectively 
constant and equal to ©». Now this assumption neglects 
the effect of dispersion on the lattice waves of low 
frequency, which in fact introduces further terms in 
higher powers of T into the expansion for the heat 
capacity. The contribution of such terms does not in 
general suddenly disappear at some finite temperature, 
as is suggested by Scales’s graph of ©p for KI (his 
Fig. 6). On the contrary, Op must approach a constant 
value asymptotically according to the expansion 


-}; 


this is equivalent to the quite general expansion for 
the lattice heat capacity 


-) p> ‘ Oo {1—A (T ,)? + B(T ()o)4- . (1) 


C= aT bT 4 cTi ++, 
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Whether or not the change in ©p with temperature is 
observable over a certain range depends upon the value 
of A and of the other coefficients in the expansion, and 
of course upon experimental uncertainty. Scales’s 
assumption of a region of constant Op below ©/50 is 
based on a very common but most unfortunate mis- 
understanding of Blackman’s work. Blackman®.® in- 
vestigated the shape of the lattice frequency spectrum 
at low frequencies for a number of different crystalline 
models, and concluded that no general prediction could 
be made of the sign and magnitude of A. As a rough 
guide, however, one can say, not that 9p will be 
constant below © /50, but that its total variation is 
unlikely to be much more than 1%.° Thus if Qo is 


required to within 1% or better, it is necessary to 
consider the variation of @p below ©»/50. In another 
paper’ we have examined this whole matter in con- 


siderable detail. 

While great care is thus needed in interpreting the 
results for KI, the temperature range of Scales’s 
measurements on LiF falls well below T=0,/50, so 
that sufficiently accurate measurements would be 
expected to fix ©» to within a few tenths of 1%. In 
general, however, low-temperature calorimetry becomes 
increasingly difficult and less accurate in the region 
T<©,/100. Where possible, therefore, results over a 
wider range of temperatures should be used to help 
determine ©o. It is clear from Eq. (1) that a sensitive 
method of plotting results at low temperatures is to 
plot @p against 7?: this should give a smooth curve 
whose intercept with the axis 7?=0 will give 9o, while 
the slope can be used to derive A. We have shown’ how 
an equivalent method (C,/T* is plotted against 7°) 
makes use of results in the region T7<©/25. By 
comparison, a plot of C, against T* is very insensitive. 

This method of smooth extrapolation (of the graph of 
C,/T* against T*) has been used® to obtain © for 
several alkali halide crystals. The value derived for 
KI was 132.3+1.0°K, which is to be compared with 
Scales’s result of 128.3°K. Reference to Fig. 6 of 
Scales’s paper shows that the major part of the differ- 
ence is to be ascribed to his assumption of a strict T* 
law for 7<3.5°K, which puts great weight on the 
measurements at the lowest temperatures at the expense 
of those in the range 4°<7<7°K. It is important also 
to notice that his value of ©» implies an unexpectedly 
sharp curvature in the graph of ©» against T at about 
3.5°K. 

To some extent a similar criticism can be made of the 
discussion of the results for LiF. Taken by themselves 
Scales’s results indeed indicate his estimate of © 
(~722°K), and moreover they are in good agreement 
with the lowest temperature results of Clusius and 


5M. Blackman, Proc. Roy. Soc. (London) A149, 117 (1935). 

6M. Blackman, Proc. Cambridge Phil. Soc. 33, 94 (1937). 

7T. H. K. Barron and J. A. Morrison, Can. J. Phys. 35, 799 
(1957). 

§ Barron, Berg, and Morrison, Proc. Roy. Soc. (London) A242, 
478 (1957). 
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Eichenauer®; but again it is important to realize that 
acceptance of these results involves an unexpectedly 
sharp curvature in the Op versus T graph at about 
20°K. We have discussed this example in detail else- 
where,’ and have shown that a combination of Clusius 
and Eichenauer’s results at higher temperatures and 
Martin’s” for T<20°K gives a smooth curve for C,/T* 
against 7° and suggests that @o is about 743°K, three 
percent higher than the value estimated by Scales 
(722+6°K). 

There is therefore conflicting evidence for the be- 
havior of C, for LiF below 20°K. The results of Scales 
and of Clusius and Eichenauer indicate that there is a 
true J* region up to about T= © p/35, after which Op 
suddenly starts to decrease; while Martin’s results 
suggest a smooth variation of Op. The sharp curvature 
could be explained either if A (and B) in Eq. (1) were 
small or by a phase transition. On the other hand, the 
smooth variation seems to us more likely, especially 
when the results are compared with those for other 
alkali halides and with theoretical work on ionic lattices. 


THE ESTIMATION OF © (ELASTIC) 


The calculation of ©(elastic) from elastic constants 
involves the averaging of the inverse cube of the sound 
velocity over different directions in the crystal. Norwood 
and Briscoe! do this in two ways: by using de Launay’s 
tables" for cubic crystals, and by using the semi- 
analytical “Alberta” method of Betts, Bhatia and 
Wyman"; the answers differ by about 13%, and are 
quoted as having equal authority. In fact, a clear 
distinction can be drawn between the two methods. 
De Launay’s tables are obtained by straightforward 
numerical integration over a large number of different 
directions in the crystal. Where they have been com- 
pared with other accurate calculations, the tabulated 
values have been found reliable to within at least 
0.01%. Provided that care is taken over the interpo- 
lation (by drawing curves or by using a suitable 
formula), they should provide accurate results with 
errors of at most a few tenths of 1%. By contrast, the 
“Alberta” method usually takes fewer directions in the 
crystal, but makes more use of the crystal symmetry. 
It is more general than de Launay’s tables, since if the 
dispersion relations in a few crystal directions are 
known it can be used to estimate not only ©» but also 
A of Eq. (1). (See, for example, Horton and Schiff."*) 
In the form quoted by Norwood and Briscoe! it is, 
however, only accurate to within at best about 1%," 


® K. Clusius and W. Eichenauer, Z. Naturforsch. Ila, 715 (1956). 

10 T). L. Martin, Phil. Mag. 46, 751 (1955). 

1 J. de Launay, J. Chem. Phys. 24, 1071 (1956) ; 30, 91 (1959). 

2 Betts, Bhatia, and Wyman, Phys. Rev. 104, 37 (1956). 

18 G. K. Horton and H. Schiff, Can. J. Phys. 36, 1127 (1958). 

14 Better accuracy is achieved in a later paper by E. E. Budzinski 
and H. Schiff [Can. J. Phys. 35, 507 (1957) ]. 
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and so in preference de Launay’s tables should always 
be used to obtain (elastic) for cubic crystals. 


CONCLUSION 


Revised values of ©» and O(elastic) for the three 
salts mentioned in the Rice Institute papers are given 
in Table I. [ O(elastic) for LiF has been obtained from 
the elastic constants of Briscoe and Squire! using de 
Launay’s tables. The value of 743° quoted in reference 
7 was derived from their preliminary results.!*] It will 
be seen that if @po is derived by extrapolating smoothly 
a graph of C,/T* against T°, then for each salt @o and 
(elastic) agree well within the limits of experimental 
accuracy; there is no indication that ©(elastic) > Oo 
(as suggested by Ludwig’s calculations), although the 
limits of accuracy would in fact permit a difference of 
up to 1%. On the other hand, if Scales’s values of ©o 
for KI and LiF are chosen, we find that ©(elastic) > Oo 
by about 2%. 

The example of LiF is of particular interest because 
the lower estimate of 0(722°K) is quoted by Ludwig 

15C, V. Briscoe and C. F. Squire, Phys. Rev. 106, 1175 (1957). 

16 C, V. Briscoe and C. F. Squire, Bull. Am. Phys. Soc. Ser. IT, 
2, 18 (1957). 
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HALIDE CRYSTALS 
TABLE I. Characteristic temperatures at 0°K 
for KCl, KI, and LiF. 


KCl KI LiF 


73545 
74347°¢ 
72246 


(elastic) 236 +1 131 +2 
@o (extrapolated) 
@y(Scales*) 


235.1+0.5» 132.3+1> 


® See reference 2. 
» See reference 8. 


¢ See reference 7. 
(p. 300 of reference 3) as providing some confirmation 
for his theory; but as we have seen, measurements in 
the important range of temperatures below 20°K 
disagree. The resolution of this uncertainty by experi- 
ments of more reliable accuracy would be very desirable, 
since it would firmly settle both the shape of the C,/T* 
curve (relevant to the harmonic theory) and the 
difference between @o and ©(elastic)"” (relevant to the 
anharmonic theory). 

17Tt should be noted that Ludwig? finds that the classical 
relation between elastic constants and sound velocities is no 
longer strictly valid for longitudinal waves. If so, the values of 
Ci and ¢,2 given by the pulse technique may not be exactly 
equal to the static values; however, no numerical estimate is 
given of the magnitude of this effect. Fortunately, ©(elastic) 
depends mainly upon ¢4,, which is determined solely from the 
velocity of transverse waves. 
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Diffusion Measurements in the System Cu-Au by Elastic Scattering 
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A broad-range magnetic spectrograph has been used to study the diffusion of gold into copper in the 
temperature range 360-500°C by elastic scattering of protons and deuterons. This is a new technique in 
solid diffusion measurements and should have fairly wide applicability in intermetallic diffusion. This in- 
vestigation extends the data a factor of 10° below the sectioning range. These results together with data of 
sectioning observers, show the Arrhenius law to be approximately valid over a range in D of 5X10*%. A 
precise fit to all the data requires a slight upward curvature. In the temperature range studied here, we 


ce we 


obtain 0=45, 7504750 cal/mole and Do =0.104-+£0.06 cm?/sec. This represents a decrease of about 4000 
cal/mole from the most precise data in the sectioning range. This curvature is attributed to diffusion along 
internal surfaces, although the possibility of multiple diffusion mechanisms still remains. 


INTRODUCTION 


LTHOUGH considerable effort has been brought 

to bear on the problem of solid diffusion measure- 

ment and many ingenious methods have been used, the 
sectioning technique has become more or less standard 
because of its precision and wide applicability in metallic 
systems. However, because the depth of investigation 
(~0.01 cm) is so large, the method is effectively limited 
to the range of D>10~" cm?/sec. This, coupled with 
the upper limit imposed by melting point, has effectively 
limited the range of data to a few orders of magnitude 
in many cases. Nowick!' and many others have pointed 

1A. S. Nowick, J. Appl. Phys. 22, 1182 (1951). 


out the seriousness, for theoretical purposes, of the lack 
of sufficiently accurate data over sufficiently wide range. 
Methods extending the range significantly below 10-” 
cm?/sec have been either indirect or else limited to 
specific systems having unique properties.?~> Extension 
of the data to low temperatures is of importance not 
only for increased accuracy, but also to explore for 
possible deviations from the Arrhenius law, indicating 
multiple mechanisms of diffusion, such as the suggestion 
of Lazarus of the possibility of barrier penetration in 

2G. von Hevesy and W. Seith, Z. Physik 56, 790 (1929). 

3. Cremer, Z. Physik Chem. B39, 445 (1938). 

4W. Jost, Z. Physik Chem. B9, 73 (1930). 

5C. A. Wert, Phys. Rev. 79, 601 (1950). 
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the system (Fe-Ag) at high temperatures.® Further- 
more, it is in the low-temperature region where one 
could hope to induce a vacancy density considerably 
larger than that which obtains at thermal equilibrium. 
Therefore, it should be possible to study directly the 
effects of radiation, plastic deformation, etc., on the 
diffusion process. 

This paper describes a method based on elastic 
particle scattering in which the total depth of investi- 
gation is <10~° cm, therefore extending the range to 
10-'8 cm?/sec. The method used here is a nonsectioning 
technique, although it is entirely feasible to use the 
spectrograph for surface concentration determinations 
in connection with the sectioning method. In either 
case it is preferable that the diffusing atoms be heavier 
than the solvent atoms, and in the nonsectioning 
technique, considerably heavier. It is, however, possible 
to obtain data of reduced accuracy when the reverse 
situation obtains if the solubility is appreciable. 

An interesting feature of the method demonstrated 
in this paper is that although the specimens are poly- 
crystalline (mean grain size ~0.01 cm) the ratio of 
grain size to depth of investigation is so large (~ 1000) 
that grain boundary diffusion effects are thus eliminated 
from the measurements. 


DIFFUSION 
Specimens and Technique 


The copper used. was obtained in the form of 7-mm 
rods, 99.999% pure, from Johnson Mathey and Com- 
pany. These were milled into } in.X} in. blanks, 3g in. 
thick. The blanks were polished with 0000 metallo- 


6 [), Lazarus and B. Okkerse, Phys. Rev. 105, 1677 (1957). 
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graphic emery paper, then electropolished using a 
standard phosphoric acid method. The specimens were 
rinsed in de-ionized, de-aerated water and kept im- 
mersed for the few minutes necessary to transfer them 
to the electroplating bath. The blanks were then plated 
with ~4 yug/cm? of gold in a standard double cyanide 
solution. The specimens were then sealed in Pyrex 
ampoules evacuated to 10~-* mm Hg. 

The diffusion anneals were carried out in a regulated 
oven in which a massive block of aluminum served for 
thermal inertia. Thermocouples inserted in the block 
adjacent to the specimen wells indicated a temperature 
stability of about +1°C. These thermocouples were 
calibrated against a National Bureau Standards 
certified platinum-platinum 10% rhodium thermo- 
couple. 


of 


Equipment and Experimental Procedure 


The magnetic spectrograph has been widely used for 
measurement of excited state Q values and for precision 
mass determinations. Rubin has demonstrated the 
usefulness of a magnetic spectrometer in the chemical 
analysis of surfaces by particle scattering and has 
suggested the possibility of determination of con- 
tinuously varying concentration distributions with 
minute depth resolution.’ 

The spectrograph used in this investigation is a 40-cm 
instrument of the broad-range type developed by 
Buechner.*® Figure 1 shows a schematic layout of the 
system used. The diffused specimens as targets were 
bombarded with protons from a 2-Mev High Voltage 

7 Rubin, Passel, and Bailey, Anal. Chem. 29, 736 (1957). 


§C. P. Browne and W. W. Buechner, Rev. Sci. Instr. 27, 899- 
907 (1956). 
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Fic. 2. Measured values of D=(1/E)(dE/ds) vs distance s 
along focal surface. 


Engineering Corporation Van de Graaff accelerator. 
The bombarding energy was standardized at 1.881 Mev 
using the Li’(,7)Be’ threshold. The magnet entry slit 
was about 3 mm in width. The combination of corona 
stabilizing and entry slits permitted a region of the 
target } mmX3 mm to be illuminated by the proton 
beam. All scatterings were carried out at 90°. The 
scattered protons are dispersed according to energy 
along the focal surface where they impinge on a 2- X 10- 
inch NTA nuclear emulsion plate. Eight strips 8 mm 
wide may be exposed without disrupting the vacuum 
system. The recorded particle spectrum is determined 
by microscopic examination of the developed plates. 
Proper interpretation of the particle spectra permits a 
determination of the concentration distribution of 
diffused solute. The energy range of the spectrograph 
for a given field setting is 1.3 of the median energy. The 
measured value of solid angle of acceptance is about 
5X10~ sterad and varies slightly with position along 
the focal surface. The measured quantity defined as 
D= (1/E)(dE/dS), involving the reciprocal of energy 
dispersion of the spectrograph, is shown in Fig. 2, where 
S equals distance along the focal surface. 

Typical proton spectra of a sample before and after 
diffusion are shown in Fig. 3. The line shape of the 
undiffused gold layer is about that obtained for an 
infinitesimally thin layer. The width (~4 kev) is due 
to the sum of all factors leading to an energy spread in 
the scattered particles, such as beam spread, magnet 
stability, finite apertures, etc. The rise below 10 cm is 
in each case due to scattering from the copper solute. 


ANALYSIS OF DATA 


At 90 degrees, the energetics of scattering is par- 
ticularly simple. A particle of mass M, and energy Eo, 
elastically scattered by a particle of mass Mo considered 
at rest, is reduced in energy to E, according to the 
relation 


M,-—M, 
(1) 


Mot+ M 1 : 


If the scattering element is in a layer which is thin 
relative to the energy loss of the particles, a narrow 
group is obtained (i.e., the thin gold group, Fig. 3). 
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Fic. 3. Proton spectra of gold on copper before and after 
diffusion at 440°C for 2 hours. Ep =1.881 Mev, angle of scattering 
90°. Thickness of gold layer ~4 ug/cm?. 


However, if the scattering element is distributed in 
depth, ionization and excitation energy losses spread 
the spectrum and an extended distribution such as 
shown in Fig. 3 is obtained. 

Now in a thick target containing various elements 
Aj, with atomic density ;, distributed uniformly 
throughout the specimen, the number of particles Y; 
scattered by the element A, and counted in the spectro- 
graph in unit energy interval is 


da 
y; 625x108, ( ) gQ} > Vi(« if 
dQ ; i 
cos6, : 
cosO, 

where (do/dQ);= differential scattering cross section 
atom steradian in center-of-mass coordinates, g= inci- 
dent charge in microcoulombs, 2=spectrograph solid 
angle in sterad (center-of-mass system), 6,;=angle 
between incident beam and a normal to the target, 
6.= angle between scattered beam and a normal to the 
target, €;=Sstopping cross section before scattering, 
€i2= Stopping cross section after scattering. VY; is an 
implicit function of energy through the dependence of 
the e’s and do/dQ on energy. However, for the energy 
intervals involved in this paper the e’s and do/dQ are 
constants within 1%. Therefore, V; is proportional to 
N;. Now, if .V; is not constant but varies with depth 
then Y; will be a corresponding function of energy, 
where the relation between energy and depth of 
penetration into the sample is given by 


€; €i2 
(E,-hE)=> v( avr )e (3) 
i cos6, cos. 
where &, is the energy of protons scattered from the 
surface, and x is depth into the target surface. In 
practice, for very thin solute layers, one can ignore 
solute, applying Eq. (3) only to solvent species, without 
incurring significant error. 
Energy is readily convertible to distance along the 
plate by 
E,—E=DEAS, (4) 


where the differential form is accurate to better than 
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Fic. 4. Convolution of spectrograph resolution function with 
solutions of the diffusion equation for several assumed diffusion 
coefficients, compared with the elastic proton spectrum obtained 
from a sample annealed at 503°C for 1 hour. E,=1.881 Mev, 
depth resolution dx/ds=9.042X 10-7 cm/mm, where s=distance 
along focal surface. 


1% over the energy intervals involved here. We find 
in effect, that the spectrum as recorded, under a simple 
change of independent variable yields the concentration 
distribution directly. In practice, of course, Y; is not 
directly observed. One rather observes the convolution: 


+n 


Vo(t)= f g(n) V;(E—n)dn, (5) 


—X 


where Vo; is the observed spectrum, and g(n) is the 
spectrometer resolution function normalized to unit 
area. The function g(n) may be determined by bombard- 
ment of an ultra-thin target. Since the width of g(n) is 
appreciable compared to the diffused spectra, it is 
necessary to take account of the convolution in the 
analysis. The procedure used was to convolve the 
appropriate solution of the concentration-independent 
diffusion equation, 


2 


S x 
C(xt)=— ep(-—), (6) 
(xDi)} 4Dt 
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Fic. 5. Elastic proton spectrum obtained from a sample an- 
nealed at 352°C for 495.75 hours, compared with the convolved 
solutions of the diffusion equation for several assumed D values 
E,=1.881 Mev, dx/ds=9.042X 1077 cm/mm. 
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with the spectrometer resolution function for several 
assumed D values, and to compare the results with the 
data. For convenience, the convolutions were carried 
out in units of distance along the focal surface. 

A complication arises at the lower temperatures for 
the Au-Cu system where a’ and a” phases are possible. 
In this region, the thin-layer boundary condition is not 
strictly applicable in the early stages of diffusion. 
However, a resort to the numbers involved shows that 
all solubility constraints are removed during diffusion 
in a surface layer which is thin compared to the depth 
resolution of the spectrograph. It therefore appears 
that the procedure used does not introduce an appre- 
ciable error. 

RESULTS 


The experimental results are summarized in Table I. 
Comparison of the proton spectra with the convoluted 
solutions of the diffusion equation are shown in Figs. 
4 and 5 for the highest and lowest temperatures. The 
fits obtained define the diffusion coefficients within 
about 10%. The mean penetration of solute was rather 
small for the points at 362 and 389 degrees. In order to 
improve the depth resolution, these two samples were 
analyzed using 1.88-Mev deuterons. One of these 
spectra is illustrated in Fig. 6. In this case, the depth 
resolution is 6.55X10~* cm per mm distance along the 
focal surface. This illustrates the point, which may be 
readily verified by recourse to the expressions given 
under analysis of data, that to improve depth resolution 
one may either decrease the bombarding energy or 
increase the mass of the scattered particle. By pushing 
this point to the practical limit, it should be possible to 
directly measure diffusion coefficients as low as 10-* 
cm*/sec without inconveniently long annealing times. 

A plot of InD vs 1/T is shown in Fig. 7. The best 
straight line fit to these data is extrapolated upward 
and compared with the recent precise measurements of 
Tomizuka® in the temperature range 700°-1030°. The 
individual data points fit the straight line with a root 
mean square deviation of 12.6%. The equation of this 
line is 


D= (0.104+0.06) exp(45 7504+750/RT) cm?/sec. 


Absolute accuracy of the points is believed to be better 


TABLE I. Experimental results for gold-copper diffusion. 





Duration of anneal 


rc. (hours) (cm?/sec) 


1.2X 10-4 
1.81071 
2.2X 10-15 
7.0X 10716 
9.0K 1077 
1.51077 
1.1107" 


503 1 
457 2 
457 2 
432 6.25 
389 20 
362 96 
352 495.75 


§C. T. Tomizuka (private communication). 
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than +20%. The principle source of error as regards 
absolute accuracy is in the measured value of stopping 
cross section. The values used in the calculations were 
obtained from the compilation of Fuchs and Whaling.” 
However, a separate experiment was performed to 
check the stopping cross sections used. The results 
agreed with Fuchs and Whaling within 5%. 

Annealing times shorter than one hour were not 
attempted in the oven setup. However, it should be 
possible to close the gap between the present method 
and the sectioning range, by developing techniques for 
accurate heating of specimens for short times of the 
order of seconds. 


DISCUSSION 


As may be seen from Fig. 7, the agreement between 
the present results and the data of Tomizuka is quite 
good, and establishes that the Arrhenius law is valid to 
first approximation over a variation of eight orders of 
magnitude. The apparent decrease in slope with 
decreasing temperature is prebably due to diffusion 
along internal surfaces. ‘Tomizuka’s result in his range 
(dotted line Fig. 7) is D=(0.69+0.07) exp(49 700 
+500/RT). The present investigation therefore shows 
a decrease in activation energy of 3950 cal/mole over 
678°C. It is very interesting that this result of 5.8 
cal/mole°C agrees so closely with the 6 cal/mole°C 
found by Tomizuka" in his re-examination of the data 
on the diffusion of antimony in silver. In the present 
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Fic. 6. Elastic deuteron spectrum obtained from a sample an- 
nealed at 375°C for 20 hours, compared with the convolved 
solutions of the diffusion equation for several assumed D values 
Ea=1.88 Mev, dx/ds=6.546X 1077 cm/mm. 


10 R. Fuchs and W. Whaling, California Institute of Technology 
(unpublished). 
1C, T. Tomizuka, Acta Met. 6, 660 (1958). 


MEASUREMENTS 


IN SYSTEM Cu-Au 
} ! | 
TOMIZUKA'S BEST LINE FIT 
—THIS INVESTIGATION 
































12 


1000 “1 
—— (eK) 
T 


Fic. 7. Plot of InD vs 1/T. The solid line is the best straight-line 
fit of the data of this investigation, extrapolated upward into the 
sectioning range. The dashed line is the result of C. T. Tomizuka 
for the range indicated. 


case, the condition of Hart” that a diffusing atom 
encounter many dislocations in distances of the order 
of the depth of investigation is probably not valid. 
Therefore, we might expect deviations from the 
Gaussian distribution given in (6). The distributions 
observed are certainly Gaussian to first order (see Figs. 
4, 5, and 6); however, there was a persistant tendency 
for the low-concentration data to fall outside statistics 
toward greater penetration. The cause of these small 
but persistent deviations is not yet understood. 
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The magnetic response of the Bogoljubov model is calculated, by employing a new technique for dealing 
with gauge-noncovariant approximations to gauge-covariant Hamiltonians. 

By extending our treatment to very high orders of perturbation theory, we derive a Meissner-Ochsenfeld 
effect with infinite correlation length, as anticipated by London. For larger wavelengths, the response differs 
from the London one, in the way predicted by Pippard. 


I. INTRODUCTION 


EVERAL attempts have so far been made to derive 

the Meissner-Ochsenfeld (M-O) effect (at absolute 

zero) from the Bogoljubov'* theory of supercon- 
ductivity, with conflicting results. 

(i) Rickayzen’s* calculation consists of a straight- 
forward extension of the Bogoljubov transformation to 
the case of a Hamiltonian with magnetic field. Such a 
study is complicated by the fact that now, while the 
Bogoljubov Hamiltonian as such is gauge covariant,° a 
perturbation-theoretic treatment (such as that used to 
derive the thermodynamic properties) to any finite order 
of perturbation theory violates gauge covariance. 

The ensuing difficulty is that one finds, in general, 
indeterminate answers for magnetic quantities.® Since 
Rickayzen ignores this difficulty, and makes an a priori 
choice of the “London gauge” (V-A=0), his calculation 
is not conclusive. 

(ii) Wentzel,’ realizing the gauge difficulty, follows a 
course different from ours. Commencing with the 
Frohlich Hamiltonian, by successive canonical trans- 
formations he eliminates the magnetic field from it, up 
to order A*. Such an elimination can only be performed 
in perturbation series in the coupling constant g—this 
is done to order g*. Wentzel then gets a Hamiltonian 
with magnetic field, which is tractable: A Bogoljubov 
transformation may be applied, and the calculation per- 
formed in a fully gauge-covariant manner. The result is 
a London type M-O effect, but with a coefficient which 


* Professor Schafroth was killed in an air accident in the 


Australian out-back in May, 1959. 

+ Also supported by the Nuclear Research Foundation within 
the University of Sydney. 

1N. N. Bogoljubov, J. Exptl. Theoret. Phys. U.S.S.R. 34, 58 
(1958) [translation: Soviet Phys. JETP 34(7), 41 (1958) ], and 
Nuovo cimento 7, 804 (1958). 

2N. N. Bogoljubov, Doklady Vsesoyuz. Akad. Sel’skokhoz. 
Nauk im. V. I. Lenina 1, No. 1 (to be published). 

3 Bogoljubov, Tolmachov, and Shirkov, “A New Method in the 
Theory of Superconductivity” (in Russian), Moscow, 1958 
[English translation: Fortschr. Physik (to be published) J. 

4G. Rickayzen, Phys. Rev. (to be published). 

5A Hamiltonian H(A) is called gauge covariant if for any 
scalar function, A, there exists a unitary transformation U, such 
that 

Uy'H(A)U,=H (A+grada). 


6M. R. Schafroth, Phys. Rev. 111, 72 (1958). 
7G. Wentzel, Phys. Rev. 111, 1488 (1958). 


differs from that obtained by Blatt and Matsubara, and 
by us. This evaluation treats the magnetic terms by a 
perturbation theory first (expanding them in powers of 
g), when for the nonmagnetic terms such an expansion 
is not permissable. It would appear that Wentzel’s 
result is due to some lack in cancellation between a 
function of g and its formal power-series expansion. 

(iii) Very recently, Blatt and Matsubara,’ using a 
concept of collective excitations as introduced by 
Bogoljubov,? have made a calculation which fulfills 
gauge identities. They derive the London kernel (in the 
limit of small wave number, i.e., g — 0). This collective 
excitation formalism may be interpreted as an alter- 
native method of treating the higher-order terms in a 
perturbation calculation: The derivation contains a 
selective summation over the most singular terms in 
each order. 

This calculation lacks rigor (for example, certain 
terms in the transformed current density operator are 
ignored, for simplicity), but it allows a simple physical 
interpretation of the Bogoljubov theory. Other calcula- 
tions (Anderson,’ Rickayzen™) have also been made 
along these lines, including the influence of Coulomb 
interactions. 

In this paper we derive the M-O effect directly from 
the simple Bogoljubov theory, without having recourse 
to the collective excitations. 

We have seen above that such a treatment is compli- 
cated by its violation of gauge covariance, giving rise to 
indeterminate answers. However, it has been shown 
lately (Blatt, Matsubara, and May," which we refer to 
as BMM) that there is an unambiguous way of de- 
termining a best approximation from those available. 
We thus use the Bogoljubov perturbation scheme to 
define approximate eigenfunctions of the problem, and 
then apply the BMM technique to derive the magnetic 
response of the model. 

It is to be emphasized that this technique gives a 
result as accurate as that for nonmagnetic quantities (to 


8 J. M. Blatt and T. Matsubara, Progr. Theoret. Phys. (Kyoto) 
20, 781 (1958), and paper to be published. 

*P. W. Anderson, Phys. Rev. 112, 1900 (1958). 

© G. Rickayzen, Phys. Rev. Letters 2, 90 (1959). 

" Blatt, Matsubara, and May, Progr. Theoret. Phys. (Kyoto) 
(to be published). 
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the same order of perturbation theory), where the gauge 
difficulty never arises. 

The magnetic response function, K (qg), which we cal- 
culate agrees with that of BCS” in the limit of large q. 
This is understandable, because in this limit the ambi- 
guity due to gauge difficulties arises only in terms of 
second order of smallness. 

In the case of small g, by going to infinite order of 
perturbation theory, we confirm the result of Blatt and 
Matsubara. Wentzel’s result differs from this by a factor 
p/2, where p is, in a sense, the expansion parameter of 
the Bogoljubov theory. Rickayzen’s calculation corre- 
sponds to an ad hoc choice of the BMM parameter a=1, 
whereas, in fact, to the order of perturbation theory used 
by Rickayzen, we find a<1. When our calculation is 
extended to much higher orders, we find for small gq, 
a— 1; for large g, a#1. 

In Sec. II we sketch the procedure which we use for 
calculating the linear response of a magnetic system in 
the limit of zero temperature—an outline of the BMM 
technique is presented here. Section III summarizes the 
main formulas of the Bogoljubov theory, which is then 
applied in Secs. IV and V to the problem of evaluating 
K(q) in the limits of small and large g, respectively. 
Finally, we summarize our results in Sec. VI. 


II. LINEAR RESPONSE OF MAGNETIC SYSTEM 


The M-O effect is a particular form of linear response 
of a system to an applied inhomogeneous magnetic field. 
To calculate this response, the following procedure is 
useful. 

Consider a system of .V particles, with masses m, and 
charges e;, which, in the absence of magnetic fields, is 
described by the Hamiltonian 

N (p;)? 
H=>> —+4+ V(xi,x2:- 


i=1 2m; 


‘aD. (2.1) 


Such a system includes, in principle, a very accurate 
model of a solid: namely a system of .V; nuclei and V2 
electrons (V,;+N.2=.\) interacting with each other via 
Coulomb forces. 

A magnetic field, with vector potential A, is now 
introduced into the Hamiltonian® by means of the trans- 
formation 


pi — pi— (e,/c)A(x;). (2.2) 


A(x) may be expanded in Fourier components: 


A(x)=>>, A(q)e'*?, 


’ 


(2.3) 


condition giving A*(q) as 


(2.4) 


where there is a “‘reality’ 


A*(q)=A(—4q). 

We shall restrict ourselves to one Fourier component ; 
this is permissible for calculating the linear response. 
2 Bardeen, Cooper, and Schrieffer, Phys. Rev. 108, 1175 (1958). 
13 This accounts for the orbital motion of charges; other possible 


terms, depending explicitly on the field strength B, would describe 
magnetic moment of spins, etc., and will not be considered. 


EFFECT 


The associated magnetic field is 
B(q)=iqXA(q). 


The Hamiltonian H(A) is now obtained by making 
the transformation (2.2) on the field-free Hamiltonian 
(ZA): 


(2.5) 


H(A)=H+H4+H 4 "C 
with 
H «= — (1/c)j°*(q)-A(q)+c.c., 
e;" 


Haa=X —|A(q)|?. 


i 2mic? 
The operator j°(q) has been introduced as 
ej 


rMg=d 


i 2m; 


(pye~*9 *! + e-*4-Fp,), (2.9) 


In the limit of zero temperature, we may contine our 
attention to the ground state. First, the change in 
ground-state energy is calculated by applying perturba- 
tion theory to H(A), regarding (H4+Ha4a) as a per- 
turbation on the field-free Hamiltonian H. In terms of 
the field-free eigenvalves and eigenvectors, 


H \k)=E;,|k), (2.10) 


this change in ground-state (|0)) energy is 


E(A)=(0| Haa|0) +E ,[(0| 14 |) 


X(k!H4|0)/(Eo—Ex)]. (2.11) 


Next the current density, J, is obtained by use of the 
relation 
j= —cdEF(A), ‘OA. (2.12) 


We now observe that (2.11) can be written in the 
form 


E(A)=—3 > J,.(q)A,*(q)A,(q), (2.13) 
with the definition 


J ys= Spo Lb ys, 


L=) >. e2/mc, 


C.C. 


1 _ <0} 7.°(q) | &XA| 7,°(—q) | 0) 
, ky— Fk, 


It is important to note that / and S,, are not com- 
pletely independent: They are related by a condition 
imposed by the requirement of gauge invariance. In a 
medium with cubic symmetry, this may be written as 


D J u4uge=9, 


uy 


(2.17) 


that is, 
= Gud py 
G(qy=d —L=0. 


BY q° 


(2.18) 
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The current density (2.12) now reads 


J p= D> J uvAv. 

It is often desirable to eliminate the vector potential, 

A, in favor of the physical quantity B. Introducing the 

magnetization, M (J=c curlM), and using Eqs. (2.5), 
(2.14), (2.18), and (2.19), we get directly 

M (4) = (1/29) {ou Sup—3L} By (Q). 


The linear magnetic response may thus be characterized 
by a “kernel” function, K(q), such that 


(2.19) 


(2.20) 


M ,(q)=K(q)B,(q). (2.21) 


A number of alternative expressions for K(g) may be 
given by use of the identity (2.18); in particular, if one 
uses (2.18) to eliminate from (2.20), one gets the 
“normal form” 


_ Gud» 
K (q)= (14-3 ~ )s. 
q° ue 


gq 


(2.22) 


In general, (2.18) generates a set of alternative expres- 
sions for K(g), which can be written in terms of a 
parameter @ as 


K (q)= K p(q)+aG(q) ’. (2.23) 


If S,.(g) and G(g) are calculated from a gauge- 
covariant Hamiltonian H, then of course G=0, and all 
the forms (2.23) are identical. However, if one evaluates 
S,» by using the eigenfunctions of an approximate (field- 
free) Hamiltonian 7, thereby losing gauge covariance, 
then in general the resulting approximate S,, violates 
the identity (2.18). That is G¥0, and one gets an 
indeterminate result K for (2.23), depending linearly on 
the parameter a: 


K (a)=KptaG/¢. (2.24) 


However, there is still one well-defined value of a, a; say, 
for which K (a) is closest to the exact value K. If the 
difference between true and approximate Hamiltonians 
be characterized by an expansion parameter, e, 


H=H+ ell’, (2.25) 


then the exact AK may be expanded around K(a) in 
powers of « 
K=K(a)+eK'(a)+€K"(a)+---. (2.26) 


We now choose a=a; by demanding that 


ri . G’ 
K'(a))=K z' +a, 
a 


=(). 


Thus we have 


K(a)—K= 0(e), (2; 


i.e., the approximate kernel is as good as any other 
physical quantity in the same degree of approximation 
in H’. The ambiguity due to lack of gauge covarience 
has entirely disappeared. 


as ge 


SCHAFROTH 


It must be noted that the criterion above cannot be 
used with a gauge-noncovariant Hamiltonian A, unless 
FZ is itself defined as an approximation to an exact, 
gauge-covariant Hamiltonian. Thus this technique is 
only applicable to evaluation of the magnetic properties 
of a properly gauge-covariant system in an approxima- 
tion which relaxes the covariance. This is the case in the 
Bogoljubov scheme. 


III. THE BOGOLJUBOV THEORY 


In this section, we summarize, for subsequent use, the 
main formulas of the Bogoljubov theory.’ The treat- 
ment outlined here is a refinement of the original theory! 
in that it involves renormalization of both electron and 
phonon energies, thus greatly facilitating the use of 
perturbation theory. Bogoljubov starts from a modified 
Frohlich Hamiltonian, 


H=> [E(k)—A Jax.'ais+D w(p)d,b, 
ks p 
w(p)\? 
{0 (—~) autarbytcc,| (3.1) 


(where A is a Lagrange parameter associated with the 


oi a xsl ks N, 
ks 


+ S&S 


kk’ ps 
(k’ —k =p) 
restraint 


and subsequently determined by the condition N=.V) 
A canonical transformation is then made from the elec- 
tron and phonon operators, a‘, a, and ), to new “fermion” 
and phonon operators a’, a, and 8, 


Cet =Ujapotrian', 


(3.2) 


(3.3) 


a_,py = UjQk1—ViEAKO', 
b,=\,B8ptu6_;', 


where a’, a, and @ obey their respective Fermi- and Bose- 
type commutation rules, provided that 


u2+o/=1, 
The Hamiltonian (3.1) is now written 
H=HotH +H! +H! +H! +h ert U, 


where Hy is the ‘zero-order’? Hamiltonian, with re- 
normalized fermion and phonon energies, 


Hy= >» k é(k) (axtarotaxtarn)+>, p &(p)Bp'B p, 


and the remaining terms are given by 


A e—upe= 1. (3.4) 


(3.5) 


(3.6) 


H,'= 3 


kk’ 
(k’ —k =p) 


w(p)\} 
o(— ) (De +UKVE) (Mp try) 


x (axolagt taxa) (8,'+B_ mR 


w(p)\! 
. - ) (16, — 040K") (Apt+Mp) 


(3.7) 
X (axotaxotaxr'a nr) (Bp'+8_p), 
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Hy’ => .( E(k) —d) 2040; (apo'agst+taxiaxo), 
Hy! = YX) p oP) p(B p'B-»' +B_2Bp), 
Hr=D (E(k) —A)(ui?—042) — (k)} (a colaxotars'on), 
+E pop) (uy?+Ayz”) — O(P)}B'Bp. 
aining term is a constant, 
U=D LE(k)—)]202+D p o(p)u,’. 


The terms (3.7) are described in Fig. 1 by Feynman 
graphs. 

The transformation parameters u, v and yp, d are de- 
termined by the criterion that “dangerous” processes 
(creation of a virtual pair of fermions or phonons from 
the vacuum) should compensate. One then writes #;, v% 
in terms of a function C(k) as 


The rem 
(3.8) 


[#*(k) +C2(k) } 
£(k) 
Lek) +x} 

E(k) = E(k) — 
and C(k) is to be determined (to second order) from 
f a(k— ‘k’) 
v 2V &(k—k’)-+e(k)-+é(k’) 
C(k’) 


[e*(k’) +C*(k’) } 


where 


Re. (3.10) 


This equation has two solutions: the trivial solution 


C(k)=0, (3.12) 


and another solution which, for all practical purposes, 


—-—---@ — —-- 


Ha 


Fic. 1. Graphical representation of terms in the Bogoljubov 
Hamiltonian, Eq. (3.5). Unbroken lines represent fermions, broken 
lines phonons. Time runs from left to right. 
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is the constant 


C= exp(—1/p). (3.13) 


Here @ is some average phonon energy, and p, defined as 


gy ke 
alae 
2m? \ dé/dk] k=kp 


is required to be small. The trivial solution (3.12) 
corresponds to the normal state, which has a higher 
ground-state energy than the superconducting state, 
given by (3.13). 

Finally, we mention that, to second order, 


normalized fermion energy (é“(k)) is given by 
 [o(k—k’)+¢ (k’) 2 (k) 


ag] 
=[e) +c). 


The noteworthy feature is that this fermion excitation 
spectrum has an energy gap A above the ground state; 
the magnitude of this gap is 


A=C. 


(3.14) 


the re- 


o(k—k’) (1,2 ,-?+0)20,/7) 
SoMa ory 


(3.15) 


(3.16) 


IV. K(q) IN THE LIMIT OF SMALL gq 


) Using Eigenfunctions Correct to Second 
Order in the Bogoljubov Scheme 


We now apply the formalism outlined in Secs. IT and 
III to the problem of evaluating the kernel function 
K (q) defined by (2.18), (2.22), and (2.23). By using the 
Frohlich Hamiltonian, Z and S,, given by (2.15) and 
(2.16) may be rewritten as 

L= (e?/mce?)N, (4.1) 
k)<R| j,°(—q)| 0) 
Eo— Ei 


(0| 7,°(q) | 
S.=— (4.2) 
ck 
with the operator j,°(q), (2.9), in second-quantized 
form 
 e 

c@=— L 

2m _ kk's 
(k’ —k =q) 


(k+k’),a kat Ok'ee (4.3) 


Upon making the Bogoljubov transformation" (3.2) 
into the operators at and a, the operator 7,°(q) is given 
(exactly) as 


J (q) = 
Jur’ (Q) 


Jur (Q) + ju2?(Q), (4.4) 


—(e/m)>—.k so etal bgt UK 
X (ak-4a, 


== Ke m) ok Ryu (Uy qeria— Wet 5 qVk—-$a) 


ae? hq) 


0 Ta kt} q,0— Ok} }q,1 tau 1q,1)) 


(4.5) 

Ju2'(Q) 
X (@e—4q,0'@ e+ 49,1! — Mk} q,1@k+4q,0) 

14 Tt is to be noted that a Bogoljubov transformation (3.2), with 


fixed, field-independent coefficients u and v, does not destroy the 
gauge covariance of the Hamiltonian. 
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Fic. 2. Graphical rep 
resentation of the opera- 


tors j;° and j2° given by 


(4.5). Wavy lines repre- 
sent the action of the 
external field, A. 


The vectors k, k’ of Eq. (4.3) have been rewritten as 
k — k—}3q and k’ > k+3q. Invoking graphical repre- 
sentation, j;° and 7° may be expressed as in Fig. 2. 

For investigation of the magnetic properties of a 
system, the behavior of K(g) in the limit g— 0 is 
important. In particular, if K(g) goes like 1/g’, the 
system is a London superconductor. We therefore ex- 
pand the matrix elements in (4.5) in this limit, 


(i445 gM k—pqHVe+4 qUk—4o) 


k-q\? CC k-q \‘ 
CY See) 
2m J (C+ &)? 2mC 
k-q c k-q \° 
(9s) 
2m F &)?+C? 2mC 


Eo= (1/2m) (k°+¢°/4—kr’). (4.7) 


(From now on we neglect the g dependence embodied in 
£, as it gives rise to correction factors of order (g/kr)’, 
which are negligible for all g of any interest.) Moreover, 
since all integrals over k have their dominating con- 
tribution from the region |k! ~k,, in (4.6) we rewrite 
k-q(=kgx) as kpgx; this approximation is as good as 
those made in setting up the Bogoljubov theory [e.g., 
(3.13) ]. 

Thus the expansion (4.6) is an expansion in powers of 


(4.6) 


where 


(gx/u), with 


p=2mC/k rp. (4.8) 


Hence by “small g’”’ we mean g <u; we shall now derive 
K (qg) in this limit, as a power series in (g/u). The case of 
g>u (“large qg’”) which is relevant to penetration 
phenomena, is deferred to Sec. V. 

It is important to note that in the limit g<u, the 
matrix element associated with 7,° (i.e., scattering of a 
fermion by the field) is of order unity; whereas that 
associated with 72° (emission or absorption of a pair by 
the field) is of order g/u. 

In evaluating S,,, only those terms which contain 
either j;° or js° twice are pertinent : Any term containing 
each of j;° and j,° once is odd in k, and thus vanishes 
identically on summation. 

The scheme so far is exact and gauge covariant; the 
approximation now enters, when the Bogoljubov treat- 
ment is used to provide approximate eigenvalues and 
eigenvectors of the field-free Hamiltonian, to calculate 
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Su». (The quantity L is, of course, exact, independent of 
the approximation used for S,,.) 
Formally, we proceed as follows: We write 


H=Hyty(Hy'+He!) +7 (Ai" +e") +77 (Ap), 


where Hp etc. are as defined in Sec. III, and y is a formal 
expansion parameter, which we put equal to 1 finally. 
Terms of order y?" correspond, in effect, to graphs with 
n closed phonon lines. 

It may be shown’ that gauge covariance is no longer 
fulfilled identically to each order in y. This difficulty is 
overcome by using the BMM technique. 

To order y*", Ho contains the energies of fermions and 
phonons renormalized to this order. Hence the eigen- 
values for the energy denominators in S,, are known. 
By application of perturbation theory’ (with corrected 
energy denominators) eigenvectors are then calculated. 

The eigenvector for the state with, say, fermions 
k,0, kel, ---kx 1 and phonons fy, p2: - - px2 (represented 
in zeroth order by |,0,-++,kwil; pi,---,px2)) when 
corrected to order y’" by perturbation theory, is de- 
noted by {| 0,---,kw1; pi,-++,PN2)},. For example, to 
second order, the ground state is 
{|0)}o= |0)+-y! k0,k'1; p(=k’—k))RO,R'1; p! H1'|0) 

+-7{ | kO,R'1,80,R/""1) 

< (RO,R1 ROR 1 | Hy! RO,R'1; p) 

X(R0,R'1; p Hy" |0) 

+ ROR; pp’ (RO,R'1; pp" | Ho" kO,R'1; p) 
<(k0,R'1; p Hy’ 0) 
+etc.}. 


(4.9) 


(4.10) 


These perturbation terms have the graphical repre- 
sentations shown in Fig. 3. 

Using eigenfunctions calculated to order y2", a S,,)‘” 
is found. Since the approximation used violates gauge 
covariance, it is to be expected that this S,,(” will give 
rise to a G'” which violates the identity (2.18), and the 
technique outlined in Sec. II has to be used to choose a 
best value for a in K‘” (q,a). In principle, this is done by 
choosing a=a,, such that K“ (q,a1) is a stationary ap- 
proximation to K(q): in detail, we calculate K' "t)) (Qa) 
by use of wave functions and energy values correct to 
the next highest order, and then choose a=aj, such that 
the difference 

AK ") (g,a) =K ied CP) ~—K' ”")(g,a) (4.11) 


is minimized. 


— 
(x) 


Fic. 3. Graphical representation of perturbation corrections to the 
ground-state wave functions. 


(y’) 


15 P. M. Morse and H. Feshbach, Methods of Theoretical Physics 
(McGraw-Hill Book Company, Inc., New York, 1953), Vol. 2, 
p. 1004. 





MEISSNER-OCHSENFELD 


The lowest order terms in AK” (q,a) are obviously of 
order y*("+—that is to say 


AK” (q,a) =P" DR (qa) 1+ O(y?) ]. (4.12) 


In this way, our gauge-noninvariant K‘” (g,a) may be 
used to get a well-defined approximation, of order y?", to 
K(q), by choosing a=ay, such that 


K™ (9,01) =0, (4.13) 
1.e., 

Kp” (q) +aiG' wg) g’=0. (4.14) 
As a starting approximation, we take quantities correct 
to second order'® (i.e., to order y*, or n= 1). 


K,(q)=— _— 


g? ct \m 


4L is 
=-——{(1+ 0(4/u)*+ 0(y9)}. 
15:0" 


In simplifying (4.17), we have used the relation 


= (V/3m2)kp’. (4.19) 


G (q)/q? is defined from (2.18) as 


G(q) as -L 
—= OQ —, (4.20) 


g° “uP q° 


The 1/g? term due to 5, has its leading coefficient of 
order y*. This is because terms proportional to 1/9? 
must arise from the operator /;° [see (4.6) ]; but since 
ji’ is a scattering operator, fermions must first be 
created from, and finally destroyed to, the vacuum, by 
perturbations on the ground state, |0), which are at 
least of order y. Hence 


G (q) L 
= ——{1+ 0(9/u)?+ O(7’)}, 
q° 


(4.21) 
q’ 
and, as we expected, the approximate wave functions 
and energies have violated the gauge condition G(qg)=0. 
So our approximate kernel is to be chosen from the 


infinite linear family 
4 a 
+= JL. 
Su? ¢? 


We therefore proceed to construct the quantity 
K (q,a), as defined by (4.12). In other words, by using 
eigenfunctions correct to order y‘, we compute the 
lowest order (y*) terms in the difference K® (q,a) 
—K (q,a). Bearing in mind that S,, contains either the 


K®(q)=— (4.22) 


16 We avoid use of the zero-order eigenfunctions, since the 
continuous energy spectrum so obtained is qualitatively at fault in 
describing a spectrum with a gap. If one uses such quantities, one 
encounters difficulties stemming from the expansion of a spectrum 
with a gap around a continuum. 


> ( ; ) 3 (28 uy 34uQr/Q?) Ruhr (tt i+§ gk} q7— Why hth)” 
wrk (€(k+3q)+e(k—}q 


EFFECT 
The quantity 


1 Jud 
~ - _ By ~ 
Ru%(Q=—E (on )s0 
gq uy q° 
is regular as g tends to zero; this is because all terms of 
order 1/q? involve an angular integration of the form 


(4.15) 


J (3—3 cos*)d(cosé) =0. (4.16) 


0 
The leading term in Ky (qg) is then of order 7’, 
obtained by the operator /2° creating and then destroying 
a pair of fermions. 


+ O(7’) (4.17) 


(4.18) 


pair of operators j;", 7:° or the pair js", 72°, it is seen 
that such terms arise basically in three ways. 

(i) The j;° operator scatters a fermion of a pair which 
is created and destroyed by perturbations on the zero- 
order ground state, | 0), of orders y and y*. Schematically, 
such terms correspond to graphs such as shown in Fig. 4. 
Such terms have a leading component of order 1/g’. 

(ii) A jo operator creates a pair from the vacuum, 
and destroys it back into the vacuum; the wave func- 
tions in between having perturbations of order y and y’, 
or else of order y* coupled to an unperturbed function, 

Also in this class we include terms arising when 
successive js° operators create (or annihilate) four 
fermions from the vacuum, and these fermions are 
annihilated (or created) by perturbations to the zero- 
order vacuum of order y‘ (see Fig. 5). Here the leading 
term will be of order 1/x?. 

(iii) There is also a term of order y‘ in the differ- 
ence between S,, and S,, arising from the difference 
in the two denominators (i.e., from the fact that 
[ e (k)— € (k) J~y‘). This, in conjunction with un- 
perturbed wave functions, gives a contribution to 
K (g,a), which is of order 1/p?. 

The quantities K,g(g) and G(qg)/qg’, defined in 
(4.14), may be expanded in powers of (g/)?: 


(4.23) 
(4.24) 


Ke” (g)= (L/q*) (ao +42 (g/u)?+ +++), 
G™ (q)/g?= (L£/q?) (60 +62 (g/u)?+---). 


, g 
—_— —— ~ 


hes odie, 


ae ee 


~~ ———— ) 
(8) 


(¥) 


~ 


IG. 4. Graphical representation of a typical member of the terms 
of type (i). The graph has been “exploded” into its successive 
components. 
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(¥") 
Fic. 5. Graphical representation of some typical terms of type (ii). 


The coefficients a and 6” are calculated in Appendix 
I. This calculation is facilitated by the renormalization 
of the phonons, and the requirement that “‘dangerous”’ 
processes, giving rise to virtual creation of a phonon 
pair (only), should compensate: Many terms may be 
eliminated as being down by factors of at least @/Er. 
The argument given previously relevant to the 1/¢ 
term of Kx" (gq) [see (4.16) ] gives immediately 


(4.25) 
Evaluation of terms of type (i) yields [neglecting terms 
of relative order (1/p) exp(—1/p) | 
)p{1+ O(@/Er)}. 


To find a2 and 6. terms of all three types, (i), (ii), 
and (iii), must be considered. It is found that the 
dominant contribution comes from the “ladder graph” 
type of terms in (ii) illustrated by Fig. 5(b) and (d); 
such terms are characterized by the fact that each 
alternate energy denominator, in perturbation theory, 
contains only fermions, and no phonons. One gets 


—19{1+ 0(p)+0(a/Er)}, (4.27) 
1o{1+ O(p) + 0(a/Ep)}. (4.28) 


Thence a, is chosen by the demand (4.13) that 
K™ (g,a1) vanish: ignoring O(g¢/u)* 


1 
i= 
(16/3) (1—1n2)p+ (q/u)? 
3 


bo = 4(1—In2) (4.26) 


ao) = 


eV = 3 


(g/m)? (4.29) 


“(1+ 0(4/ p)?}, (4.30) 


“16( 1—In2)p 2? 


and finally, substituting into (4.22), to second order in 


M. 
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y, we get the kernel in the limit g<y as 


K (q)=K (q,a1) 


4 3 L 
--(+—_—__)- 
15 16(1—In2)p/ p? 


Such a kernel describes a large diamagnetism, rather 
than a London M-O effect. Experimentally, this result 
is nearly indistinguishable from the pure London one. 


(4.31) 


(b) Using Eigenfunctions Correct to Order y*" 


We now generalize the procedure above to the calcula- 
tion of K‘” (g,a). We first note that, up to factors of the 
form [1+ 0(¢/u)?] and [1+ 0(y”) ], 


. 4 a 
K (g,a) = — (+5): 
15y2 


The best value of a (i.e., a1) is chosen by the criterion 
(4.13). Generalizing the notation used previously, we 
have (see Appendix IT) 


(4.32) 


ay” =0, (4.33) 

ax” = —rp[1+ O(p)], (4.34) 

by” = rep "41, (4.35) 

ba) = —ax”[1+ 0(0") ], (4.36) 
where 7), 72 are numerical constants. Thence 


9 


q 
———-{1+ 0(p)+ 0(q/u)°}, 
few +¢ 


(4.37) 


| ea 


where f is a numerical factor of order unity. 
The resulting approximation to K(q), 
K™ (q,a1), in the region ¢<y, is 


namely 


1 
R (qa) = -———_L. (4.38) 


q+ fo"u 


Provided now that p is small (and stability of the 
lattice requires!’ p<}), in the limiting case of very 
large, the kernel (4.35) becomes the London kernel: 


lim K‘” (q,a1) = — L/@’. (4.39) 


V. K(q) IN THE LIMIT OF LARGE g 


In the region g>y (although still ¢<kr~10* cm™) 
the calculation of K(q) is straightforward. Using wave 
functions and energy values correct to second order in 
y, as in the preceding section, we again get Kp" (g) as 
given by (4.17), ignoring terms of order p and higher. 
Replacing sums by integrals in (4.17) and making a 
Tiablikov and V. V. Tolmachov, J. Exptl. Theoret Phys. 


U _ 1254 (1958) [translation: Soviet Phys. JETP 34(7), 
8¢ 8)]. 


7S. V. 
SSR. 
67 (195 





MEISSNER-OCHSENFELD 


change of variables &=[ (g/u)C ly, 


Mi bar 
Ri=—a fay 
q 40 


i dx(3x?—1) 
» Lota +eT4+[y—2)+e} 
(y?—2?+ &) : 
-— - ; (5.1) 
{Lyt+a)?+e ]L(y—2)P?+e]}} 


which is to be evaluated in the limit e<1, with e=y/g. 
This evaluation, carried out in Appendix III, yields 


aaaee ees os n(“)+ o(“) (5.2) 


16 ¢’ 3a” g 








Also we have 


G(q) 1(2ser? 
a) 8 
q° q lo \m k 


(this ba? k- dibonsr: $qUk+} a. 


Again we may expand in powers of u/q: 


or 


and (5.4), it is apparent that, 


GP(q) _ On? us i. 16 - 


(5.4) 
g° 16 ¢ y 


3a? q 
Comparing Eqs. (5.2) 
in the limit u/g<1, 


Kx (g>G” (¢ q)/ /q. (5.5) 


This then implies that K“(q,a) is very insensitive to 
variations in a. 

a, is evaluated, according to the BMM « criterion, in 
Appendix IV. In carrying out the resulting calculations, 
we find it convenient to subdivide the region g> yu into 
two parts: 


(a) w<q<pexp(1/p), 


(5.6) 
(b) kr>g>wuexp(1/p). 


Wentzel’s remarks as to the nature of the kernel at large 
q apply only to region (a): this is the more interesting 
region, since it is relevant to g< 10° cm~. In this region, 
both Kg (g) and GS (g)/q? have leading terms which 
differ from Lp*/g? only by numerical factors. a; is thus a 
purely numerical constant, containing any g dependence 
only in correction terms. Hence, in region (a), 


edo) 
1+ 0] —In )I L. 
16 gq? | q q 


In region (b), G (q)/g? is smaller than K,(q) by a 
factor of less than 1/p exp(—1/p). We find [see (IV.13) ] 


K(q)= (5.7) 


EFFECT 


cos) 
a\=o- exp{ — 
ad p 
mn 1 m 1 
x| +4] (" exp ) n( xp-) |}, (5.8) 
q p q p 


with o some numerical constant. The ensuing term 
aG/ gq’ in K(q,a1) is, however, still smaller than 
Kp throughout the region (b), by a factor [exp(—1/p) 
XIn(u/q) ]. That is, in region (b), 


Om* ye | —% M 
K(qy=-—— 1+ exo(- )n(“) L. (59) 
16 ¢° 3 | p q 


The results (5.7) and (5.9) may be collected together 
by writing the kernel, in the limit g>uy, as 


On? u 
K(q)=-—— 
16 ¢ 


Xx | i+ of (“+e0(—)) n(*)| I. (5.10) 


Extension of the above calculation to higher orders in 
7 leaves the result (5.10) unchanged, except for factors of 
the form [1+ 0(?) ], which are to be neglected. This is 
in contrast to the behavior for g<y, where extension of 
the calculation to high orders is necessitated by the 
nature of the parameter a, described by (4.34). 


VI. CONCLUSION 


We now have the kernel, K(q), for all values of g 
(except g2kr, which is a region of no interest). We 
may consider separately the regions g>u and g<uy, 
where yu is given by [ Eq. (4.8) ] 


u=krC/Er, (6.1) 


p= 10 cm". 


(6.2) 


The region g>w is relevant to calculation of the 
penetration depth as measured in static and microwave 
experiments, and allied quantities. The behavior of 
K(q) in the limit g— 0 describes the law of field pene- 
tration at large distances within the superconductor— 
that is, it describes the qualitative nature of the M-O 
effect. 

For g>u, the lowest order in our perturbation scheme 
gives the kernel in a completely satisfactory manner: 


g 2 
y 2 i+ 4 (“+« ”) in(“) (6.3) 
16 q? ‘| q q 


This kernel has features which differ from the London 
kernel in the way predicted by Pippard.'* When treating 


K(q)= 


18 A, B. Pippard, Proc. Roy. Soc. (London) A216, 547 (1953). 
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Fic. 6. The kernel, to mth order in our perturbation scheme, 
K‘” (q), as a function of g, for n=1, 2, 3, and 4. The firm line indi 
cates the limiting form, as m becomes very large. (We have taken 
p*}.) 
penetration phenomena, the so-called ‘“‘London penetra- 


tion depth,” Ax, 
— 
) (6.4) 


4drre*n 
A=. ——— 
mc? 


must be replaced by a penetration depth, d, 


(6.5) 


with 
, 


a = (9n"/16)y. (6.6) 


An interesting fact is that the description of the mag- 
netic susceptibility of very fine-grained colloidal sus- 
pensions of superconducting material is qualitatively 
different from that obtained by use of the London 
kernel (6.9). Since the susceptibility of such suspensions 
is used to measure the ratio of the static penetration 
depth at different temperatures, this point is significant. 
We defer further discussion to a later publication. 

In the case of g<y, we have found, to order y?” in 
perturbation theory, 


1 


ooo, 
g?+ fo" 


KK (qg)=— (6.7) 


The behavior of this kernel is depicted qualitatively in 
Fig. 6 (with p=~}). 

The kernel has a 1/g? singularity, which is cut off 
below some qo'”, 


qo m= pny, (6.8) 


As we take higher and higher orders of perturbation 


M. 
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theory (i.e., 2 very large, and p" very small), the cutoff 
occurs at smaller and smaller values of g, becoming 
indistinguishable from the true London kernel 


K1(g=—L/¢. (6.9) 


This result agrees with that of Blatt and Matsubara. 
This is to be expected, since our calculation (in which 
the ladder graphs play a central role) is one way of 
treating the leading contributions from high-order terms, 
while the collective excitation formalism (which in- 
volves a summation over terms analogous to our ladder 
graphs) is an alternative approach to these higher-order 
terms. 
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APPENDIX I 


The coefficients do, a2, bo, and 6? detined by 
Eqs. (4.23) and (4.24), are calculated in this Appendix. 
In evaluating integrals arising in the Bogoljubov theory, 
we may utilize the fact! that the dominant contribution 
comes from the Fermi surface: i.e., we replace 
(g?/ 2?) f k’dk by pf dé. Also, in general, we replace the 
phonon energy @(k—k’) by an average energy, @, with 
aK rp. 

The coefficients are calculated under the assumption 
that p is small, compared to unity: in getting final 
results, quantities of order 


@/ErK1; (1/p) exp(—1/p)<1 (1.1) 


are ignored, compared to unity. 


(1) ay, bo? 


First, for the 1/g? coefficients, only terms of type (i) 
(see Sec. IV) need be considered. By an earlier argument, 
exemplified by (4.16), we get immediately 


ap? =0. 


(I. 


A variety of terms contribute to bo; these we have 
evaluated. It is found that, for various reasons, many 
may be neglected as being smaller than the leading con- 
tribution by factors of at least @/Er or (1/p) exp(—1/p) 
(see I.1). 

The Bogoljubov formalism employed here leads to the 
following simplications : 

(a) The choice of the parameters “, v and yw, A such 
that “dangerous” graphs compensate means that all 
graphs with either two fermions or two phonons (only) 
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in an intermediate state must compensate, to give zero 
contribution. 

(b) Due to renormalization of phonon energies, 
graphs which contain internal phonon self-energy lines 
[e.g., Fig. 7(a)] give contributions smaller than the 
leading one by factors of at least @/Er. (See Bogoljubov, 
reference 2.) 

Most of the remaining graphs [e.g., Fig. 7(b)] con- 
tain four distinct fermion momenta (k,k’,k’”’,k’’”’) related 
by the requirement that 


k+k” = k’+k’”, 


The other graphs contain not more than three distinct 
momentum indices, being of the type illustrated by 
Figs. 7(c) and (d). 

It is found that contributions from graphs of type (d) 
differ from those of type (c) by factors (1/p) exp(—1/p) 
and are therefore ignored. 

Graphs of type (c) (containing fermion self-energy 
parts) now give the major contribution to 9, the terms 
exemplified by (b) being smaller by factors of at least 
@/Er. Qualitatively, this may be understood as follows: 
In general, the energy denominators are of the form 


@(k—k’)+ é(k)+ €(k’), 


(1.3) 


and have a minimum value, of order @, when 
k| =kr[1+ 0(/Er) ], 
k’| = ke 1 + O(@ Ep) }. 


(1.4) 


Consequently, when we come to integrate over fermion 
momenta, the dominant contributions occur when all 
the momenta have moduli of order kr. At this stage, the 
condition (1.3) imposes a limitation on graphs of type 
(b)—if |k’|, |k’’| and |k’”’| be chosen to satisfy (1.4), 
only a fraction of those k allowed by (1.3) will satisfy 
(1.4). Moreover, since this fraction is roughly @/Ep, it 
is not surprising that a quantitative evaluation of these 
integrals leads to a contribution to 69 which is 
smaller, by such a fraction, than the contribution from 
integrals without a constraint like (I.3). 


Fic. 7. Some typical terms of type (i), in graphical form. 
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Thus, ignoring terms of relative orders @/Ey and 
(1/p) exp(—1/p), bo is found from graphs of type (c) 
alone: 


losje\*s2\? (k-q)? 
wo M(E 
c\m 2V7 kk’? 
(av’ +-u'v)?(A+ pu)? (A +p’)? 
x . “ 
(a+ e+) 


@(k—k’)a(k—k”) 


| (u!’ —vv!")? 


x 3 ee = 
l(a’ +2’— (Gta +2"+2) 


(uv’’+u!’v)? 
(i +2’+8)(@+a'+2"+2/428)’ 


1.5) 


[where é’ means é(k’), and so on ]. Utilizing the remarks 
made at the beginning of this appendix, we have bo", to 
the approximation (1.1), as 


by = $p?(1—1n2). (1.6) 


(2) ao, by 


Now we must consider terms of the general type (ii) 
and (iii) (whose leading terms are proportional to 1/y°), 
as well as the second-order terms in the expansion of 
type (i) in powers of (¢/u). We can pick out certain 
terms, of type (ii), which give a contribution larger than 
others by a factor of (at least) 1/p. These are the so- 
called ladder graph terms, as illustrated by Fig. 5(b) and 
(d)—but not by (a) and (c). They are characterized by 
the fact that each alternate energy denominator con- 
tains no phonon terms [see, e.g., Eq. (1.11) below ]. 
They do not contain either fermion or phonon self- 
energy parts [such graphs are found to give contribu- 
tions smaller than those due to ladder graphs by factors 
of at least p (fermion self-energy) or ®/Er (phonon 
self-energy) ]; nor do they contain the process whereby 
a fermion pair is annihilated and re-created by a phonon 
[ these have been found to differ from ladder graphs by 
factors of at least 

(q kv) (@/C)= (g/m) (@/ Ep) |. 

An important property of graphs of type (ii) is 
that their contributions to Kg (g) and to § (q)/q¢? 
(LRe (g) Jic and [4 (¢)/q*]i:, say) are equal in mag- 
nitude, but opposite in sign. This may be proved as 
follows: 

In these graphs, the matrix elements for the two 7” 
operators contain different momentum indices; such is 
not the case in types (i) and (iii). The consequent 
[Xp (q) ];; assumes the form 


(k-q)(k’-q) 
[Ke(g) i= Xo (Ce ) 


kk’ 2 


qq 


x f(|k|,k-q,|k’|,k’-q). (1.7) 
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Writing 


(k-k’) = (1/g*){ (k-q) (k’-q)+ (kXq)-(k’Xq)}, 


(1.7) reduces to 


(1.8) 


(x29) ]=-¥ ae -q) 


i 
x f(\k|,k-q,|k’|,k’-q), (1.9) 


: (k-q)*(k’-q)?7 C? C 
a ae oe 
, kk'g! e+¢ 2 £?+( 2 


2 


We now replace sums by integrals in the usual manner, 


and perform the integration over &” [noting that 


In(@/C)=1/p]: 
dy f 
(1+)! 0 


1 x 
[6 ]:= ef 
48 wv, 


| | 
Lit y2)+( 1+~’?)! 


dy’ 
(1+y’) 


(1 (1+)! 
+[imo—14 — 


p y 


xin((1+y9)-y)}+- 0¢ "| (1.12) 
1 .4 

=—p[1+ O(p) ]. 
6 


( 


(1.13) 


Summing over all ladder graphs leads to the result 
[bs Ju.c.= —Lae Jr.c.=tol1+ O(9) ]. 


The fact that this result is of order p, rather than p’, 
is a consequence of alternate energy denominators con- 
taining no phonon contributions: the ensuing inter- 
mediate-state denominator é(k’’)+ é(k) in (1.11), for 
example, then produces a factor 1/p. It is thus easy to 
understand why the remaining terms of type (ii) [such 
as Fig. 5(a) and (c) ], which do contain phonons in the 
intermediate state, are found to be of order p’. 

Contributions to 6. and az" also come from type (i) 
graphs, by expanding the matrix elements for /;° ac- 
cording to (4.6). The resultant integrals are similar to 
those for bo”, except for a factor of the form C*/(C?+&)°. 
Inspecting the results for bo’, we find that introduction 
of such a modification produces a factor exp(—1/p) ; 
thus, within the approximation defined above, (I.1), the 
contribution from type (i) terms is negligible. 

Finally, type (iii) terms, which are proportional to the 
energy corrections of order y‘, are of the form 


‘f 


(1.14) 
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[Kp (q) lee . —[S(g) Nii. (1.10) 


Thus for terms of type (ii), there is no need to find a2 
and 6," separately. 

Now we calculate a typical member of the ladder 
graph terms, namely that corresponding to Fig. 5(d). 
The contribution to 6. from this graph (bo ];, say) is 


(1) 


Mr) 


(ur)? (ue +’)? (00 +00)? (0'4"” — v'0"")? 





(1.11) 


That is, the consequent contributions to 6. and a2" 
are proportional to p?. 

Using all these facts, and in particular Eq. (1.14) for 
the ladder graphs, we get the results (4.27) and (4.28). 


APPENDIX II 


We now extend the procedure of Appendix I to 
evaluate the coefficients ao, ao, bo and bo in nth 
order of our perturbation scheme. This task is greatly 
facilitated by the calculation, which we have just 
described, for the case of m=1: most of the remarks 
made there may be generalized to arbitrary . 


(1) ay” 

It has been seen earlier [ (4.16) and (1.2) ] that the 
1/q¢° term in both Kg (gq) and Kg (g) has a coefficient 
zero. This property holds quite generally, for Kz"" (q). 
Thus 


ag =P {Zk -k—3 (k-q)?/@?} f(\k!) 
=(), 


(11.1) 
(11.2) 


(2) by 


Only graphs of the general type (i) need be considered 
to find b)‘". From the definition of K‘” (g), such graphs 
will be of order y?("*—that is, they contain (m+1) 
internal phonon lines, each accompanied by a factor g’. 
Thence, on replacing sums by integrals, a factor p"*! 
arises. 

Most of the possible graphs may be neglected, for the 
reasons detailed in dealing with the case of n=1: they 





k 
= 
\ 
ne ei ee 
\ \ 
\ x os ‘\ 


\ \ ‘ \ 
\ N 


wikiees 


\ 














Fic. 8. Graphical representation of a ladder graph term, 
in mth order. 
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give contributions smaller than the leading one by 
factors of @/Er or (1/p) exp(—1/p). As before, there 
remain graphs (containing fermion self-energy parts) 
which lead to a result for bo‘ differing from p"+! by a 
purely numerical factor [e.g., $(1—In2) for n=1]. 
Hence we get Eq. (4.35). 





EFFECT 


(3) a, b.(™ 


We first consider type (ii) terms, which give contribu- 
tions to a2‘" and 4.‘” equal in magnitude, but opposite 
in sign. 

The contribution from a typical ladder graph in mth 
order (as illustrated in Fig. 8) may be written as 


2/e\? (k- q)?( k’- @)” C2 g’a n+l 
cone) ESSE SSE 
CN m/ uk’ ki---kn kk'q! ¢2 e240? 


(uv,+10)? (A+H)*( (uyuo— 


xX 
2é(@+ E+ &)( 2+28)(o+e+e +23): 


Making the usual approximations, this reduces to 


Dita)*( (A'+u')*- Vn’)? 
(obinta 428) (2¢-+28) 


(u ntl — 





see -f — C'dé f dé | 1 
“t= 0 (C2? 24 £2)! 9 Ce” (C24 E14 (C24+22)3] 
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Performing the integrations over &, ---, &,, one by one, 
and keeping only the most important terms, we get the 
direct analog of Eq. (1.12), 


p n+l D dy re) dy’ 
ew ee 
4 0 (1+)! 0 ( 1+y’) 
1 
. a i! 
La+ty)t+ (ty) Ip" 
=4() [1+ 0(0)} 


[1+ 0(e)] (IIS) 


(11.6) 


Summation over the similar contributions from all 
other ladder graphs now introduces a factor which is of 
order 4"*!; thus we have that the contribution of the 
ladder g graphs to bo” is pL1+ O(p) ], up to some purely 
numerical factor. As in the case of m=1, all other 
possible graphs of type (ii) give contributions which are 
smaller than those due to ladder graphs by factors of at 
least p. 

It is easily seen that type (i) contributions are at least 
of order p"*!—energy denominators of the type which 
give rise to factors 1/p in the ladder graphs do not occur 
in type (i) graphs. Also, the argument used to show that 
for n=1, type (iii) terms are proportional to p’, is readily 
extended to show that, in general, these terms are of 
order p”*, 

Thus the lowest-order contribution to ao” and b.‘™ 
from type (ii) terms is proportional to p, while that from 
types (i) and (iii) is proportional to p"*!. The relation 
between the coefficients valid for type (ii) terms may 
then be utilized, to get 


bo = —ay[1+ O(p") ]. (11.7) 


(@)"t! 


— (11.4) 
aerry Ete ++ (Ext 
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These results now give (4.34) and (4.36). The fact 
that they are indefinite up to purely numerical factors 
does not concern us, as the essential feature which is 
needed is that a2" and b2‘") remain of order p (being 
equal but opposite in sign), while bo‘” is of order p**?. 


APPENDIX III 


In this appendix we evaluate the integral (5.1) 
pertinent to the magnetic response at large g. That is to 
say, we calculate 


ly - 
on ff 0 or Obay +2}4[(y—2)+e}! 


(ya? 2}. ¢?) 
xs1- | (III.1) 
lL Ciyta) +e (ya) +e)! 


(3x?—1) 


in the limiting case of «<1. 
First, we note that, putting e=0, 


10) =f ‘i if tn: 


de —, (III.2) 
1(0)=0. 


(111.3) 


A direct expansion of (III.1), as such, is difficult; we 
find it easier to expand the derivative of 7(e) with regard 
to e. Since the constant term is known, (III.3), (e) is 
thus obtained. 

Introducing new variables of integration: 


(111.4) 
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(111.1) becomes 


w 8 1 8 
10-3 f asf at f asf a| 
1 s—2 0 -f 


(2(s—)?—1] 
(°+eé)!+(P+e)! 
| st+é 
xi i- -t. 
| [iste)(e+e)]}] 
Then, differentiating with respect to e, 


dl (e)/de= —4eJ, (IIT.6) 


where, for convenience, J is separated in three parts: 


J =JSitSetJs, 


feLells 
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xX 
[( Fe Ce Ba a 
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x|3— (rte + ‘ ) 
| Ste (ste(rt+e) ete/ | 
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(st+e*) | 
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Cieteye(e+ey yl (e+ey(e+e)! 
1 1 
+ — 
(*+e)(PF+e) (9+e&)'(P+eé)! 


-{ f asf al 
1 s—2 
[(s?+e)§+(F+2é))](e+é)(P+eé)! 
1 s 
j.=-3f asf dt 
—, (IIIL8) 


x a - 
[(s?+e)!+ (2+)! ](se'+é)'(P+e)! 


(111.7) 


(2+)! 
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[S™ (q)/@ = —-LK 8 (q)]: 


a0, yoy”) (u_/v_+ uo’ 
Ete ete,’ $e )(z 


‘Yay U4 
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u,’ means M454, and so on. 
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From (III.6) it follows that those terms of J which are 
regular in € give contributions to /(¢€) which are, at 
most, of order e-—this order we neglect. Thus we confine 
attention to those terms in J;, J2, and J3 which are 
singular in the limit «e— 0. 

In J; and Js, putting e=0 gives rise to, at worst, 
logarithmic singularities; these are then separated out, 
to give 


J,=—12 Ine+(terms regular ine), — (III.9) 


(IIT.10) 


Jo=2 Ine+ (terms regular in e). 


A more detailed treatment is required for J3, which 
has a stronger (e~') singularity. Performing the inte- 
gration over /, 


1 ds (+e)! 
1.=—6f 7 a in( (111.11) 
o § (s?+ é)! € 


This may be expressed as 


6f ¢' dr 
Js — If = 
€ 0 1-r° 


In(1/r) 


n dr 
-f satianrsomt init) (IIT.12) 
r(1+r°)! 


— (3n?/4e)—6 Ine+(terms regular in «). _ (III.13) 


Thus, collecting together the expressions for J;, Jo, 
and J3, we have 


dI(€)/de= (32*/8)+8e Ine+ O(e). —_ (IIT.14) 


Using (III.3), the final expression, used in (5.2), is 
now obtained: 


ox 32 

I= e( 14+ eine 0(¢) ). (111.15) 

¢€ 3? 

The simplification of Eq. (5.3) for G™(q) is carried 
through in a precisely analogous manner. 


APPENDIX IV 


The quantities Kg (g) and §”(q)/q’, defined by 
(4.14), are found here in the case of g>u. 

To illustrate the argument, we begin by evaluating 
the contributions [K »” (q) }, and [5 (q)/q?]:, from the 
ladder graph shown in Fig. 5(d). 
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We now replace sums by integrals, and make the change in variables = (¢/u)Cy, and k-q= kg cosd=kgx, etc. 
Terms of relative order (u/g)? are ignored; this corresponds to taking the normal-state values of « and v. In such 
case, the presence of matrix elements (w,v_—«_v,) amounts to the requirement |«| > |y!, while the occurrence of 


(u,u_+2,v_) implies |y! > |x!. (IV.1) now becomes 


3 is 1 1 1 
[S$ (q)/¢? = —-LK es (q) h= an 7 f ax f ax’ f dx" 
2 q- 0 0 0 


xf ay f ay f dy" — —__—- a ’ : 
~~ 3 a2 U(at a’) (ata) (Ate! +y"+2+y) (Ata +y' +2" —y" +22) 


where 
(IV.3) 
Evaluation of (IV.2), and all other integrals associ- 
ated with Kg (gq) and SG (q)/q’, is simplified by con- 
sidering two regions of g values separately: 
(a) po/C>q(>wn), 
At, 
(b) pa/C<q(<kp), 
A<1. 


A= (u/q) (@/C). 


Region (a). 


[S$ (9)/¢@ = —-LK8 (gq) hh 


$£ 
=——pJ[1+0(1/A)], (IV.6) 
16 ¢’ 


Equation (1V.2) becomes 


where J is a numerical constant {7= (1/15)[1+11 In2 
—6(In2)?—32?]}. A similar result holds for terms of 
type (ii) in general, so that for these terms, both 
Kp (g) and G" (g)/q? are proportional to (L/q")p’. 

It is readily verified that when terms of types (i) and 
(iii) are calculated, by a procedure analogous to that for 
the example just given, they give contributions to 
Kp (g) and G” (q)/g? which are, in essence, of the form 


G(q)/¢?]_ L 1 A’dx x2 
' |: ef as | (IV.7) 
Kp (q) g % (A+x)*L1—3x 


Thus for A>1, the contribution of these terms to 
C (gq) /g? has a leading component (L/g?)p, while their 
od \Q)/9 § com} q 
contribution to Kg (qg) is smaller by a factor 1/A. 


axa’ A? 


(IV.2) 


To summarize, in region (a) we find by adding up all 
terms, 


Kp (g)=01(L/q?)p?L1+ O(1/A) J, (IV.8) 


G (q)/g=o2(L/@)pL 1+ 0(1/A)],  (IV.9) 


where o1, o2 are numerical factors. 
Region (b).—Returning to Eq. (IV.2), we find that 
for A<1 it becomes 


—[Ke (gq) 1 
= (numerical constant) (/q") 
X p?A*L1+ O(A InA) ]. 


[SG (qg) Qh 


(1V.10) 


Results of this form hold generally for type (ii) 
terms, in the region (b); they differ from those for 
region (a) by factors A’. 

For type (i) and (iii) terms, the form (IV.7) may be 
simplified, to get a contribution to SG (q)/q? which is 
again like (IV.10). However, the contribution to 
Ke‘ (qg) of such terms is seen to be proportional to 
(L/q*)p’A: that is, it is larger by a factor 1/A than 
contributions of type (ii). 

Collecting the above results, in region (b) we have 

Kp” (q)=03(L/q?)p*AL1+ O(A Ind) ],  (1V.11) 
SG (qg)/g=o(L g’)p?A*L1+ O(A InA) ], (IV.12) 


where o3, o4 are again numerical factors. 
Hence the result (5.8) follows: 


ai = (a /A)[1+ O(A InA) ]. (1V.13) 
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The Hamiltonian of a Bloch electron in a static magnetic field is 7 =} P?+-V (r), where V(r) is the periodic 
potential, P=p+A/c, and A is the vector potential giving rise to the magnetic field 3. We consider the 
case of a nondegenerate band m. It is then shown that, with an error vanishing with 3 like 3CN* (NW arbi- 
trary), the eigenstates of H can be calculated from an equivalent Hamiltonian H,,(P) with the following 
properties: (1) It is a one-band Hamiltonian, obtained by transforming away all relevant interband matrix 
elements. (2) It depends only on the gauge-covariant operators P*. (3) It has the periodicity property 
H,,(P+K)=H,,.(P), where K is an arbitrary reciprocal lattice vector. (4) It can be written as a series 
H,,(P) = :20"s'Hm;«(P) where s=35C/c and the functions H»,;(P) are completely symmetrized in the 
noncommuting operators P*. Properties (3) and (4) can also be summarized in the equations H,,(P) = Lia 
Xexp[iR™-P], where the R“ are lattice vectors and the a can be expanded as a = Z;2o%s'a;. An 
algorithm is given for the construction of the ,,; and carried through for i=0, 1, 2. The formalism is not 
restricted to the neighborhood of the bottom and top of the band. We believe that the equivalent Hamil- 
tonian H,,(P) provides a sound basis for a discussion of wave functions and energy levels of Bloch electrons 


in a magnetic field. 


1. Introduction 


HE study of the physical properties of metals and 

semiconductors in external magnetic fields has 
been among the most fruitful methods for obtaining 
insight into their electronic structure. In many cases the 
experiments give us information about the electronic 
energy levels in a magnetic field. Examples are the dia- 
magnetic susceptibility, De Haas-Van Alphen effect, 
cyclotron resonance, and magneto-optic effects. 

It is therefore not surprising that the theory of the 
motion of Bloch electrons in a uniform magnetic field 
has received a good deal of attention. Following the 
classic work of Landau! on the quantum theory of free 
electrons in a magnetic field, the first analysis of Bloch 
electrons in a magnetic field was carried through by 
Peierls.2, This latter work was based on the tight- 
binding approximation and therefore its results have 
only qualitative validity. Since then a great many con- 
tributions to this problem have been made, some of 
them dealing with the individual energy levels,>~” 
others emphasizing the free energy of the entire sys- 
tem.''- However, because of the great mathematical 

*A preliminary account was published in Proc. Phys. Soc. 
(London) 72, 1147 (1958). 

1 L. Landau, Z. Physik 64, 629 (1930). 

2 R. Peierls, Z. Physik 80, 763 (1933). 

3]. M. Luttinger, Phys. Rev. 84, 814 (1951). 

4E. N. Adams II, Phys. Rev. 85, 41 (1952); L. Onsager, Phil. 
Mag. 43, 1006 (1952). 

5 J. M. Luttinger and W. Kohn, Phys. Rev. 97, 869 (1955). 

6 P. J. Harper, Proc. Phys. Soc. (London) A68, 874, 879 (1955). 

7 T. Kjeldaas and W. Kohn, Phys. Rev. 105, 806 (1957). 

8 A. D. Brailsford, Proc. Phys. Soc. (London) A70, 275 (1957). 

9G. E. Zil’berman, J. Exptl. Theoret. Phys. U.S.S.R. 32, 296 
(1957) [translation: Soviet Phys. JETP 5, 208 (1957) ]. 

0G. E. Zil’berman, J. Exptl. Theoret. Phys. U.S.S.R. 33, 387 
(1957) [translation: Soviet Phys. JETP 6, 299 (1958) ]. 

"LE. D. Landau and D. Shoenberg, Proc. Roy. Soc. (London) 
A170, 341 (1939), Appendix. 

2 A. H. Wilson, Proc. Cambridge Phil. Soc. 49, 292 (1953). 

8. N. Adams IT, Phys. Rev. 89, 633 (1953). 

“J. M. Lifschitz and A. M. Kosevich, J. Exptl. Theoret. Phys. 
U.S.S.R. 29, 730 (1955) [translation: Soviet Phys. JETP 2, 636 
(1956) ]. 


complexity of the problem, many authors have found 
it necessary to use one or the other uncontrolled 
approximation, so that the reliability of their results 
remains often in doubt. Thus much work is based on the 
so-called single-band Hamiltonian 


H=e,(P), (1.1) 


where €,,(k) is the energy band in question,!** 
P=p+A/c, 


and A is the vector potential giving rise to the uniform 
magnetic field. At least some interband matrix elements 
are left out of account in such theories and in certain 
cases it has been shown that this may lead to very 
serious errors. 

Two relatively recent developments have been of 
particular interest. One is an expression due to Onsager,' 
which relates the levels in a magnetic field to simple 
geometrical properties of the energy bands and applies 
to bands of arbitrary shape. This expression has been 
extremely helpful in analyzing De Haas-Van Alphen 
experiments. However it is derived by means of a semi- 
classical argument whose range of validity is, as the 
author points out, not entirely clear. 

The other question which has been recently discussed 
by several authors®:** concerns the extent to which 
magnetic levels, highly degenerate in the absence of a 
periodic potential, are spread into bands. That such 
banding occurs is beyond any doubt, and may be 
demonstrated in some simple examples. But the width 
of the bands is a matter which is not settled at present 
and there is substantial disagreement between different 
authors. 

The present work was undertaken in the hope of 
clarifying some of the existing uncertainties. It is in a 
sense an outgrowth of some earlier work,®”? but unlike 
it is not restricted to the vicinity of the bottom or top 


(1.2) 


15a Atomic units are used in this paper. 
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of an energy band. In this paper we derive an effective 
single-band Hamiltonian as well as the corresponding 
basis functions. The diagonalization of this Hamil- 
tonian and the resulting energy levels, wave functions, 
and free energies will be discussed in a second paper. 

Following is a summary of the results obtained in 
this paper. We are concerned with the solution of the 
Schrédinger equation 


Ay= (3P?+V)y= HY, (1.3) 


where P is defined in Eq. (1.2) and V is a periodic 
potential. For A=0, the eigenfunctions of H are the 
Bloch waves 


(1.4) 


Cnk = Unx(re'*'t, 


and the corresponding eigenvalues are the energy bands 
én(k). We fix our attention on solutions of (1.3) which 
in the limit of vanishing A go over into Bloch waves be- 
longing to a nondegenerate band denoted by m. 

In the presence of a magnetic field, the Hamiltonian 
H in (1.3) has nonvanishing matrix elements between 
Bloch waves associated with the band m and those be- 
longing to other bands. We shall explicitly construct a 
new set of functions ¢n,x such that, in a certain sense, 
H $m can be expressed as a linear combination of the 
Zmx’ With the same m. In fact we shall show that 


H &me™ Xv Gmv’(k’ | Hn (P) | k), (1.5) 


where H,,(P) is an explicitly constructed function of 
the operator P, and the notation (k’| |k) denotes a 
matrix element between plane-wave states k’ and k.1*» 
The meaning of the phrase “in a certain sense” and 
of the ~ sign in (1.5) is the following. We construct 
Zax and H,,(P) by a step-by-step process, which can be 
carried to arbitrary order NV in the magnetic field 3. 
If we stop at the Nth order, the equation (1.5) is cor- 
rect apart from terms which vanish with 5€ like 50%", 
We do not yet know if for small enough 5 our process 
converges strictly or only in an asymptotic sense. To 
simplify our langugage, we shall assume the former, but 
if in fact the convergence is only asymptotic our results 
may have errors which vanish however more rapidly 
than 3C%, where N is arbitrary.!® 
The new basis functions gnx have a “periodicity” 
property similar to that of ordinary Bloch waves. That 
is, if K is a reciprocal lattice vector, 
Smk4+K= + Emx. (1.6) 
The sign here depends on the details of V(r) as well as 
on K. It is extremely convenient not to restrict k to a 
single Brillouin zone but to allow it to run over a very 
large volume in k space. Because of (1.6) the Gn are 
then of course not linearly independent, but the diffi- 


15 For earlier discussions of an effective Hamiltonian, see 
especially references 3 and 4. 

16 This possibility is of importance in the question of the 
“banding” of the magnetic levels. 
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culties associated with this redundancy are more than 
balanced by its advantages. “ 
The effective one-band Hamiltonian, H,,(P) includes 
all interband effects. If we define 
s=H/c, (1.7) 
it can be expanded in the form 
Hn(P) =H m;o(P)+5H m.1(P)+s2H m,2(P)+--+, (1.8) 


where each Hn;; is a completely symmetrized function 
of the noncommuting operators P*. Furthermore, if we 
denote the lattice vectors of the crystal by R“, each 
H,,;; has a Fourier series expansion of the form 
Hy», (P)=So1a; exp(iR™- P). 
Explicit constructions for the functions 


Am; (kK)= Xai expGR-k) 


(1.9) 


(1.10) 


(and hence for the a;!) are given. For i=0, we find of 
course 


Fm:o(k) = em (kk), (1.11) 


which gives rise to the simple one-band Hamiltonian 
(1.1). 

In view of (1.8) and (1.9), H,(P) can also be ex- 
pressed as a Fourier series, 


A, (P)=1a™ exp(iR© - P), (1.12) 
where 


a = a9 + say + sa,gQ4---, (1.13) 


From this form the important periodicity property 


H,(P+K)=H,,(P) (1.14) 
is apparent. 

The Hamiltonian H7,,(P) of Eq. (1.8) is far from 
unique. Any unitary transformation U(P) with the 
appropriate periodicity properties will lead to an equiva- 
lent but different Hamiltonian. The leading term 
Hm;o(P) is of course common to all these forms. 

When the crystal in question has a center of inversion, 
only even powers of s occur in our expansion (1.8). In 
this paper the expansion is explicitly carried out up to 
order s?. When a center of inversion is absent, also odd 
powers of s occur. For this case the expansion is ex- 
plicitly carried through up to order s. 

This is the point where the present paper stops. It 
remains to find the solutions of Eq. (1.3) corresponding 
to the band m. These can be written in the form 


y= YA m(k) Pmky 


and by (1.5) will satisfy the Schrédinger equation, if 
A,,(k) satisfies the equation 


(kn (P)| k’) An (k’) = EA), (k). 


(1.15) 


(1.16) 


The solution of this equation and related questions will 
be discussed in a subsequent paper. 

The results derived in this paper have a very simple 
structure. It is therefore a pity that the methods by 
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which they are derived here are shockingly compli- 
cated. Also the particular expressions we obtain for the 
higher-order Hamiltonians in Eq. (1.8) are very in- 
volved and depend rather surprisingly on such unex- 
pected quantities as the values of the normalized 
Bloch waves at the arbitrarily chosen origin of co- 
ordinates, We have already mentioned the lack of 
uniqueness of the Hamiltonian. Perhaps much simpler 
expressions exist and much simpler derivations. If so, 
we hope that they will in time be found. 


PART I 
2. Formulation of the Mathematical Problem 


Our problem is the solution of the Schrédinger 
equation 


Hy= (3P?+V)y= Ey, (2.1) 


where V is a periodic potential, P is the velocity 


operator 


P= p+A/c, (2.2) 


and A is the vector potential giving rise to a uniform 
magnetic field 3: 


3=curlA. (2.3) 


The boundary conditions are the usual periodicity con- 
ditions on the surface of a large box of volume 2. 

We shall be interested in the solutions of (2.1) which 
correspond to a simple band whose band index we de- 
note by m. By “‘correspond” we mean that in the limit 
of vanishing A the solutions go over into Bloch waves 
with the band index m; and by a “simple” band we 
mean a band which has only one wave function for a 
given k and which does not touch or intersect another 
band. 

Of course, except in some very special cases, such as 
V=0, an exact solution of (2.1) is not possible. We shall 
seek approximate solutions which are valid if the mag- 
netic field is sufficiently weak. However, what consti- 
tutes a weak field for one part of the spectrum may not 
for another. Loosely speaking, we may say that a weak 
field is one in which the magnetic energy of the electron 
is small compared to some characteristic energy when 
the magnetic field is switched off. In atomic units, if 
we call 

(2.4) 
this means that 

SKY, (2.5) 
where ¥ is a pure number of the general order of magni- 
tude 1. However in some cases, for example when deal- 
ing with very small effective masses, y may be as small 
as 107 or 10-*. We shall by a sequence of unitary trans- 
formations construct new Hamiltonian, equivalent to 
H, in the form of a series whose mth term contains a 
factor s". The validity of the expansion can then in 
each case be checked by inspection of successive terms. 

In most practical cases a few terms are sufficient. 
It is perhaps worth mentioning that the validity of 
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our procedure is quite independent of the temperature. 
For example, the observation of the De Haas-Van 
Alphen effect requires a “‘strong”’ field, but in the sense 


soRT. (2.6) 


Since kT at 1°K equals 10~° in atomic units, the condi- 
tion (2.5) need by no means rule out the inequality (2.6). 


3. Initial Basis Functions 


Returning now to Eq. (2.1), it is of course well 
known that it cannot be solved by simple Schrédinger 
perturbation theory in powers of s. The effect of even 
a very weak magnetic field on the eigenfunctions is too 
profound. Nevertheless it seems natural to begin by 
writing Eq. (2.1) in the representation of the unper- 
turbed eigenfunctions of H, the Bloch waves, and carry 
on from there. However, this approach led to technical 
difficulties which we could not overcome. 

The procedure which we could carry through begins 
with the following basis, introduced in reference 5: 

Xnk=Unoe*''. (3.1) 
Here tn is the periodic part of the Bloch wave 
nk=Unxe*'', (3.2) 


for k=0. The normalization is fixed by the equation 


(2n)8 
Unk Un kdl =Snn’, 
9) 
-« rell 


€ 


(3.3) 


where @ is the volume of the unit crystal cell. Finally it 
follows from time reversal symmetry that we may fix 
the phase of #,o(r) such that 


Im 4,0(r)=0. (3.4) 


When k is restricted to the fundamental Brillouin 
zone, the set X,x forms a complete orthogonal basis. 
This was the set used in the papers by Luttinger and 
Kohn® and Kjeldaas and Kohn’ that were concerned 
with levels near the bottom or top of a band, which 
involved only small values of k.!” In the present paper 
we shall not limit ourselves to levels near the band 
edges. In particular we want to be able to describe 
levels which, in a semiclassical description, correspond 
to Bloch wave packets circulating under the influence 
of the magnetic field through several Brillouin zones. 
Two courses of action suggest themselves: One is to 
hold fast to the basis of Luttinger and Kohn and con- 
sider very carefully what happens when k is on the’ 
Brillouin zone boundary. This procedure we found not 
tractable. The other course, which we adopt, is to ex- 
tend the basis (3.1) by letting k run over a very large 
volume 7 in kK space. Now since the Xx form a complete 
set when k runs over any one Brillouin zone, of volume 
r, this extended basis has a redundancy of (7/7). The 


17 The results of the present work are in agreement with the 
conclusions of these earlier papers. 
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magnitude of 7 will of course drop out of our final re- 
sults just as does the coordinate volume Q of the 
crystal. 

When we come to count states, the finiteness of both 
7 and Q will be made use of. But during most of the 
following developments it is convenient to regard them 
first as infinite. Then both r and k become continuous 
variables ranging over their entire respective spaces. 


4. Schrodinger Equation in the Initial 
Representation 


Since the functions X,x of our extended basis are 
more than complete, the solutions y of the Schrédinger 
equation can certainly be expanded in terms of them: 


y=> An(K)Xax, (4.1) 


where the symbol >> includes both summation over n 
and integration over k. However, it is obvious that the 
coefficients A,(k) are not unique. 

When we substitute (4.1) into the Schrédinger equa- 
tion (2.3), we are led to consider the effect of (H—E) 
operating of X,. The resulting function has of course 
again a highly ambiguous expansion in terms of the 
Xnk, but we are entirely at liberty to choose one which 
is particularly simple. 

We follow in this largely the procedure of Kjeldaas 
and Kohn.’ We wish first to compute the result of 
4(p+A/c)? acting on Xnx. We make the following pre- 
liminary definitions. If Q is an operator which is a 
function of «* and p%, we define 


(4.2) 


1 
k’|Q|k)=—— few. Je'ktdr. 
W1Q = ferwraens 


Further we call 


(21) 
Par? =— f Un'o* p°Unod fr. 


Q cell 


(4.3) 


With these notations we may write 


PX ac= pMnoe** 
= Xyrrn [Byrn (Kk) + prn3(Ke— ky] 
= Xn [Snrn (| p*| k) + pant (k”— kk) ], 
(4.4) 
and 


X°X n= XU oe! 
= (1/1) (0/dk*)unoe'** 
=D X wn Bnrrn(1/i)(8/0k%)5(k”— k) 
=P Xn [Onn (k’” |x| k)]. 


Now in a uniform magnetic field we can write 


(4.5) 
Pa= p2-+ s0'¢2" (4.6) 
(summation over repeated indices implied), where 


Hi=c(s®—s%), etc. (4.7) 
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Hence by (4.4) and (4.5) we can write 
PX n= DL Xn’ 
X [Onn (k”| P#| k) + pan (k”—k) ] 
A second application of P* results in 
$P*PX,. 
= XnreeLOnrn(K’ | 2P*P*| kt) + pain ®(k’ | P| k) 
+3 Don Pain *Pnin%5(k’— k) J. 
To this must be added 
LV (1) —E ]Xnx=X Xn (Vin'n— Ebn'n)d(k’—k), 
giving 
(H—E)Xn 
= Xn LSnn(k’| 3 P*P2| k)+ porn (k’| P| k) 
+ (3 Sar Prin Pant $V nin— Ebn'n)b(k’— k)). 
(4.11) 


(4.8) 


(4.9) 


(4.10) 


This may be further simplified by noting that the spe- 
cial case A=0, k=0 gives 


$ Don Pan Paint t Virn= €nOn’ny 


where e¢, is the energy of the mth band at k=0. With 
this relation (4.11) now becomes 


(H—E)Xnx= Dd Xue: (n'k’ | H—E| nk), 


(4.12) 


(4.13) 
where 


(n’k’ | H| nk) =8y al end(k’— k)+3(k’| P*P2| k) | 
+ Pnin®(k| P*|k’), (4.14) 
and 


(n'k’ | E| nk) = E5,/n5(k’—k). (4.15) 


It should be noted that the matrix (n’k’|H|nk) is 
Hermitian. 

When we substitute y, in the form of Eq. (4.1), into 
the Schrédinger equation (2.1) we obtain with the help 
of (4.13) the following necessary and sufficient condition 
on the coefficients A ,(k): 


DD Xn (n’k’ | H— E| nk)A,(k)=0. 


n'k’ nk 


(4.16) 


Now in the customary representation theory one infers 
from the orthogonality of the basis functions that 
(4.16) is equivalent to 


¥ (n'k’| H—E\nk)A,(k)=0. 


nk 


(4.17) 


In the present case, since the Xyx are not linearly 
independent, the situation is different: 

(a) Any set of coefficients A,(k), satisfying (4.17) 
will also satisfy (4.16) and hence the corresponding 
wave function, 


y=2 Ay(Kk)Xnx, (4.1) 


will satisfy the Schrédinger equation (2.1). However 
the possibility exists (and is in fact realized) that a 
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nonvanishing solution A,(k) of (4.17) may give rise to 
an identically vanishing wave function y. Equivalently, 
two different solutions of (4.17) can give rise to the 
same y. 

(b) The totality of solutions of (4.17) give rise— 
via (4.1)—to the totality of solutions of the original 
Schrodinger equation (2.1). This result is demon- 
strated in Appendix A. It is not entirely trivial, since 
solutions of (4.16) may (and do) exist which are not 
solutions of (4.17). However, these solutions are equiva- 
lent (in the sense of giving rise to the same W) to other 
solutions of (4.17). 

In view of this situation, the problem of finding all 
solutions of (2.1) can be divided into two parts: We 
first determine all solutions of (4.17). Then we con- 
struct the corresponding y’s by means of (4.1) (some of 
which may vanish identically) and select from these a 
linearly independent set in terms of which they all can 
be expanded. 


5. Elimination of Interband Matrix Elements 

We now turn to the problem of solving Eq. (4.17), 
which on interchange of the primed and unprimed 
variables becomes 


> (nk! A \n’k’)A,,(k’) =EA,(k); 


n'k’ 


(5.1) 


here the matrix (nk|H|n’k’) is given by Eq. (4.14). 
This equation is an integral equation in k space and a 
matrix equation in the band index. Our first aim is to 
eliminate by a series of canonical transformations the 
interband matrix elements in (5.1) which connect the 
band in question, m, with other bands and thus to 
transform (5.1) into an equation of the form 


Y« (k}7,,! k’)A,.(k’) = EA,,(k). 


First some preliminaries. As we have chosen the 
functions #,o(r) to be entirely real [see Eq. (3.4) ] it 
follows that all matrix elements py’,%, Eq. (4.3), are 
purely imaginary and that 


O@= a - yal « 
Pann - Pn n Prin ) 


(5.2) 


(5.3) 
in particular, 


Pant=0. (5.4) 


Further, it will be convenient to define the matrix 
operator H(P) whose elements with respect to the band 
indices m are given by the operators 
Han (P)=Snan Lent $508 Po ps6 }4- (1 —San')Pan'*P*. (5.5) 
By comparison with (4.14) we have 
(nk| H\n’k’) = (k| Han (P)|k’). 
Now we see from (5.5) that H(P) consists of a part 
diagonal in m, of zeroth and second order in the opera- 
tors P*, and an off-diagonal part linear in P*. Our 


next program is to regard P* as formally small and, by 
successive unitary transformations, to remove, to 


(5.6) 
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higher and higher order in P*, the off-diagonal elements 
of H(P) which involve the band m. 
As a first step we write 


11 (P)=exp[—S(P)JH(P) exp[S(P)], (6.7) 


where S“(P) is an anti-Hermitian matrix operator, 
which is taken to be linear in P*: 


San’ (P)=Dan*P*. (5.8) 


Expanding the right-hand side of (5.7) in powers of 

S®(P) gives 

A (P)=H(P)+[H(P),S®(P)] 
+3([A(P),S®(P)],S@(P)J+---. 


In this expansion the terms linear in P* have the matrix 
elements 


(5.9) 


(Pun *+wnn'Dan'®) P%, (5.10) 
where 


(5.11) 


Wnn' = En— En’- 


In order that 1“ (P) have no linear interband matrix 
elements involving the band m we must choose the 
coefficients Dny’* so that 


Pmn*+WmnDmn*=0, n¥m, (5.12) 


and, because of the anti-Hermitian property of S®, 


Dam®= — (Dmnn®)*. (5.13) 


Obviously these requirements still leave us considerable 
freedom in the choice of the constants Dyn’*. The 
simplest choice would be Dnn*=— (Dam*)*= — Pmn%/ 
@mn, for nx¥m, and all other D,,,*=0. However for 
reasons which will become apparent later on we make a 
slightly more general choice, namely 


(5.14) 
(5.15) 


Dan*= bnntCm?— (1 —bnn)Pmn®/Wmn ; 


Dea = (Dan*)* ; 
and 
(5.16) 


Dan'*=0, nx¥m and n’¥m. 


Here the C,,% are real numbers, for the time being quite 
arbitrary, so that iC,,*P* is properly anti-Hermitian. 

The matrix elements of H(P) involving the band 
m are now, by Eq. (5.9), 


= Pmn'*Pa'm? 
ERS ( P) = Devs éat 156+) — oe ee P«Ps4- mat 


Wmn’ 


ee wal 
p me [poe 


Wmn 
(5.17) 
(5.18) 


+ ( 1 —5mnn) | [- iC mPa ‘a> 


Aam™ (P) =[H am™ (P) Jt. 


Thus by our construction Ann™(P) contains, for n¥m, 
only terms of second order in P*. To emphasize this 
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feature we write Hm, “)(P) in the form 
Hinn™ (P)=Snnlm™ (P) 
+ (1 —8nn) {Qmn%®P* PP+- aay ey 


We now make a second unitary transformation and 
define 


A®(P)=exp[—S®(P) 1 (P) exp[S®(P)], 


(5.19) 


(5.20) 


where S® is of second order in P* and so chosen that 
Hymn has, for n#m, no terms of lower order than the 
third in P*. Proceeding as before, this may be achieved 
by taking 
San’ ® (P)= Dan ®P2P8, (5.21) 
where 
Dan? = bint mn? — (1—8nn)Omn®?/wmn 3 


Dyaw= pane’ (Dan®?)* ; 
Dan'*=0, 


Here the constants C,,%° are for the time being largely 
arbitrary, except that in view of the anti-Hermitian 
nature of S®, C,,*8P*P® must be Hermitian: 


(Caf)*PaP? =C,,28 Pa Pb. (: 


(5.22) 
(5.23) 


nm and n’¥m. (5.24) 


25) 


Clearly, this procedure may be continued so that 
after the ‘th transformation we have 
Hann (P) =8nnl mn (P) 
+ (1 —Smn)[Omn™ ‘atti Pal... Pattit... ]. (5.26) 
The subsequent transformation is then given by 
Sane? , (P) = Daggett ; ‘att Par ae ale Pew ’ (5.27) 
where 


Dug “atti = §,, 0 mee ae 
— (1—dnn)Qnn® 2), tin (5.28) 
(5.29) 


(5.30) 


Dae 
Dig ‘ 


att] — — Day ®" 21: 


ratti=() nxm and n’¥~m; 


and the numbers C,,"*"«**! are arbitrary, apart from 
the requirement that 


Cm ettipa. e » Pati =C, atti -a1 Par, at Pet, 


(5.31) 


From the power series of exp(—S(¢? 1 exp(S(+”) 
we see that 


H+» (P)=H7,, (P)+0(P). (5.32) 


Thus all H7,, for > to are identical up to order (P*)**! 
inclusive. We now define H,,(P) as the formal limit of 
Hy as t— «©. It can be written in the form 


H»(P) _ Gat pp apalpas 
+E, Pupapat..., (5,33) 
where in view of (5.32) E,®°*'** can be obtained by 
calculating H,,°). 
We shall not discuss here the question of convergence 
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of this procedure. However, in Sec. 10 we shall estimate 
the error of the solutions obtained. 


6. Commutation Expansion of H,,(P) 
We now wish to write A7,,(P) in a series of terms each 
of which is “effectively” smaller by a factor s(=3C/c) 
than the preceding one. Since, by (4.6), 


Pe= pe+ saa’ ya! 


one’s first inclination might be to order the series (5.33) 
in powers of s. However, in such a series successive 
terms are not effectively smaller by a factor of s. The 
reason for this can be most easily seen in the free elec- 
tron case. Here 


H(P) = } Papas bp pr+se (p7x®’ +x%p’) 


+ 52a’ saa!’ ya! ya’! 


(6.1) 


If we now consider solutions of (approximately) fixed 
energy, then as s— 0, the dimensions over which the 
orbits extend behave as s~, so that terms of the type sx 
occurring in (6.1) are in fact independent of s. 

A more appropriate procedure is to make what we 
call a commutator expansion of Eq. (5.33). Let us 
begin with some preliminaries. First we define 

(Pa... Pat),,, (6.2) 


as the average of P*---P* over all possible permuta- 
tions of the factors. Since for s=0 the P*’s commute, 
it is clear that the difference between P*'- - -P* and its 
average must be at least linear in s. For example, 
Papa (PP), on 1 ( Papa P2p%) 


_ i (su1e2— gaze) 


1s g@1o2— sa2al 
= ( ——— ), (6.3) 
2 s 


where the factor in parentheses is independent of _the 
magnitude of s. In general, we can write 


pa... Pat= (Pa... Pat), 4sA,, 


(6.4) 

where A, is a linear combination of products of P*, each 

of order (P*)'*. Now we define A,,; by the equation 
SAri1= Ap— (Ay). (6.5) 

Then clearly we can develop P*---P*! in a series of 

the form 

pa... Pets (pa re Pt\,, 


+5(Ai)y+ ‘ »SU2NCA re79) Avy (6.6) 
where 


[t/2]=1/2 for 
=(t—1)/2 for 


t even, 


6. 
t odd. 6-5) 


Such an expansion we call a commutator expansion. 
We now develop each term of the power series (5.33) 
of H,,(P) in a commutator expansion and then collect 
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all terms corresponding to a given power of s. This 
results in the commutator expansion of H,,(P) which 
has the form 


A1,.(P) = Am.o( P) +5 m;1(P) +52 m:2(P)+ owes 


Each term Af, ;(P) in this expansion is a linear combina- 
tion of completely symmetrized products of the P*. 
We shall see later that in general all A/,,,;(P) are of 
similar magnitude so that (6.8) is a suitable expansion 
for sufficiently weak fields. 

Let us now recall that any completely symmetrized 
operator M(P) is completely defined by the function 
M(k) to which it reduces when the operators P* are 
replaced by the c-numbers k*.'* Hence each of the 
Hamiltonians M,;(P) is completely characterized by 
the corresponding function H,,;;(k). The procedure 


(6.8) 


which we have followed gives us power series expansions 
for these functions. For example, from (5.17) and (6.8) 
we see that!® 


Hm;0(k) = ot| pores aa 


re 


Wmn’ 


Jerse see, (6.9) 


1 a 


a 
al *Prn’ al TP mn’ PD am | 


, 
ad Wmn’ 


Fm.1(k) = -— 


+--+. (6.10) 


The deficiency of this formalism, as developed so far, 
is then that it is restricted to sufficiently small k- 
vectors for which these series converge adequately. 
One of the main objectives of Part II of this paper is to 
obtain expressions for the functions H,,;(k) without 
recourse to power series expansions. 


PART II 


To obtain explicit expressions for the functions 
Am; :(k) we shall be following a rather complicated 
procedure. It may therefore be helpful if we give here 
a brief outline. 

In Sec. 7 we show very easily that Hm,o(k) is just the 
energy band function ¢,,(k). This has the well-known 
periodicity property 


ém(k+ K)=en(k), 


where K is a reciprocal lattice vector. 

This suggests that also the functions A/,,,;(k) for 
i>0O might have this property. However, these func- 
tions depend on the as yet largely arbitrary coefficients 
Cr™""*** occurring in the canonical transformations 
S‘ [see Eqs. (5.28), (5.31) ]. In Secs. 8-14 it is shown 
how these constants can be chosen so that (a) the 
Ffm,i(k) can be evaluated in terms of the Bloch waves 
¢nx and without recourse to power series expansions in 


18The function k*'-- 
(Pa. «+ Pat)ye. 

Tt so a that the leading term of order k°, which is here 
given, vanishes. 


-ke* defines uniquely the operator 
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k«, (b) they have the periodicity property 
An: (k+ K)=A,,. :(k). 


This is accomplished in several steps. In Sec. 8 it is 
shown that, when s=0, the C,,’"'*' can be so chosen 
that the new basis functions generated by the unitary 
transformations S™, 


mk =>, Xnx(n'k’ | exp(S™) (expS®)---| mk), 


have the property of being real at the arbitrarily chosen 
origin r=0. This fixes the phase of these functions— 
which are just Bloch waves—so as to assure their 
periodicity in the sense that 


Omk4+K > — Ymk- 


In Secs. 9, 10, and 11 the same results are obtained 
when s+*0. The new basis functions, 


Smu= >, Xnv(n'k’ | exp(S™) exp(S®)-- - 


are constructed so as to be real at r=0, and they are 
periodic in the same sense as ¢mx. Furthermore the 
unitary transformation matrix connecting $m with 
Xn’ is constructed without recourse to a power series 
expansion in k*. 

In Sec. 12 the algorithm for constructing the functions 
H,,,:;(k), having the properties (a) and (b) above is 
developed; and in Secs. 13 and 14 it is applied to con- 
struct the first few An.i(k) (i=0, 1, ---) for crystals 
with and without a center of inversion, respectively. 


:mk), 


7. Discussion of the Leading Term H,,,,o(P) 

In this section we shall derive very easily an explicit 
expression for H»,o(P) which makes evident the im- 
portant periodicity property 

Fm,9(P+K)= (7.1) 
where K is any reciprocal lattice vector. In subsequent 
sections, we shall discuss the higher order Hamil- 
tonians, H».;(P) with i>0. 

When the magnetic field is switched off, clearly 

H,(P) a Hm;0(P)- (7.2) 
Now the eigenfunctions of H»n,o(p)—as those of p 
itself—are plane waves exp(ik- r) and the corresponding 
eigenvalues are Hm,o(k). But these must be just the 
energies of the Bloch waves ¢m;x so that 


Aim.o(k) = ém(k). 


We recall again that a completely symmetrized 
operator M(P) is uniquely determined by the function 
M(k) to which it reduces when P is replaced by k. In 


particular, if 
=> A(R)e®-*, 
R 


An;o(P), 


(7.3) 


M(k) (7.4) 


then 


M(P)=> A(R)e®”, (7.5) 
R 
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This fact may be applied in the present situation. 
For ém(k) is a periodic function of k and hence may be 
written as a Fourier series of the form 


ém(k) = 30. ao exp(iR™ -k), 


where the R are the lattice vectors of the crystal. 
Therefore we have 


Am;o(P)= Xr a0 exp(iR™ - P). 


From this form we see at once that Hm;o(P) has the 
important periodicity property (7.1), analogous to the 
periodicity of €m(k). 

It is sometimes a convenient notation to write 


Am;0(P)= (em(P))m, (7.8) 


where €m(P) is any function of the operator P which 
reduces to ém(k) when P is replaced by k. 

_ We wish next to draw attention to the uniqueness of 
Hyn;o(P), in spite of the fact that the sequence of 
unitary transformations S‘ leading to A,(P) con- 
tained a great deal of arbitrariness [see comments after 
Eqs. (5.13) and (5.30) ]. How this uniqueness comes 
about may be understood as follows. Suppose we con- 
sider a different sequence of canonical transformations 
5S’) which also uncouple the band m from the rest. 
The resulting effective Hamiltonian H7,,’(P) must then 
be related to H,,(P) by a unitary transformation, 
exp7,,(P). Thus 


A,’ (P)=exp[— Tn(P) JA n(P) expl[Tn(P) ] 
=Ff,,(P)+[An(P),7m(P) J+: Jager 


When we now make a commutator expansion of H’(P) 
only H,(P) in (7.9) can contribute to the leading term 
Hn.'(P). For all the following terms, because of their 
form as commutators, contain one or more factors of s. 
The identity of Hn,o'(P) and Hn,o(P) is now obvious. 
On the other hand, the higher order terms in the com- 
mutator expansion (6.8) do depend on the particular 
choice of the transformations S$“. 


(7.6) 


(7.7) 


(7.9) 


8. Periodicity in k of Bloch Waves 


We have just seen that for 7>0, the functions 
Hn; ;(k) depend on the choice of the coefficients Cm" **#. 
We shall find it extremely useful that they may be 
chosen in such a way that, in a certain gauge, the new 
basis functions are real at r=0. These functions can 
then be shown to be “periodic” in the variable k. We 
begin by demonstrating this in the absence of a mag- 
netic field. 

For A=0 we have of course P= p, and therefore the 
Hamiltonian (4.14) is diagonal in k: 


(nk| H | n'k’)=6(k—k’) Ayn (k). 


Consequently the solutions of the Schrédinger equation 
(5.1) are of the form 


Ay ™ (k)= By ™ (k°)6(k—k°), 


(8.1) 


(8.2) 
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and the corresponding eigenfunctions of H are just the 
Bloch waves ¢nx* associated with the vector k°: 


Pnk*= Don’ A n’™ (K)Xnx 


= exp(ik’- r) Lin’ By (K°)uno(t). (8.3) 


In this case, where the Hamiltonian is already initially 
diagonal in k, the removal of the interband matrix 
elements leads to a complete diagonalization. Therefore 


¢mu= {expLS® (p)] expLS®(p)]-++}Xme, (8.4) 


where ¢mx is a Bloch wave associated with quantum 
numbers m and k. We say a Bloch wave, because its 
phase will depend on the disposition of the arbitrary 
constants in the transformations S$“, 

We shall now show how the C,,*""*** can be chosen 
to make 


Im[ ¢mx(0) J=0. (8.5) 


The procedure will be to assure the property (8.5) up 
to any power of k*, from which the reality for all k 
follows by analytic continuation in the variables k™, 
k®), and k().2 

We have previously chosen the functions u,o(r) as 
real. Let us now further assume that #m0(0)>0.7! Next 
we write 


mx’) =>- Xnk’ 

X (nk’| {expLS (p) ] expLS® (p)]- ++} «| mk), 
where the notation { }, implies a power series expan- 
sion of expLS“] expLS®]--- in powers of p* up to 
order (p*)‘ inclusive. Clearly gmx‘? has the form 
gm? (1) =e***[ mo (1) + Rm (1) 

+..+pal.. Re," @*(p) |, 


(8.6) 


(8.7) 


where the functions #,%!"''*! are periodic in r. We must 
now show that ¢mx‘(0) can be made real for all ¢. 


To lowest order we have 
Gm (0) =[tmo(r)e**'* ]e 0, (8.8) 


and this is clearly real. 
To first order we obtain, with the help of Eqs. (5.8), 
(5.14), and (5.15), 


mx" (0) 
_ {> ntno(L)Lbum +S nm" ) (p) Je*"*} r=0 


; _ Pnm* 
= {etl gno(r) +l tC %tmo— 2, %no—— 
nm Wnm r=() 


P nm” 
= ting) +8( iC (0)— Zz; Un0(0)- ). (8.9) 


nm Wnam 


2” This assumes of course, without proof, the analyticity of gm 
as a function of k. For one-dimensional Bloch waves this analytic- 
ity has been established under mild restrictions by W. Kohn, 
Proc. Phys. Soc. (London) 72, 301 (1958), and to be published. 

21 Tf t4mo(0) should accidentally vanish, we choose another origin. 
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As Umo(O) and u,o(0) are real, and the pn»»* are imagi- 
nary, this can be made real by choosing 


a 
m 


1 Pn 
Cn* =~ a Tn- ™*> 


1 nm Wnm 


(8.10) 


where 


1n=Uno(0)/Umo(0). (8.11) 


It will be seen that C,,*, defined by (8.10), is real as 
required. [See comment after Eq. (5.16). ] 

Proceeding in this manner it is easily seen that 
¢m«‘” (0) can be made real by an appropriate choice of 
real coefficients C,,%°***', which may be taken as in- 
variant under permutation of the indices ay---ay,. 

It may be mentioned in passing that if the crystal 
has a center of inversion at r=0, the C,,~"*-** can be 
taken as zero. 

This procedure defines ¢mx(r) in terms of a power 
series in k* of the form (8.7). By analytic continuation 
this function is then defined for all k. It clearly satisfies 
the following conditions: It satisfies the wave equation ; 
it is normalized; it is real at r=0; and it has the quasi- 
periodicity property 


¢mu(t+R) = explik-R® ]gmu(r). (8.12) 


These conditions define ¢,_ uniquely, apart from sign. 
Consequently it must have the following “‘periodicity”’ 
property, as a function of k: 


Pmk+K(T)=Ogmx(T), (8.13) 


where for a given crystal and band index m, 6=+1 de- 
pending on the vector K. Examples show that both 
signs do in fact occur. However for simplicity of writing 
we shall assume in the following that 6=1, so that 


Omk4K(T) = Om (1). (8.14) 


Where the case @= —1 leads to significant differences, 
they will be explicitly mentioned. 


9. Expansion of the New Basis Functions 
in Powers of P* 


The canonical transformations S$, S®, dis- 
cussed in Sec. 5 define a new set of basis functions which 
we shall denote by $nx: 


Par=)_, Xn! 


X (n’k’ |expLS (P)] expLS@(P)]---|nk). (9.1) 


For A=0 these were just the Bloch waves ¢nx discussed 
in the preceding section. In this section we begin a 
study of the ¢,x in the presence of a magnetic field. 
We shall first show that if we define 


ou =). Xn'k! 


X (n’k’ | {expLS (P)] expLS@(P)]---}.)mk), (9.2) 


where the symbol { }, denotes the truncated power 
series of the argument up to (P*)‘, the constants 
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Cm™***@" can be so chosen as to make 
Im[ mx‘? (0) ]=0, 


for all values of ¢. Since the phases of the Zax depend 
on the gauge of the vector potential, we must settle on 
a particular choice. It is convenient to take 


(9.3) 


A=}(xXn); (9.4) 


in this gauge we write s**’=¢**’, so that 
P= $2-+-92'x%", (9.5) 

and 
o2 = —g3=JC,/c, 


cli=g@=g%t=(, 


(9.6) 
(9.7) 


CL. 


Our final results will, however, be gauge invariant. 

The procedure of removing off-diagonal elements to 
higher and higher order in P*, which was described in 
Sec. 5, gives us $nx‘” in the form of a series 


t 


On? _ 3 War”, 
t’=0 


(9.8) 


where wx‘? is of order (P*)". Let us begin by study- 
ing the first two of these, for n=m. 
Clearly for t’=0 we have 


Wm © (4) =Xm(T) = Umo(r)e****, (9.9) 


and since all #,9(0) are real, so is Wmy(0). The next 
term is, in view of Eqs. (5.8), (5.14), and (5.15), 
Wm«™ (r) => tno(r) 

x [iC m6 nm— (1 Ra Pin*/tan jr e™*, (9.10) 
whose imaginary part at r=0 is 


a 
m 


ees (9.11) 


pn 
Imm“! ) (0) — Umo(O) (i mn ie + Tn 


nx~m Wnm 


where r, is defined by Eq. (8.11). This may be made to 
vanish by the same choice of C,,%, as in the absence of 
the magnetic field [see Eq. (8.10) ]. 

So far there has been no significant difference between 
the cases of nonvanishing and vanishing magnetic fields. 
But in higher orders (¢’> 2), the noncommutativity of 
the components of P does introduce such differences. 
We shall now show that nevertheless there exists a set 
of constants Cm™, ++ *Cm*"*'® which will make gmx‘ (0) 
real for arbitrary ¢, that these constants satisfy the 
requirement that the quantity 
x; (9.12) 


Ge ais at Pay te Pet 
” 
t 


ay a 


is Hermitian [see Eq. (5.31) ], and are real for odd ¢ and 
imaginary for even f. 

We give a proof by induction. Suppose we have 
chosen appropriate constants Cy, «+ -Cy%°°**—, Then 
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Wmx'” has the form 


WmeO=P> tno DL 
n 


G@i++-aye 


(6 nmCn*! : ‘tt Ones" s ‘) 


« Pa Bre o Pag. (9.13) 


where the constants Qnm%!''**! are real if ¢ is even and 
imaginary if ¢ is odd. Let us take the case of even t. 
From (9.13) we have, at r=0, 


Wm (0) =tmo(O) So (iC Rg °°) 


XP": . » Patgik-t) Lo, (9.14) 


where 


Ron FeQam™ (9.15) 


is real. We want to choose the C,,%"**** to make 
Im (wmx') = 0. (9.16) 


Now the product P™---P*! may be divided, in a 
gauge-invariant manner, into a Hermitian and anti- 
Hermitian operator. One way of effecting this division 
is by a commutator expansion. For example, 

Pu pa— (P™P%) +3 (Pa pPa— p22 pa) 
— (P% P22) y+ (1/2)o%22 
= (PP P22) 7+ (P11 P22) ,, (9.17) 
since 


gee }(smiet— sem) = (1/i)[Pa,Pe], (9.18) 


and is gauge invariant; the subscripts H and A denote, 
respectively, the Hermitian and anti-Hermitian parts. 
Similarly 
Pu pe Pas pas 
= (Pa Por Pas Pas), 
+ i(o™a2( Pas Pas), + gias( Par Pas), + gues Par Pas), 
+02243( Pai Pas),,-+- ga2aa( Pai Pas), gasas( Pai Pa2),,) 
sia (o%142g 73044 g 143g 204 + g1a4g 243) 


= (Pu PPas Pas), 4 (Pai Par Pas pas) (9.19) 


where the Hermitian part contains all terms even in o, 
and the anti-Hermitian part all terms which are odd 
in o. The general product P™---P** can be similarly 
decomposed. 
We can therefore write R,,“"''*** as the sum of two 
terms, 
Rgt etm th A (9.20) 


where 
Z. A, °%t Pa. .- Pat 
Q1++-a 


= z. Ry (Pa. 
i++ ae 


t 


*ssatPai,,, Pat 


= & 


Gyr++ag 


Ry@*°##( Pa. ee Pt) ao 


at and A»,™"*'@ are real. 
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We now choose 


(9.23) 


Cott 5A gett, 


By (9.22) we have 
ze 
Ql+++a@ 


Cyt’ %¢ Pa. Pat 
t 


=i DL Ryt-e(Par..-Pat)4, (9,24) 


Ql+++at 


and this is properly Hermitian. Furthermore this 
choice gives, by (9.14), 
Wm‘ (0) = Umo(0) } Ry **%* 

Qle++a@ 


t 


xXL(P"- ° » Pt) ne®* ] Wo. (9.25) 


It remains to verify that, in the gauge (9.4), the 
imaginary part of (9.25) vanishes. Since the R,,“!"***! 
are real, inspection of (9.17) and (9.19) shows that this 
will be so provided that for arbitrary ¢ 


Im[(P@- - - P%*)ye™* | 9 = 0. (9.26) 


To establish this fact substitute in the product P®- + - P*# 
the expression (9.5) and pull all factors « to the right of 
all factors p. This results in 


[ (Ps. ° » Pat)eik-T ) 9 


= pu.. + Rett jg eia2pas. ° katt cee, (9.27) 


When we now perform the operation ( )w on (9.27), 
all terms containing one or more factors o*® vanish be- 
cause of the antisymmetry of o%°. Hence we have 


[(pa. “ Pat), e*®F ) o= bu. _ ket, (9.28) 


which proves (9.26). 

When ¢ is odd an analogous demonstration can be 
given. 

We have therefore succeeded in constructing func- 
tions of the form (9.2), or equivalently of the form 


ouO=>> UnoLOn'n to, GarnOP™+ ae 
a? 


al 


+ >» Gant tP™- ‘ - Pas |etk-s 


Ql-++ae 
=> Xn (kK | bn td, GanB P+ aie 
al 


re 7 Gain tt P- ° - pat! k), (9.29) 


Qy++-ae 


which for »=m have the property (9.3). 


10. Commutator Expansion of the New Basis 
Functions; Degree of Decoupling 


The expansion (9.29) has the drawback of converging 
only for small values of &. Our previous experience sug- 
gests rearranging the terms in (9.29) in a commutator 
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expansion. We therefore write formally 


bwin XL Gun P+ D Gynt tPaPary «+ 
a) 


aja2 


=U arn,o(P)+5U nin (P) ASU wn:o(P)+---, (10.1) 
where the operators Uy’»;;(P) are completely sym- 
metrized in the P*. Because of this property they can 
be completely characterized by the functions U »’n;:(k) 
of the c-numbers k*. For example, we see from (10.1) 
that 


U arn; o(k) =Sunt>, Gan RO+ Pa Gy pO ARUR2+- dedi 
a 


aja2 


U nn; (k) = ze (40°92 /25)G yy? P rens 


a1a3 


(10.2) 


etc. The functions U’,’,;;(k) are defined by their power- 
series in k* where these converge, and by analytic con- 
tinuation™ elsewhere. 

Combining (9.29) and (10.1) gives the following 
formal expansion for $nx: 


Gu(r)=>_ tno U nn:o( P)+5U wn:i(P) 
+$°U yn2(P)+++-Je* (10.3) 
=D Xn (ke! | Unrnjo(P)+SU wn:(P) +--+ |k). 


For some purposes it is convenient to define 
F,;:(8; P)=> 2 Uno(P)U nn: i(P). 


They are functions of r and completely symmetrized 
functions of the operators P*. As the definition (10.4) 
shows, the P* are to be thought of as on the right of 
the r, i.e., not acting on it. With the help of these ob- 
jects, we can rewrite (10.3) also in the compact form 


Gax(¥)=[F p.0(r; P)+5Fn;,1(1; P) 
+s*Fy.0(r; P)+ o° - Ie-e, 


(10.4) 


(10.5) 


So far our entire development has been purely formal. 
We are now in a position to examine to what extent the 
functions ¢nx have in fact been “decoupled” from the 
Pax With n¥m. 

Let us first note that the transformation functions 
S“/(P) have been so constructed that the equation 


Han (P)(n’ | expLS(P)] expLS®(P)]---|m) 
—(n|exp[S(P)] exp[S@(P)]---|m) 
<f7,,(P)=0 (10.6) 
is a formal identity when both terms are formally ex- 
panded in powers of P*. If now these power series are 
regrouped in a commutator expansion, (10.6) is also 
clearly a formal identity to all orders of s of this ex- 
pansion. Let us now denote by [Q(P) ]; the commutator 
expansion of any function Q of the operators P* up to 


* Again the analyticity is assumed without proof; see reference 
0. 


WALTER KOHN 


order s‘. Then clearly we must have 


Haw (P)(n'|CexpLS™(P)] exp[S@(P)]-- +]; |m) 
—(n\[exp[S (P)] expLS (P)]- ++ ];|m) 
X(Hn(P)]:=O0(s**). 


Now let us denote by $nx;; the series (10.5) truncated 
after the power s’: 
Pnk;i=LF n,0(8; P)+ ne ‘S'F,, (8; P) Je** 


=P uno(n' | [expLS™ (P)] expLS@ (P)]- ++]. |m) 
xem, (10.8) 


(10.7) 


Then (10.7) can be rewritten as 
Hémx:i= >, Fmx’si(k’ | An.0(P)+ oa +s‘, :(P) | k) 
3 
+0(s**). (10.9) 


This answers our question concerning the degree of 
decoupling: %mx;; is decoupled up to order s* inclusive. 
Consequently, if we solve the one-band equation 


¥(k| (A,.(P)];| k’)A,.(k’) = EA, (k), (10.10) 
“ 


where 


(An (P)]i=Am;o(P)+5A m;1(P) _ 
+-++s'Hm,:(P), (10.11) 
the energy & and the corresponding wave function 


y=> An(k) Omk; i (10.12) 


will have errors which vanish with s like s‘*!. 
It may be remarked that if we define 


F,.(r=L Am(k)e**, 
then (10.10) is equivalent to the equation 
(Hn (P)]F'm(t)= EF m(r). 


(10.13) 


(10.14) 


11. Analytic Continuation of New Basis Functions 


In the preceding section we have seen that the new 
basis functions ¢,x(r) can be developed in a commuta- 
tor expansion of the form (10.5). As the operators 
F,,:(r; P) are completely symmetrized in the com- 
ponents of P*, they are uniquely determined by the 
functions F,,;(r;k). The procedure of the previous 
section gave us a series development of these functions 
in powers of k*. In the present section we shall show 
how they can be obtained for arbitrary k without re- 
course to a power series. 

To determine Fym.o(r; k) we first consider the case 
s=0 which has already been treated in Sec. 8. Here 
one finds 

Pmk= Ymk= Fm;o(1; ke, (11.1) 
so that 
Fm;o(¥; k) = um (1), (11.2) 


where, by our construction, Im[#mx(0) ]=0. 
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This gives us at once the result that in the presence 
of a magnetic field 


Zmk(T) = Ump(re*'t+O(s), (11.3) 


where P does not act on the argument r of ump, and the 
operator ump is considered completely symmetrized in 
the components P*. Now since 


Ump(r)e™* = thm, 4 (1/0) A(r)(K)e*F+-O(s), (11.4) 


we can also write 


Pm (1) = Um, + (1/c) A(t) (r)e*'¥+O(5) 
= Pm, + (1/e) A(x) (KF) expl—i(1/c) A(r)- 4] 
+O(s). (11.5) 


We now turn to the higher-order terms in the com- 
mutator expansion of $m. As we are interested in the 
case of small s but arbitrary values of k, it is natural to 
try and discuss the whole problem in a new representa- 
tion based on the Bloch waves at an arbitrary point g 
in k space, rather than the X,x representation which 
was based on the Bloch waves uno at k=0. In analogy 
with (3.1) we therefore define the following new basis 
functions: 


Xan(¥; $)= ¢ng(r) exp(ih-r), (11.6) 


where the phases of the gng are so chosen that 
Im ¢ng(0)=0, (11.7) 


and the sign of Regmg(0) is that which one obtains by 
analytic continuation of ¢mx from k=0, under the con- 
dition (11.7). For n#m the sign of gng may be arbi- 
trarily chosen. In complete analogy with Eqs. (5.5) and 
(5.6), we find that in this representation the Hamil- 
tonian matrix is 


(nh| H\n'h’)=(h| H,,/(P; g)|h’), (11.8) 


where 
Haw (P; $)=SnnLén(8)t+pnn®($)P%+ 35° P*PP | 
+ (1 —Snn’) Pan’*(8)P2; 


here €,(%) is the energy of band m at wave vector g 
while 


(11.9) 


(2r)8 
Pnn'*($)=— f ¢ne*Pynegdt. (11.10) 


s cell 


As before, we shall now eliminate the interband matrix 
elements of (11.9) and in this way be led to new basis 
functions Zny(r; 8). For m=n it will be shown that 
these are related as follows to the functions gnx(r) of 
Sec. 10: 


Pmn(¥; $)= Smgin(T)- (11.11) 


This connection will enable us to construct the function 
Fm;i(t; k) of Eq. (10.5) as well as the functions H,,, :(k) 
of Eq. (6.8) without recourse to power series expansions 
in k, 

We begin by eliminating the first-order off-diagonal 
matrix elements of Hnn(P; g) by means of the unitary 
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transformation expLS“ (P; g) ]: 


A (P; g)=exp[—S(P; g) ]A(P; g) explS(P; g)] 
=H(P; $)+[H(P; g),S(P; g)] 
obese, 

We take S“ of the form 


San’ (P; 8)=Dan'*(8)P%, 


(11.12) 


(11.13) 
where the Dnn*(g) must satisfy the condition 


Pmn*(8)+@mn(8)Dinn*($)=0 nm, (11.14) 


and 
Dan'*($)= —[Darn® (8) ]*; (11.15) 
here 
Wnn’ (8)= €n(¥)— €n’ (8). (11.16) 


We choose 


Dmnn*(8) = dmniCm*(g) 
— (1 — bmn) Pmn*($)/wm n (g), 


Dam*= = (Daa*)’, 
Dywe*=0,; 


(11.17) 
(11.18) 
n¥m and n’~m (11.19) 


[see Eq. (5.14) ff]. As before, we choose the real con- 
stants C,,“(g) so as to make the new basis functions 
Ema (1; $)= X Xan (5 $) 
| X(nh’|1+S5©(P; g)|mh), (11.20) 
real at r=0, to first order in P*. This requires 
Pnm*($) 


1 
—Im } ¢ng(0) —, 
mg(0) Wnm g) 


iCn*($) = (11.21) 


Continuing in this manner we construct, in analogy 
with Eq. (9.29), the functions 


Onn? (rr; 2)=>, Png Onn td, Garn™ (8) P™ 


a 


iin we pS Gain" °#(g) Pa... Pat] 


al at 


Xexp(th-r). (11.22) 
These series may be rearranged in a commutator series 
leading to an expansion analogous to Eq. (10.3) 


Pnn(¥; $)= Don One Un'n;o(P; g) 


+sUnni(P; $)+--+]exp(ih-r), (11.23) 


where the functions Uy’,;:(P; %) are completely sym- 
metrized in the P*. 
Now let us set 
h=k—g, (11.24) 
and use the following identity, valid for any function 


Z(P): 


Z(P) exp[i(k—g)-r] 


=expl —ig-r]Z(P—g) exp[ik-r]. (11.25) 
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This allows us to re-write (11.23) as 


Pnk s(t; $)=Din' One exp(—ig-r)[Unn;o(P—8; g) 

+sU wni(P; g)+--- Je". (11.26) 
Next we observe that the functions gp’, exp(—ig-r) 
=Uy’,(r) are completely periodic in r, so that we can 
write 


Gn’g Exp( —ig-r) =Don Uno( E)Cnn’ (8), 


(2xr)8 (11.27) 
Cnn (8) = f Un'0*Un’ gd X, 


Q cent 
where C,,’, is a unitary matrix. If we now define 
Oars; i(P; 2)=>. Carin (2) U arn: i (P— g; g), (11.28) 
Eq. (11.26) assumes the form 


Pnk—g(T; g) 
=P unl U nn:o(P; $)+-5U wrn;1(P; 8) +--+ Je? 
=> Xnn(k’| U nn:o(P; 8) 

+sU wai(P; $)+---!|k). 


These expressions have the same form as those occurring 
in Eq. (10.3). We shall now show that, for n=m, 
Pmk—g(F; 8) and Smx(r) are in fact identical. 

We begin by inverting the power series expansion 
(9.29) and rearranging the inverted series in a commuta- 
tor expansicn of the following form®*: 
weeds te 

K(k | Xeno P)ASN wenja(P)+-- + | kK). 
Substituting X,-x from (11.30) into (11.29) and re- 
grouping in a commutator expansion gives an equation 
of the form 


(11.29) 


(11.30) 


Pnk o(?; g) =>) Pn’ k’ 


x (k’ W n’n:o( P)+5W araa(P)+ oo | k). 
Now for s=0, and n=m, we have by construction 


(11.32) 


(11.31) 


Pmk—e(F; £)= Sm (Tr) = Ymx(F), 


where the phase of the Bloch wave is in accordance 


with (11.7) ff. Hence 
W ntm:o( P)=8n'm- 


(11.33) 
Furthermore the transformation matrix 
W an (P)=Wan'o(P)+5Wana(P)+--- (11.34) 
is formally unitary, i.e., 
Tar Worn t(P)W wrrar(P)= Sara, 


where ‘ denotes the Hermitian conjugate. 


(11.35) 


% This is of course again a formal expansion which has the 
following property: If ¢n’«’ is expressed as a formal commutator 
expansion, according to (10.3), substituted into (11.30) and the 
resulting terms regrouped in a commutator expansion, (11.30) is 
an identity to all orders in s. 
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Finally, since the original Hamiltonian H has no 
interband matrix elements connecting @mx(r) with 
Gnx(¥), m#~m, or connecting Smx—g(r; $) with Sax, 
X (r; £), n#~m, it follows that Snx—¢(r; 8) must be a 
linear combination of Znx:(r) with the same m. Thus 


W nm i(P)=5nmW.(P), (11.36) 


and we have 


Pmk- At: g)=>. Pmk’ 


x(k’ |1+5W1(P)+s2W2(P)+---|k). (11.37) 


Now let us turn to W,(P). From the unitary property 
(11.35), it follows that 


W,(P)+Wi*(P)=0. (11.38) 


Setting s=0 gives 


W(k)+W1*(k)=0, (11.39) 


so that W,(k) is purely imaginary. Next let us use the 
reality of gmk—(r; $) and Gnx(r) at r=0. This gives 


Im[> Zmx’ (0)(k’| W1(P)| k)]=0. (11.40) 
Now letting s — 0 gives 


Im[{W1(k) ]=0, (11.41) 


so that W,(k) is purely real. Hence 


W1(k)=0. (11.42) 


In the same way one shows next that W»2(k) vanishes, 
etc. This gives finally the required result 


Pukug( Ts g)= Smx( . (11.43) 


which is equivalent to (11.11). 
One important consequence of this result is the fol- 
lowing. Since by construction 


Pmn(T; K)=0 Snn(r) (11.44) 


[see Eq. (8.13) ], it follows from (11.43) that 


mk +k (1) =O Smu (4). (11.45) 


Thus, also in the presence of a magnetic field, the new 
basis functions have the quasi-periodic property (11.45). 

It is obvious how (11.43) can be used to obtain the 
analytic continuation of $nx(r) for arbitrary k. Setting 
g=k, (11.43) gives 


Omx(r) = Gmo(K; k). (11.46) 


Now by Eq. (11.29) we have 


Pmo(T; k)=> Xnn(k’ | Unmo(P; k) 
+5U wmi(P; k)+---|k), (11.47) 


while by (10.3) 


Smx(r)=>- Xn (k’ U a'm.o(P) 


+2U nim;1(P)+--+|k). (11.48) 
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On subtracting and using (11.46), we get 


0=> Xun’ (k’ | LU nm;o(P)— Unrm;o(P; k) | 
+5LU n'm;1(P)—U arm; 1(P; k) J+ eg i | k). 

As the Xyx are not orthonormal, we cannot immedi- 

ately conclude that the coefficients of X»x vanish. 


However, this fact may be established as follows. 
Set s=0 in (11.49), which then reads 


0= a. n’ Xnrel l nm: 0( kK) — Uarmo(k; k) ]. 


Hence 


(11.49) 


(11.50) 


U a'm;0(k) = Unrmo(k; k), (11.51) 


and 


U n'm:o(P)= Un'm;o(P; k). (11.52) 
Now, by using (11.52) in (11.49) and dividing the 
latter equation by s, we get 
0=> Xn’ (R’ | [Unrm:1(P)—U n'ma(P; k) | 
+5. U ntm;2(P)—U nrn:2(P; k)J+---{k). (11.53) 
Now set s=0 and obtain 


U n'm:1(P)= Uym;1(P; k), (11.54) 
etc. Thus we see that for all 7, 
U wm;i(k) = U wm; (ki; k). (11.55) 
Finally, by (11.28), we have 
U wm; i( 5 kK) = Carne (KU emi; ki). 


But at h=0,' each coefficient Uym,:(h;k) can be 
evaluated explicitly in terms of a finite number of the 
coefficients of the power series (11.22). Thus 


(11.56) 


U wm; 0(0; k) =dnm, (11.57) 


1g “102 


Onrm:1(9; k) oe ps Gar m™3( k), 


aiaz 25 


(11.58) 


etc. We see then that for example to obtain U nm;1(k) 
we require a knowledge of Gyrm™'%?(k) which may be 
obtained by two canonical transformations. This should 
be compared to Eq. (10.2) which required the summa- 
tion of an infinite power series in k*, i.e., the perform- 
ance of an infinite sequence of canonical transformations. 


12. Explicit Construction of the 
Effective Hamiltonian 
In Sec. 6 we saw that the effective Hamiltonian 77, ( P) 
could be expanded in a series of the form 
Fin.(P) = Hm;0(P)+sHm;1(P)+5*H m;2(P)+ +++. (12.1) 


We have already remarked that since the H,;(P) are 
completely symmetrized functions of the P*, they are 
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determined by the functions /7;;(k). In fact, if 


An; i(k) =X B(R)e®-*, 
R 


(12.2) 


then 


H,,.:(P) =X B(R)e®-?. (12.3) 
R 


However at the stage of Sec. 6, we had only expansions 
of Hn; ;(k) in powers of k*. We are now in a position to 
calculate these functions explicitly for arbitrary k. 

Let us begin by writing Eq. (10.3) for »=m in the 
form 


Zk (L)= >> Xnrur(n'k’| U(P)| mk), (12.4) 


where 
U,.(P)= Unrno(P)+sU n'a (P)+ Pine 
= (n’|exp[LS(P) Jexp[S®(P)]--+|m). (12.5) 

Similarly, (11.23) can for n=m be written as 
Pmu(¥; $)=d 0 Xen’ (8; $)(n'h’| U(P; %)| mh), 
where 
Unn(P; 8) 

= Un'n;0(P; 8)+5U nin; (P; 8)+-°: 

= (n’|expLS(P; g) ]expLS@(P; g)]---|m). (12.7) 


By (11.5) and (11.28), U(P) and U(P; g) are con- 
nected by the relation 


Unen(P)=D0 Carn (8) U nin(P—§; 8). 
We shall now prove the following identity. Let 
A, (P)= Uma (P) Han (P)U nm(P), 
where Uy’,(P) is defined in (12.5) and 


Haw (P)=San Len t+3P°P* + pan*P?; (12.10) 


and let 

An(P; 8)=Unn(P; 8)Han(P; $)Unm(P; $), (12.11) 
where Ux ’»(P; g) is defined by (12.7) and 

Han (P; $)=SnnLen(S)+3P°P“]+ pan@($)P*, (12.12) 


Then 


(12.6) 


(12.8) 


(12.9) 


H,(P)=H,,(P— €; 8). 
We first transform (12.9) with the help of (12.8): 


H,(P)= Unn 1(P—¢; B)Cnn”’ '(g) 
K Aarne (PyCgn (BU nm(P- g; g). 


(12.13) 


(12.14) 
Next we note the identity 
n= (tno, (p?/2M+V uno). (12.15) 

This allows us to write 
Cnn’ (BD) nrrnir€nrl ‘n’?’n’ (¥) 

= (UngUn''0) (Uno, (p?/2m+ V temo) (tn’*0,tn’g) 

= (¢ng exp(—ig-r), (p?/2m+ V) ons exp(—ig-r)) 

= (Eng +32 )bnn’ — Pnn'*(8)g". (12.16) 
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Similarly 


Can (8) Pnrrnr Cnn’ (B) _ Pnn’*(8)—g%5nn’. (12.17) 


Therefore (12.14) gives : 


A,,(P)= Van (P—§; g) 
X {8nnLen(8) +3 (P*—g%)(P*—g*) 
+ Pun'*(g)(P*—g*) }} Un'm(P— g; g) 
=f7,,(P—; 8), (12.18) 
as was to be shown. 

This relation allows one to calculate explicitly the 
functions 77,,.;(k) which determine the successive terms 
of the commutator expansion of H,,(P). We make a 
commutator expansion of both sides of (12.18), 


Fin:o(P)+sH m.1(P)+--: 


=Am,o(P—$; $)+sHm;1(P—$; $)+---. (12.19) 


We equate corresponding terms, setting P= k and g= k. 
This gives 


Am: i(k) =n; :(0; k). (12.20) 


Now suppose that H7,,(P; k) has the following power 
series expansion in P* 
F7,.(P ; k) = €m(k) + Em@ (k)P™+ E,™2(k) P™ P+ «+ - 
+ Et -8t(k) Pee ++ Parpe-+, (12.21) 
This may be rearranged in a commutator expansion 
f1,.(P; k)=Fp.0(P; k)+sAm1(P; k)+--+, (12.22) 
where 
FAm;o( P ; k) = €m(k) + Em (kK) (P*)n 
+ En ?(k)(P™P%) y+ ++, (12.23) 


aj\a2 


Aln..(P; k)=i  ——En™*2(k)+0(P2), (12.24) 


aiag § 


10270304 a143,0204 a1a4,0203 
0 o o o o 7 


aS 


ajlagaga4 s? 


Am.2(P; k)=- 


K En @228%4(k)+O(P*), (12.25) 
Hence by (12.20) 


Fm:0(k) — ém(k), 


gtia2 


An.:(k) =i 2 - —F_,@2(k), 


aiag § 


oA A3A4 1 Galas azad_t Galag,azas 


An;(k)=— & - 


ajagqaga4 s? 


Em™220904(K), (12.26) 

etc. These functions can all be explicitly calculated. 
Now by construction of H,,(P;k) the coefficients 

E,@'°**(k) have the periodicity of the reciprocal 
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lattice, so that we may write 
Ain; i(k) =E a; exp(iR®-P), 


where the R™ are the lattice vectors of the crystal. 
Hence the effective Hamiltonian can be written in the 
form 


(12.27) 


A, (P)=X1a(s) exp(GiR®-P), (12.28) 


where 


a ($)= ag +5ay+stag +--+, (12.29) 


Denoting by 7 the volume of the fundamental Brillouin 


zone, we have 


1 
aitian f én(k) exp(—ik-R)dk, (12.30) 
T 


zone 


(is%92/s) F122) 
Xexp(—ik-R)dk, 


a,0=- 


T“ zone 


(12.31) 
etc. 

Equation (12.28) has as a consequence the important 
periodicity property 


H,(P+K)=H,(P), 


where K is an arbitrary reciprocal lattice vector. 


(12.32) 


13. Application to Crystals with a 
Center of Inversion 


Using the methods of the preceding sections, we shall 
now work out the first few terms of the commutator 
expansion of the effective Hamiltonian A,,,(P) for 
crystals with a center of inversion. 

For such crystals it is easily shown (Appendix B) 
that with our convention of 


Im[ ¢ng(0) ]=0, (13.1) 


one has the result?4 


Im[ pn'n’*(8) ]= 0. 


This results in the vanishing of the terms of odd degree 
in s in the commutator expansion of H,,(P). 

Working in the representation of the Xn»(r; $), we 
begin with the Hamiltonian 


A(P; $)=Hot+ Hi4+- Hp, 


(13.2) 


(13.3) 
where 
(13.4) 


(13.5) 
(13.6) 


(Ho) nn’ = bnn’€n(B), 
(Hi) nn’ = Pnn’*($)P2, 
(He) nn’ = Sonn P*P*. 


To this H we apply a sequence of unitary transforma- 
tions expL.S‘(P; g) ] to eliminate off-diagonal elements 
to higher and higher order in P*. We take S“ to be of 


* For isolated values of g, for which ¢ng(0)=0, (13.1) does not 
fix the phase of gng. In such cases we define gng=limgg¢ng’. 
With this understanding, (13.2) holds also at such points. 
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order (P*)'. Then, up to order (P*)‘ we have Now, 


A(P; g) [3(S®) Xm Jr=o 
ee -exp(—S®)) exp(—S)H exp(S) exp(S®)- ee . 
=H) nile Pmni* Pim P«P8 exp(ih- 
oe Sena(0) ©" [P*P# exp(iht) 0 
+ {Hot [AS® ]+3[[HoS° 1S ]+[HoS® J} bmi? Pint 
+ {THiS J+ [Hs,S + [HSS] 4 ong(0) 5 —— —-[(P*P8)w exp(ih- 1) Jra0 
+45} S°}+ AH SISOS] 1 tims in 
+[Ho,S®]} +{(H2,S@]+3[[Ho,S],S ] Pmi* Pim? 
+[11, SS ]+3[[H2,SP],S] =}¢me(0) X hzhs, (13.16) 
+H Me,S]S}S®}+ 20H, SP]SO}S%) 2m tm in 
+ (1/24) (CLA 0S 1,5],S°],S] , —— oe it oi er 5) ieee 
+[[Ho,S]S}4(H, 5° ]4 [HS] and this is clearly real. The second term in (13.15) gives 
oe tee (13.7) [S@Xnh ]rmo= ¢mg(0) [Simm (P) exp(ih- r) |=0 


The first-order off-diagonal elements are eliminated 4 44 1 
by the choice + p ¢ig(0)— — (Pis* Pim? — Pim Pm?) +—P it" Pim? 
ixm Wim | Wim Wkm 
SmiD = — (S) int = ras (Pmi%/wmi)P*, ix~m 


” 


Sun =0, all other n’, n”, 9) X[P*P? exp(ih-1)]}-0. (13.17) 


But 
Here we write Pan’* and Way short for Pan’*(g) and ™ -— co gram 
Wnn’($). Because of (13.1) and (13.2) we see that the [PoP  exp(ih-r) Je0=heh?+ise®. (13.18) 
corresponding new basis functions, Therefore. if we call 


Pmg+h= exp(S“) exp(S®)+++Xma= (1-504 --+)Xmn r= o..(0 0 13.19 
= Ye exp(ih-r)—D, Lig(Pim®/wim)P* exp(ih-r) ig Pig( )/ Pme( ), ( 3. ) 
+--+, (13.9) then condition (13.15) requires that we choose 


have to first order in P* the required property a, 4 
Sua seeilaaete i ————{ ( a" in? — im” ee 
Im[ Pmg+ n(O) ]=0. (13.10) ; P) ye (0) —| ? P P P 
The second transformation matrix S®) must be 1 
chosen to eliminate the second-order off-diagonal ele- +—Pua"de ACP P®). (13.20) 
ments from H(P; g). By (13.7), this requires that an fe” 


(Hot [HS }+ LHS ]S9}4 [Ho,S®]) mi=0, 
iXxm, (13.11) 
or Sim? = Dig t? 1 P*PFPP, (13.21) 


{ Pom? (— Pmi*/emi) + (Pmi2/wmj) PjiP} P2PF where 
tamiSmi®?(P)=0, ixm. (13.12) 


Similarly we construct S“. The result is 


1 
Da? = — 
Here the band index 7 runs over all values except m. Wim 
This equation can now be solved for S,;°. Recalling 


the anti-Hermitian property of S®, we find 1 1 
P P y x|  ps*| (PP Pim— Pam) + — ps? Pon | 
Smi) = —(Sim®)*=DystP2P8, (13.13) win nm 


where 1 1 
Di? = (1/cmi) { (1/cmi) Pm Pmi? — (1/cmj)Pmj*piiP}, +| — (is* Pim? — Pim” Pmm®) +- i =" Pat [Pon 


Wim” WimWkm 


ixm. (13.14) 


All other off-diagonal matrix elements of S® can be FO tall 
taken as zero. Simm‘? is chosen so as to satisfy the con- sis ia 
dition (13.10) up to second order in P*. This requires 

evidently that ( 3 1 


- meen 2 _ ) im* 7 ."|, 1x~m (13.22) 
Im{[3 (S245 PXmn}eo=0. (13.15) 3 a J Pw Pat Ps 


WimYjm Wim” 
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while 
Sam (P) 
1 ten® | wo im? im” 
- = ral - = : = Dat) +Dact—| 


6 Dim Wmj jm Wim 


x (3P8,P7]Pe+4[P2, Py ]Pe+3[P2,Pe]P?), (13.23) 


Am 


The transformation S$“ is not required for computing 
the m-m block of /(P; g) up to fourth order in P* 
[see Eq. (13.7) ]. 

Substituting our results for SY, S®, and S® in 
Eq. (13.7) gives the following result: 


AL, (P; $)=Am:o(P; $)+5Hm;1(P; 8) 


+s m.2(P; 8)+-++, (13.24) 


where 
Ano (0; g) _ Emgy 


Fim.1(0; £)=0, 
(13.25) 
" o%g7 t+ o%7q85 4 gaighy 
Hy;2(0; 2=— X - E,%°7*(g), 
aBys s? 

and 

: | 
fan*faf— 5 Pri" Pim | 


). 
Wmi 


Pe 1 
E,,?"*(¢)=—{ — 


Wm i | Wm j 


(1 1 
x Pin Pan! — 


Wj 


Pix” Pum’ | 


Wm k 


1 1 
_ { Pmi™ Pim?5 1 — 2 pmi2® Pim> +5 Pini” Pim®} 


a 
Wmi 


1 1 1 

ios i Pi Pim? Pk” Pkm® 
¢ 9 9 
2 Wm i Wmk WmiWMmk 


| 1 im” 1 im” 
_> ral Pim Pmi° P Da) 


ixm 6 Wim Wmj Wjm 


Pim™ 
+d,“ [ona (13.26) 


Wim 


The coefficients Dj,,°° and Dj,*87 are defined in Eqs. 
(13.14) and (13.22). 

Finally, using (13.25) and (13.26) in (12.26), we 
have now explicit expressions for H,;:(k) for i=0, 1, 
and 2. 

The absence of a term linear in s will be noted. It is 
easy to see in general that for the case of a crystal with 
a center of inversion, our construction leads to an ex- 
pansion in even powers of s. 


14. Application to Crystals without a 
Center of Inversion 


In this section we construct the effective Hamiltonian 
for a crystal without a center of inversion. We again 
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choose the phases of the ¢ng in accordance with the 
reality condition (13.1). But now, in contrast to the 
case of an inversion center, we have 


Pnn’*(8) = Qnn’*($)+ilnn’®(8), 


where g and ¢ are real and in general neither vanishes. 
We make the transformation (13.7) and find, as 
before, that 


(14.1) 


Swi? os (S®) mt = ony (Pmi*/Wmi)P*, 1A~m. (14.2) 


However Smm? does not now vanish. Instead, since 


~F viel)" 


a 
im 
pe 
ixm Wim 


Fmn(0; g) ne ene( +] 


+ ¢ne(0)San'(P) ) exp(ih-1)| +--+, (14.3) 


r=0 


the reality requirement leads to 


Smm™ (P)= Dan*P*, (14.4) 
where 

bim™ 
Dam? = —1 Y rig —. 


ix~m Wim 


(14.5) 


This is enough to determine A,,(P; g) up to second 
order in P*, which is as far as we shall go. By (14.7) 
we find 


,,.(P; g) _ €m($)+ Pmm*P* 
+ {56+ Pni*Pim?/ ems 
—1(Dem "Pam? — Pom" Dn?) } P#PP 
fees, (14.6) 


Fn.o(k) - Hn; 0(0; k) =; ém(k), 


fe - ae - 
| k) =H»; 1(0; k) oe =o (Pmni* Pim? — Pmi® Pim) 


Ss Wmi 


a (Dan*Pun?— Daur ban*) | . ( 14.7) 


It will be noted that H,,.; does now not vanish. In 
general it cannot be removed by any unitary trans- 
formation. This fact is of importance for the location 
of the energy levels in the presence of a magnetic field. 
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APPENDIX A. DEMONSTRATION THAT NO 
SOLUTIONS ARE OMITTED 


In Sec. 4 it was stated that the solutions of Eq. 
(4.17) give rise, through Eq. (4.1), to all solutions of 
the original Schrédinger equation HY= Ey. As k was 
taken to be a continuous variable, the dimensionality 
of the Hilbert space spanned by the Xn» is nondenumer- 
able. To avoid the associated mathematical difficulties, 
we shall here prove the corresponding result for a 
vector space of finite dimensionality and assume that 
it can be generalized to a Hilbert space of the above 
type. 

Consider then a vector space of dimensionality D, 
spanned by the orthonormal basis vectors 


(A.1) 
(A.2) 


Ca, a=1,2,---D 


e,*6,=8.., a a’=1,->-D. 


Let H be a Hermitian Hamiltonian operator in this 
space, with matrix elements (a’|H|a) defined by 


D 
Hea= >. ea (a’|H\a), a=1,2,---D. (A.3) 


a’=1 
This operator has D orthonormal eigenvectors y‘” : 
HYOSEOY, 1=1,---D 
--D. 


J 


YOY) = 55,7 i, i’=1 
Now consider a redundant set of normalized vectors 


--ND (A.6) 


the first D of which are identical with the basis (A.1). 
Let the effect of operating on these eg with H be de- 
scribed by a matrix (a’|H|a), such that 


ND 
Hea= >, @a'(a’|H\a), a=1,-- 


a’=1 


‘ND. (A7) 
Because of the redundancy of the e,, the matrix 
(a’|H|a) is not completely defined by (A.7). But, in 
view of the completeness of the eg, some matrix 
(a’ | H1\a) certainly exists for which (A.7) holds. Finally 
assume that the matrix (a’|H|a) is Hermitian: 


(a’| H|a)*= (a| Ha’). (A.8) 


This corresponds exactly to the situation of Sec. 4. 
Now consider VD solutions of the eigenvalue problem 
ND 


> (@|Ala’)AgY=EMA, a=1,--: 


a’=] 


ND. (A.9) 


These may be taken as orthonormal in the sense 


ND 
* \* 
ze A, A =O; 


a=1 


1,l’=1,---ND. (A.10) 
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In view of (A.7), the corresponding vectors 


ND 
e=> 


a’=] 


Aw ey, 1=1,-+-ND  (A.11) 


satisfy the equation 


He®=Eg™, l=1,++-ND. (A.12) 
Clearly, because of (A.1), at most D of these can be 
linearly independent. 

We next show that also at least D of these y are 
linearly independent. For, from (A.10) follow the re- 


ciprocal relationships 


ND 
4 * | (To) a . 
é a ta’ 4) aa’, 


l=1 


a,a’=1,---ND. (A.13) 


Now multiplying (A.11) by 4, * and summing over / 
from 1 to ND gives, by (A.13), 


ND 
€a= > Aa*e™, a=1,-+-ND.  (A.14) 
l=1 


In particular therefore the D independent vectors 
€1, °°, @p can be expressed linearly in terms of the 
g‘”. Hence the latter must contain a linearly inde- 
pendent subset of D independent vectors, say 


fo a (lp) 
P ° 


? “9 (A.15) 


Since the g) are D independent solutions of the 
same eigenvalue problem (A.12) as the ¢, each y 
must be a linear combination of the g. (If there is 
no degeneracy there must in fact be a one-to-one cor- 
respondence.) It follows a fortiori that each y is a 
linear combination of all the gy“ (l=1,---ND) ob- 
tained from the solutions A,“ of Eq. (A.9). Thus by 
solving (A.9) no solutions of the original eigenvalue 
problem (A.4) are lost. 


APPENDIX B. REALITY OF MOMENTUM 
MATRIX ELEMENTS 
Consider a crystal with a center of symmetry at r=0. 
Let g be a general point in k space and choose the 
phases of the Bloch waves ¢,, such that 


Im[ ¢ng(0) ]=0. (B.1) 


We shall then show that the momentum matrix ele- 
ments are all real: 
Im[ pnn*(g) ]=0. 


Let us first assume that at g none of the set of Bloch 
energies €,(%) coincide and that none of the ¢n¢(0) 
vanish. Then if the function 


(B.2) 


Png (T) = Ung(r) exp(7g- 1) (B.3) 
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satisfies the Schrédinger equation and periodicity con- 
dition described by g, so does 


Gne( —1)*=Ung*(—r) exp(ig-r). (B.4) 


From the assumed nondegeneracy of the e,(%) we 
conclude that 


Ung*(—r)=Cu,,(r), (B.5) 


where C is a numerical constant. Setting r=0, we see 
that C=1, so that 


Ung(—T)=Ung* (Fr). (B.6) 


Now let us divide u,, into a real and imaginary part: 
Ung(¥)=Ung(F) +iwng(r). (B.7) 

Then, by (B.6) we have 

(B.8) 

(B.9) 


Ung(—T)=Ing(F), 


Wng(—F)= —Wng(F). 
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But as the operator . 
pa= (1/1) (0/dxq) 


is odd under inversion we have 


Pnn’*($) 
_— 


2 


Png P* en’ gt 


(2:)* 


) 
= Sauget —— f tae pune 
Q 


2r)3 


=S ang? +— J (inghtWnrg—tWngPn'g)dr, (B.11) 
2 


and in view of (B.10) this is clearly real. 

At special points g, where either of the conditions 
mentioned after Eq. (B.2) fails, the property (B.2) 
will still hold if one chooses gpg as the limit of gag’ 
where g¢’ — g. 
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Self-Diffusion and Nuclear Relaxation in He’ 
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Direct spin-echo measurements of diffusion coefficient (D) and spin relaxation time (7; and 72) have 
been performed on He’, with an accuracy ~2% in the range 0.5°K to 4.2°K and at pressures to 67 atmos in 
the liquid, the solid, and in dilute solutions of He* in He‘. Unactivated diffusion is observed to the lowest 
temperatures in the liquid, but not in the solid. By measurement of D at 19 atmos we find an activation 
energy of 13.7°K for the production of scatterers in He II. There is an extended discussion of experimental 


details. 


I. INTRODUCTION 


N order to resolve some questions raised by thermo- 
dynamic measurements,! we began some time ago 
an experiment to measure by the spin-echo technique 
the nuclear susceptibility of pure liquid He’ and so to 
exhibit the expected Fermi degeneracy. By the use of 
spin echoes’ it is possible to obtain, in addition to the 
nuclear spin susceptibility, accurate values of the spin 
relaxation times, transverse JT, and longitudinal 7,, and 
also of the diffusion coefficient®* of a He*® atom among 
other identical He® atoms or in Het‘. It is on this last 
that we concentrated after the cw resonance measure- 
ments of the spin susceptibility appeared.‘ 
In addition to studying the diffusion of a purely 
1 Weinstock, Abraham, and Osborne, Phys. Rev. 89, 787 (1953). 
2 E. L. Hahn, Phys. Rev. 80, 580 (1950). 
3H. Y. Carr and E. M. Purcell, Phys. Rev. 94, 630 (1954); 


hereafter referred to as CP. 
4 Fairbank, Ard, and Walters, Phys. Rev. 95, 567 (1954). 


quantum particle among its identical neighbors, we 
expected to find for dilute solutions of He* in He‘ that 
the He’ diffusion coefficient (under isothermal condi- 
tions with uniform concentrations) would increase 
rapidly with decreasing temperature below the A point,® 
and'we wanted to exhibit the absence of scattering of 
He® by He‘ at low temperatures. 

Since both the actual diffusion coefficient and the 
spin relaxation time (both in liquid and in solid) are 
measured accurately (~2%) and independently in this 
experiment, we have also some information of interest 
in the general mechanisms of nuclear spin relaxation.® 

Some examples of qualitative (and hence interesting) 
questions we hoped to answer are: 


® Garwin, Kan, and Reich, Proceedings of the National Science 
Foundation Conference on Low-Temperature Physics and Chem- 
istry, Baton Rouge, Louisiana, 1955 (unpublished). 

6 Bloembergen, Purcell, and Pound, Phys. Rev. 73, 679 (1948) ; 
hereafter referred to as BPP. 





SELF DIFFUSION AND NUCLEAR RELAXATION 


(1) Does diffusion persist to 0°K in pure He? liquid? 

(2) Does diffusion persist to 0°K in pure He? solid? 

(3) What is the detailed behavior of He’ in He‘ at 
temperatures below those required for freezing out 
excitations? Is there a Fermi degeneracy corresponding 
to a changed effective mass and the lower number 
density of the He*® atoms? 

(4) Is there a discontinuity in He’ diffusion at the 
solution \ point? 


Our early measurements on He’ were performed with 
1 cc STP of gas, but with the reduced price of He® we 
now use 200 cc STP at pressures from the vapor 
pressure to 67 atmos and at temperatures from 4.2°K 
to <0.5°K (obtained by a continuous He’ refrigerator’). 
It is of considerable interest to extend these measure- 
ments to temperatures low compared with the Fermi 
temperature to make comparison with the predictions 
of the many-body theory of Brueckner® and the 
phenomenological theory of Abrikosov e/ al. This will 
shortly be accomplished by the use of a continuous 
demagnetization refrigerator operating with a sink 
at 0.5°K.! 

II. METHOD 


The only direct method of distinguishing a particle 
from its identical neighbors is by means of the phase 
of its nuclear spin precession in an external magnetic 
field. Fortunately a beautiful and convenient method 


exists, which was discovered by Hahn? at the time of 
his invention of spin echoes. Not only does this method 
allow a measurement of the true self-diffusion of atoms 
or molecules but it is both more accurate and much more 
convenient than conventional techniques of measuring 
diffusion. 


The full derivation of this method of diffusion 
measurement is to be found in reference 2 and is 
extended by Carr and Purcell.* We shall just recall the 
spin echo method pictured in Fig. 1. The sample of 
spins is placed between the poles of a magnet producing 
a steady field Ho, which however has some uniform 
derivative over the sample in the direction perpendicular 
to Ho. A source of pulsed radio-frequency current is 
used to produce an rf field of magnitude H; at the 
gyromagnetic frequency wo=yHo, allowing one to 
produce arbitrary nutations of the spin magnetization, 
among which the “90°” and “180°” pulses are of 


particular interest. The application of a 90° pulse to the 


sample (i.e., one for which 


tptw 
f (yH,/2)dt=/2, 
t 


p 


7K. N. Zinov’eva, J. Exptl. Theoret. Phys. U.S.S.R. 34, 609 
(1958) [translation: Soviet Phys. JETP 34(7), 421 (1958); 
H. A. Reich and R. L. Garwin, Rev. Sci. Instr. 30, 7 (1959). 

8K. A. Brueckner and J. L. Gammel, Phys. Rev. 109, 1023 
(1958). 

9A. A. Abrikosov and I. M. Khalatnikov, J. Exptl. Theoret. 
Phys. U.S.S.R. 32, 1083 (1957) [translation: Soviet Phys. JETP 
5, 887 (1957). 

10 Heer, Barnes, and Daunt, Rev. Sci. Instr. 25, 1088 (1954). 
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Fic. 1. Simplified schematic of spin echo apparatus. “180” and 
“90” refer to the precession angle produced by the rf pulse. The 
spin echo sample is placed inside the coil shown. 


where H,/2 is the magnitude of the component of 
circularly polarized rf field rotating with the spin 
system, ¢, the time of application of the pulse, w its 
width, y the gyromagnetic ratio) is followed by an 
“induction tail,” a voltage at frequency wo induced in 
the coil by the equilibrium (temperature 7, field Ho) 
nuclear spin magnetization precessing about Ho. This 
induction tail is quenched in a time 7* short compared 
to the spin relaxation time because spins at different 
points in the magnet experience different fields Ho, and 
therefore get out of phase. The shape of the tail depends 
upon the shape of the sample cell and on the spatial 
variation of the field Ho. A 180° pulse is never followed 
by a tail, since the spin system immediately afterwards 
is just the same as before but inverted with respect to 
Hy. The sequence of pulses 90° and 180° applied to the 
spin system initially in thermal equilibrium produces a 
“spin-echo” at a time after the 180° pulse equal to the 
time between the 90° and 180° pulse. Neither the 90° 
pulse, the 180° pulse, nor the combination 180° then 90° 
produces the slightest trace of an echo since the echo 
arises from the precession back into phase of the nuclei 
at various points in the magnet. The precession phase 
of each of these spins was reversed by the 180° pulse 
and if each spin remains in the same position in the 
magnet, they will all come into phase exactly at the 
same time. Thus the echo shape is that of two induction 
tails back-to-back. Measurements on the echo do not 
suffer from amplifier saturation and recovery problems 
as do those which are done on the induction tail itself, 
and we therefore measure only echoes, providing always 
an additional echo at short times so that picture-to- 
picture comparisons are never made for measurement 
of the diffusion coefficient or of T:. In the absence of 
diffusion, as the time ¢ [ Fig. 1(b) ] is increased the echo 
amplitude falls as h=exp(—t/T>2), thus allowing an 
accurate determination of the transverse relaxation 
time 72, which may range from one microsecond to 
many seconds. This may be done even though the line 
width measured in a cw resonance experiment is a 
million times wider than that corresponding to T7,—that 
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is, for a sample which is in a very inhomogeneous field 
H, producing a short tail decay time T* (T*KT>2). On 
the other hand, if the atomic species has a large diffusion 
coefficient D (D=d/dt(y)) the spins in their precession 
after the 180° pulse will experience different fields than 
before, being in slightly different positions of the 
magnet, and the echo will be smaller than it would 
have been from relaxation alone. This effect may be 
very large (a few milliseconds versus many seconds for 
T:). It is also the basis for measuring D directly over a 
remarkably wide range. In fact, in the presence of 
diffusion, 


h,=exp(—t/T2) exp(—7y’G@’DF/12), (1) 


where / is the time between 90° pulse and echo, D is the 
diffusion coefficient in cm*/sec, G=0H»/dy is the uni- 
form variation of H» with position over the sample, 
y is the gyromagnetic ratio (yHo=wo), and T» is the 
transverse relaxation time of the spins. 

In our case G~3 gauss/cm; 2K 10-4 sec<t< 200 sec 
(the lower time being set by the width of the echoes, 
the larger by the liquid 7), and y for He’~2X10* 
radians/sec gauss. Thus, a large effect (damping by a 
factor 3 due to diffusion) sufficient to measure D to 
<1% may be obtained with the same apparatus for D 
between 10 cm*/sec and 500 cm*/sec, truly a remark- 
able experimental range. Over this entire range the 
fractional accuracy of the diffusion coefficient is con- 
stant, being determined by the accuracy of time delay 
measurement, the knowledge of the gradient, and the 
scatter of the data due to noise. 

That the diffusion coefficient measured is that of an 
atom and not simply the “spin-diffusion” due to 
interaction of neighboring nuclear moments may be 
seen by observing that the field of one spin at the other 
is ~1 gauss, producing a precession rate ~104 
radians/sec, corresponding with oplimum correlation 
time to a pure “spin-diffusion” no more rapid than a 
jump rate 10‘ sec to distances ~3X10~* cm. Thus 
D,=)0/3=22/31< 3X 10- cm?/sec. 

Thus, any D measured to be larger than 3X 10-" 
cm?/sec, as are all the results of this paper, is pure 
atomic diffusion. 

Our spin relaxation times 7; are frequently long 
(several minutes) and amplifier gain drift might be a 
problem in measuring successive echoes, while the 
comparison of an echo amplitude with that of a 90° tail 
involves amplifier recovery problems. We therefore use 
the pulse pattern of Fig. 2 (“90-180-180”) which 
produces automatically two echoes—the first for 
measurement of initial amplitude /;, which is a measure 


90 180 echo 


Tel > 7: ny 
cet b Fic. 2. Pulse sequence 


used in determination 
of transverse relaxation 
time 7, and diffusion 
coefficient D. Echoes are 
shown. 
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90 180 echo 90 180 echo P 
[amplitude h, amplitude hp Fic. 3. Pulse se- 
quence and echoes 


r VARIABLE 1 | used in determina- 
e 


tion of longitudinal 
, relaxation time 7). 
; +] 


of equilibrium magnetization Mo, the second for 
measurement of the effects of relaxation and diffusion. 
The sample spin-state at the time of the first echo is 
exactly the same as it was immediately after the 90° 
pulse except for a small (and constant) relaxation the 
effects of which are rigorously negligible. Thus a 
second 180° pulse produces an echo hy as described 
above and allows us to measure 7» and D. In this case 
Eq. (1) describes the time dependence of the ratio 
h2/h,, where t is now the time between the echoes. For 
liquid He*, 7, damping [Eq. (1)] is substantially 
negligible during the time ¢ in which the echo is observed 
to decay, and a single measurement of the ratio he/h, 
yields D to ~1%. Actually, if T, is exceedingly short, 
as in the case of the a-phase of solid He’, it may be 
difficult to determine the additional damping due to D 
and vice versa. The damping due to D may be reduced 
either by reducing G or by applying a method due to 
Carr and Purcell* which was derived by us inde- 
pendently" for the purpose of testing for damping due 
to convection. 

The longitudinal spin relaxation time 7, (spin-lattice 
relaxation time) is measured with our two-echo philoso- 
phy by applying the pulse train shown in Fig. 3 (“90- 
180-90-180”’), where the second pair of pulses is moved 
as a unit with respect to the first and now 


hte/hy=1—e- 4", (2) 


Thus, two photographs at different delays allow one to 
compute 7; to high accuracy. The “null method” of 
7, measurement of CP could also have been used, but 
offered no advantage in the present case. 

In practice, because of the rather large field gradient 
required for our diffusion measurements, it is necessary 
to use narrow rf pulses to insure equal Fourier com- 
ponents of H; for each spin frequency w. These narrow 
pulses are obtained by discharging a condenser into the 
resonant circuit, condensers charged to different volt- 
ages being used for the 90° and the 180° pulse. Since the 
sample diameter is 6 mm and the gradient ~3 gauss/cm, 
the spin spectrum is 6 ke wide, while with our H, rf 
pulse of exponential envelope with a decay time ~2 
microseconds the rf field spectrum extends to 80 kc and 
is flat to <1% to 10 kc. The H, peak amplitude for the 
180° pulse is ~ 160 gauss. 

Since the echo width is ~ 300 microseconds, while the 
time between the two echoes of a single pulse train is as 
long as 200 seconds, one would have great difficulty 
seeing the echoes at all at a scope beam intensity which 


4 Anderson, Garwin, Hahn, Horton, Tucker, and Walker, 
J. Appl. Phys. 26, 1324 (1955). 





SELF DIFFUSION AND 
did not completely fog the film. We have, therefore, 
triggered the sweep once for each echo at time t2~500 
microseconds before the echo is expected, thus display- 
ing the details of each echo, which are photographed 
superimposed. The scope displayed the actual un- 
rectified rf to insure linearity of the system. 


Ill. APPARATUS 
A. Cryogenics 


The Dewars used to reach 1.2°K were of conventional 
all-metal design, with extended tail assembly 2-in. o.d. 
and 1}-in. i.d. reaching into the center of the resonance 
magnet. Attached to the nitrogen bath was a copper 
tail for radiation shielding of the He* bath which was 
contained in the inner tailed Dewar. A guard vacuum 
was used between He* and N2 Dewars separate from 
that which isolated the N. Dewar from the outer 
jacket to allow quick cooling of the inner Dewar if the 
occasion demanded. The N2 Dewar was refilled auto- 
matically by an N2 level control and had an evaporation 
rate of 0.5 liter per hour. The 1.5-liter Het Dewar with 
the experimental apparatus inserted had an evaporation 
rate at low temperature of 0.1 liter per hour. Tempera- 
tures to ~1.2°K were reached by means of a 46 cfm 
Kinney pump located in the next laboratory and 
connected by 6-in. diam tubing. Temperatures between 
0.5°K and 1.2°K were readily obtained by the use of a 
small Welch pump modified to act as a continuous He? 
refrigerator,’ using 80 cc STP of He’. 

Adequate temperature control was achieved by 
means of a mercury manostat in the high-temperature 
range (above 1 cm Hg pressure) and by manipulation 
of throttle and bypass valves in the low range. 
Secondary carbon resistance thermometers (Allen- 


Fic. 4. Schematic of low 
temperature apparatus. The 
helium Dewars are of con- 
ventional design and are 
not shown. A—needle valve 
used to control continuous 
flow of He* to B—evapora- 
tor, where cooling to 0.5°K 
takes place. C—carbon re- 
sistance thermometers and 
heater. D-isothermal space 
filled at room temperature 
with 1 atmos of He.A E— 
filling capillary for space D. 
F—sample cell. Not shown 
are the leads to the sample 
cell and resistance ther- 
mometers. 
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Fic. 5. He® sample cell. The rf coil is wound on a split Araldite 
form (parting line shown), the filling tube is inserted and the 
assembly is cast into the shape shown. 


Bradley j'5 watt and Speer Carbon Company } watt) 
were calibrated against the vapor pressure of liquid He* 
using the Ts5z scale. A calibration was performed upon 
each cooling of the apparatus. The drifts were significant 
for the Speer but negligible for the Allen-Bradley 
resistors. Resistance values were measured on a 100 cps 
Wheatstone bridge constructed for the purpose and 
yield temperatures accurate to 5 mdeg. The sensitivity 
was, of course, much higher. Temperatures below 1.1°K 
were determined by extrapolation, using the 3 constant 
formula of Clement and Quinnell.” Different ther- 
mometers indicate the same temperature upon extra- 
polation, when in thermal equilibrium at 0.5°K. The 
estimated uncertainty due to systematic errors is 
<0.010°K at 0.5°K." 

The experimental apparatus inserted within the He 
Dewar is shown in Fig. 4. One can see that all leads are 
brought in through either Het or He® gas, the needle 
valve (A) and refrigerator section (B), the sensing and 
heating resistors (C) in the refrigerator space for use 
with an automatic temperature regulator not yet 
needed. Below the refrigerator section is a region (D) 
filled with He‘ through a capillary (£) to insure good 
thermal contact between the sample cell (F) and the 
refrigerator. Starting from 77°K one could reach 1.2°K 
within 20 minutes and 0.5°K in five minutes more. 


4 


B. Sample Cell 


The sample cell, Fig. 5, is made of cast araldite, Ciba 
No. 502. It is fabricated in two pieces (parting line 


12 J. R. Clement and E. H. Quinnell, Rev. Sci. Instr. 23, 213 
(1952). 
18N. Kurti (private communication). 
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indicated), the coil is wound on, filling tube inserted, 
and is then cast into the form shown. The two disks 
(2-mil copper foil) on the filling tube form feather-edge 
seals, and are necessary to prevent leaks which other- 
wise appear upon recycling several times (slowly, to 
prevent cracking) to low temperatures and high 
pressures. The sample cell is tight to superfluid helium 
in either direction at high pressure. It will withstand an 
internal pressure of 2000 psi at room temperature, and 
probably a higher pressure when cold, although this 
has not been tested. The copper filling tube is used 
to insure isothermal conditions in the bulk of the liquid] 
in view of the low thermal conductivity.“ A carbon 
resistance thermometer is cast into the block just below 
the coil for purposes of monitoring the temperature as 
close as possible to the liquid. Presumably because of 
strain effects, the calibration curve of this resistor is 
quite different from that of all others. Upon applying 
rf pulses a transient heat pulse can be observed, which 
dies away in less than one second, thus giving an order 
of magnitude for the time required to reach therma, 
equilibrium. The thermal relaxation time of the He’ 
sample, calculated from the thermal diffusivity K/pc and 
the known geometry is ~5 seconds, for 0.2°K<T<4°K. 
Following the lead of Low and Rorschach, we have 
more recently employed cells machined from nylon bar. 
Gas seals are made by machines brass cones, which 
employ the differential thermal contraction to aid the 
seal. These cells survive repeated chilling in liquid 
nitrogen, withstand 5000 psi, and are easier to make. 


C. Pressure System 


Pressures are generated by a Mansfield and Green'® 
dead-weight pressure unit, which is guaranteed to have 
+% accuracy. This unit is used to compress oil, which in 
turn pushes ‘mercury into a steel Toepler-like gas 
compressor. To prevent entrance of mercury into the 
low-temperature region the level is monitored by means 
of a thick-walled glass capillary tube which has been 
tested to 5000 psi, and by an electrical alarm relay. A 
monel capillary, 0.020 0.d. 0.008 i.d., carries the com- 
pressed gas to the low-temperature sample cell. Dead 
volume is minimized by filling the capillary with 
0.008-in. wire. Care is taken to allow no ferromagnetic 
or superconducting materials near the sample. 


D. Gas Handling System 


The He’ gas, of which 200 cc STP was required to fill 
the apparatus with solid at 67 atmos, is stored in a glass 
gas-handling system equipped with a conventional 
glass Toepler pump. Dilute mixtures with He‘ are 
obtained by mixing known pressures of the gases at the 
same volume, and the gas is admitted to the experi- 
mental system through a liquid-helium cooled trap. 


4 Lee, Donnelly, and Fairbank, Bull. Am. Phys. Soc. Ser. IT, 2, 
64 (1957). 
16 Mansfield and Green Company, Cleveland, Ohio. 
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E. Electronics 


Magnet.—The magnet is a small homemade electro- 
magnet with a 3-in. diam yoke and 6-in. diam pole 
pieces. The gradient was obtained by a 33-in. aluminum 
shim at one side of the 2-in. gap. The magnet was 
operated at ~1600 gauss (~6 amperes at 20 volts) 
and was stabilized to ~1 ppm by a transistorized 
current regulator.'* A transistorized quartz-crystal- con- 
trolled nuclear resonance marginal oscillator!’ was used 
with a glycerin sample in a fixed monitor position to 
adjust the field to the desired value. It was customary 
to tune the entire apparatus when cold, admit the He’, 
and to photograph a perfect echo of maximum ampli- 
tude without having previously seen the He* nuclear 
resonance. This ability was necessary since the relaxa- 
tion time of the liquid was several minutes, and one 
had no chance to adjust parameters while searching for 
the He’ echo. 

Spin-echo electronics.—The spin-echo apparatus was 
required to generate appropriate complex trains of 90° 
and 180° pulses with delay times varying from 100 
microseconds to 10 seconds, to give scope trigger pulses 
accurately 500 microseconds before each echo was 
expected, to displace the two echoes (vertically) on the 
display, and to amplify linearly and with good noise 
figure the spin-echo signal itself. 

The rf amplifier which serves to amplify the spin-echo 
signal for presentation on the scope is of conventional 
design, except for a few details shown in Fig. 6 (receiver 
and transmitter). The input stage is of the cascode type 
for good noise figure, and the tuned circuit in the cryo- 
stat has as high a Q as was obtainable with the long 
leads to the sample cell, which must carry the rf with 
little loss but not contribute too much heat leak. We 
have used in the past gold- or copper-plated Constantan 
wire and now use Litz wire, yielding a 0~30. Most of 
the resonant capacitance is at helium temperatures to 
reduce losses. The resonant impedance is chosen to be 
~10000 ohms for best signal-noise ratio. A larger 
impedance would diminish s/n due to “induced grid 
noise,”’ a smaller because of shot noise. The impedance 
is purposely matched for best s/n and not for maximum 
power transfer. The single long RC time constant at 
the receiver input insures that the grid-cathode diode 
of the input stage does not load the ringing of the 
sample coil after the first half-cycle. Two stages of the 
amplifier employ remote cutoff tubes, and the gain can 
be varied from 1.5X10* to 1.510%. Typical 4; echoes 
at 1°K induce a signal of ~0.1 mv in the coil. The 
equivalent noise temperature is 50°K. The rf amplifier is 
used with a resonant output into a Tektronix 535 
oscilloscope set usually at 1 volt/cm. A convenient test 
of receiver gain and stability is provided by coupling 
into the sample coil through a 0.1-mmf condenser a 


16 R. L. Garwin, Rev. Sci. Instr. 29, 223 (1958). 
17 Garwin, Patlach, and Reich, Rev. Sci. Instr. 30, 79 (1959). 
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Fic. 6. Schematic of spin echo transmitter and receiver. The energy storage capacitors shown in the transmitter have been chosen to 
make the plate voltage approximately equal for both 90 and 180. Filament, screen, and plate decoupling networks to avoid oscillation 
are of conventional design and are not shown in the receiver. The scope matching network is a simple 7, chosen to match the low line 


impedance to the high scope impedance. It has a gain of 10. 


few-volt step function from a mercury-switch pulse 
generator.!8 

The transmitter consists of a pair of miniature hydro- 
gen thyratrons VC 1258."° each of which discharges its 
charging condenser into the tuned circuit. The pulse 
amplitudes for 90° and 180° are adjusted independently 
by varying the high-voltage to the thyratrons. Since 
the receiver input noise voltage is about a microvolt, 
one must be very careful about noise from the thyra- 
trons while they are deionizing and from the other 
circuit elements which have just been subjected to a 
two-kilovolt pulse. The thyratrons are noiseless after 
a few microseconds, but one must use for the dc- 
blocking condenser a 7.5-kv ceramic condenser to avoid 
large noise pulses from the condenser itself. The actual 
pulse coupling condensers have dc voltage for only a 
few microseconds and contribute no noise pulses after 
50 microseconds. Note that we vary here the magnitude 
of H,; and not its duration, contrary to the usual spin- 
echo practice. The exponentially decaying rf pulse is 
just as effective as the more usual square pulse, having 
a Fourier spectrum [1+7?(w—wo)?}" rather than 
[r(w—wo) |"! sinr(w—wo) where 7 is the exponential 
damping time or the pulse half-width. It should be 
noted that the transmitter is incoherent, which causes 
no trouble with the proper echoes to be measured. 

Pulse timing and sequencing.—Four delays suffice to 
make the measurements of diffusion and relaxation 


18 R, L. Garwin, Rev. Sci. Instr. 21, 903 (1950). 
19 Chatham Electronics, Newark, New Jersey. 


considered here. For the 7, and D measurements they 
are shown in Fig. 7(a), together with a block diagram 
of the pulse generators [ Fig. 7(b) ]. The corresponding 
diagram for 7, is shown in Fig. 8. The various times are: 

fo—a fixed delay, usually about 0.4 msec to obtain a 
first echo not much damped by diffusion or relaxation. 

ty—a delay of 0.5 to 1.0 msec, which serves to trigger 
the scope twice at a time f, before each echo. This 
enables the echoes to be displayed in full detail, as in 
Fig. 12(a). 
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Fic. 7. (a) Timing scheme, and (b) block diagram to produce 
90-180-180 sequence. to=0.4 msec, #2=0.5 msec, ¢; variable from 
0.5 msec to 10 sec. fz is chosen to center the echo on the oscilloscope 
sweep. 
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{,;—-variable in 7, or D measurements from 0.5 msec 
to 10 sec to allow transverse relaxation or diffusion. 
In 7; measurements /; is about 2.0 msec to obtain an 
echo not much damped by diffusion or relaxation but 
at a delay different from /o. This avoids spurious super- 
positions in case pulses are not exactly 90° or 180°.!! 

t;—variable from 1.0 msec to 10 sec. Used in 7, 
measurement of pure solid He’ to allow partial recovery 
of equilibrium magnetization. 

The rather peculiar repeated time intervals were 
chosen so that we might use Tektronix series 160 time 
interval generators rather than digital counters which 
were not at hand at the time. Since the second echo 
in the 7, or D measurement is well off the scope if the 
second /;+/2 precession interval is 0.1% different from 
the first, the only convenient way of obtaining this 
equality is to trigger the same pulse generators twice. 
The system used was very convenient since for JT; or D 
measurements only ¢; needed to be varied, leaving (0, 
‘2, and the scope sweep speed unchanged. Similarly 
for 7; measurement only /; was varied. The distribution 
of triggers to the various delay generators, rf pulse 
generators, scope trigger, scope beam shift, etc. was 
accomplished by a simple thyratron ring driving a 
diode matrix pulse distributor. A transistor ring, glow 
counter or core counter would have done as well, but 
would have required amplification in order to trigger 
the rf pulse thyratrons and delay generators. Four 
conventional blocking oscillators (not shown) were 
available for producing sharp trigger pulses from the 
slow rise at the end of long delays from the Model 162 
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Fic. 8. (a) Timing 
scheme, and (b) block 
diagram to produce 90- 
180-90-180 sequence. fo 
=0.4 msec, #2=0.5 msec, 
t;=0.6 msec. #; may be 
as large as 10 seconds. 


90 


pulse generators. The only time which enters sensitively 
into the measurements is that between the two echoes; 
and rather than calibrate the delay generators each 
time, we have made use of a digital counter” which 
measures the time interval between the two oscilloscope 
sweeps to an accuracy better than 0.01%. The interval 
to be measured is simply obtained by connecting to the 
“‘gate-output” terminal on the scope and is thus exactly 
the time / in Eq. (1) or (2). 


IV. TECHNIQUE 
A. Cryogenic 


Before making a run, the He‘—N» and No-case 
vacuum spaces were evacuated with a diffusion pump 
and liquid N2 admitted to the Nz reservoir which was 
thereafter maintained full by an automatic Ne level 
control. Nz was admitted to the He‘—N» and He 
to the He*—He‘* spaces to a pressure ~ 100 wu, which 
brought the inner Dewar to N2 temperature in a few 
hours, after which the He‘— N2 space was pumped with 
the diffusion pump. In ~30 minutes the He* transfer 
was begun using ~} psi pressure on the He* storage 
Dewar. About 4 liters of He‘ is expended in collecting 
1.5 liters of He* in the experimental Dewar, sufficient 
for more than ten hours work. Refills are made with 
~1.7 liters each. The apparatus warms to about 20°K 
overnight, and thus succeeding runs are very economical. 
A few minutes after filling the sample cell is at 4.2°K, 

” Type 524-B, Hewlett-Packard Company, Palo Alto, Cali- 
fornia. 





SELF DIFFUSION AND 
at which time the vapor pressure—carbon resistor 
calibrating run is made. If temperatures <1.1°K are to 
be used, the He*— Het‘ guard space is pumped with the 
diffusion pump for ~ 20 mean pressure time constants 
as measured by an ionization gauge attached to the 
diffusion pump. The guard vacuum is isolated from 
the pumps and the system brought by the He’ refri- 
erator to the desired temperature. At this time the 
sample He* is transferred into the pressure system 
through a He‘-cooled capillary. Normally the sample 
He’ is kept in the high-pressure system. The sample He*® 
is now ready to be condensed and to be compressed 
by the dead-weight gauge tester. 


B. Electronic 


Before transferring helium the magnet current regu- 
lator'® is switched on and power applied. Within a few 
seconds the field is stable as seen by the crystal-con- 
trolled proton resonance monitor!’ (which has a press- 
to-measure switch and normally draws no current from 
its battery but displays the proton resonance instantly 
upon depressing the switch). The electronics is turned 
on, the pulse train is monitored to determine that no 
tubes have burned out, the receiver input (the sample 
coil) is tuned by the injection of a small crystal-con- 
trolled 5.224 Mc/sec signal which is maximized on the 
scope by adjustment of the receiver input tuning con- 
denser. If modifications have been made to the resonant 
circuit, it is necessary to adjust the 90° and 180° pulse 
amplitude, which can be done a priori by comparison 
with the calibrator response; but it is performed most 
conveniently by cooling the apparatus and raising the 
He’ pressure to produce solid He’, on which the effects 
of pulse amplitude may be observed visually since the 
solid spin relaxation time is a few tenths of a second 
rather than the 100 seconds liquid relaxation time. A 
proper setting of 90° and 180° amplitude may be made 
in the usual way—absence of induction tail on 180° 
pulse, the absence of induction tail following the second 
90° pulse of a pair or, best of all the absence of extra 
echoes due to 3 arbitrary rf pulses.”*! Amplifier re- 
covery time prevents observation of these tails, so a 
further “180°” pulse is used to convert the tail into an 
echo at a time not subject to interference. Figure 9(a) 
shows the nuclear spin-echo response for proper tuning 
and pulse amplitudes, and 9(b) for wrong pulse ampli- 
tudes. Incorrect pulse amplitudes cause no error in the 
measurements, since one measures at all times only the 
incremental attenuation due to additional delay. 

Some of the spurious echoes are of the so-called 
“stimulated” type?! which decay not in time 72 
but in time 7, and so can persist several minutes to 


*! For a discussion of the echoes due to 3 rf pulses of arbitrary 
amplitudes, see for example A. Lésche, Kerninduktion (VEB 
Deutscher Verlag der Wissenschaften, Berlin, 1957), Chaps. 2 
and 3. 
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Fic. 9, (a) Photograph of spin echoes resulting from 90—180—180 
sequence. The second echo is not much damped by diffusion. 
(b) Photograph of spin echoes from same pulse sequence as in (a) 
but with 180 pulse amplitude reduced by half. Notice the diminu- 
tion of echo amplitude, as well as the appearance of three extra 
echoes. Sweep speed in both pictures is 1 msec/cm. 


cause interference with the echo amplitude on the 
measurement of the next point. We avoid such diffi- 
culties by choosing ¢o different from /;. These unwanted 
echoes can also be eliminated by a small pulse of current 
in a wire near the sample coil.!! Since our successive rf 
pulses are incoherent, an echo which is the sum of signals 
from two or more echoes (“mirror” plus “stimulated” 
for instance) will have an amplitude which fluctuates 
on successive measurements, and such falsified echoes 
may be identified in this manner. The analysis of super- 
posed echoes is complicated, and we avoid such 
circumstances. 

In measuring 7, there-appears to be no substitute for 
having ¢ comparable to 7; if the equilibrium signal 
amplitude is also unknown, but in measuring D or 7%, 
one can avoid waiting between pictures for times long 
compared to 7; (~5-10 minutes) by saturating the spin 
sample with several (~10) 90° pulses after a measure- 
ment, then waiting a standard time (~1 minute) 
before taking the next point on the h2/f; vs ¢ curve. 
This avoids the superposition of false (stimulated) 
echoes from the previous measurement. In the measure- 
ment of 7), which can be as large as 900 seconds in 
dilute solutions, the following procedure was used to 
gain a maximum amount of information per picture, 
using the 90-180-180 scheme. After saturating the spins, 
various times, such as 60, 120, 240, --- seconds were 
allowed to elapse. The pulse sequence 90-180-180 was 
then applied, with differing times between the 180 
pulses for each picture. The echoes 4; hence give us 
information regarding 7, while the ratio /2/h, enables 
D to be determined from a single picture. 
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C. Recording 


The two traces containing the two echoes of amplitude 
h, and hg are recorded (superimposed or displaced 
vertically as desired) on Polaroid film, 3 measurements 
to a film. The pictures are processed, inspected for the 
relevant quantities—order of magnitude of 7; and of 
the time ¢ at which the echo begins to die, in order to 
obtain further points in the more significant regions 
of the decay curve. The points are spaced uniformly 
in #, out to values of ¢ such that Ae is approximately 
equal to the noise. This is done instead of taking 
several points clustered about the value of ¢ for which 
h2/hy~0.5 because of the necessity for determining an 
additional parameter of the decay curve (the zero inter- 
cept). This is usually less than 1, because of improper 
pulse size. Pulses less in amplitude than “180°” are 
sometimes intentionally used to avoid gas discharges 
which may appear at low pressures. A broad spacing 
of points in & also permits the calculation of TJ, and 
gives evidence of the absence of convective spin 
damping. 

The photographs are then measured, using a Wilder 
Microprojector equipped with a micrometer stage. 
The echo amplitudes, less baseline and noise, and the 
resulting double amplitudes (in mils) are entered in 
the data book. 


D. Data Reduction 


An IBM-650 computer was used with a general 
purpose iterative least-squares program’ to perform 
all the computation associated with the experiment. 
This is done not so much because of the additional 
accuracy attained by the least-squares method with 
more data points than parameters, but mainly because 
we have in this way a routine method for finding 
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2 R. L. Garwin and H. A. Reich, Mathematics Tables and Other 
Aids to Computation (to be published). 
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experimental or computational blunders. To this end, 
we fit for Eq. (1) a function 


yo™=hy"A exp(—/m/T2—7°G?Dtn®/12), (3) 


with m referring to the mth data point at that tempera- 
ture and pressure. A should be unity for rf pulses 
exactly 90° and 180° (Fig. 10) and usually is computed 
to be ~ 0.90. T2, A, and D are, however, left free to be 
adjusted by the 650 for minimum > m(/2"— 2"). More 
than 90% of the runs have > m(h2"—y2™)?< 40 (m—2) 
mil’, the rest having }> 6 very much larger due to 
blunders and errors. Since the echo amplitude is usually 
~ 800 mils it is evident that a measurement accuracy, 
linearity, noise-to-signal, etc. much better than 1% is 
routinely achieved. A similar computation is carried 
out for Eq. (2), fitting now 


yo" = hy 1 —B exp(—t/T)) ], (4) 


with B~1, and B and 7, free. Of course, the damping 
due to D may be negligible compared to that due to T2, 
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Fic. 11. The line is the theoretical echo shape shown in the 
equation. The circles are experimental points derived from a 
typical picture, adjusted to fit the curve by choice of the field 
gradient G, which is thereby determined. 


but the least-squares solution gives also the errors in 
the parameters. 

The same program with a different subroutine is used 
to determine the best calibrating function for the 
resistance thermometry. 


E. Echo Shape 


The shape of the spin echo signal is that of two 
“tails”, back-to-back. The detailed behavior in time of 
the decay of a “tail” is given by CP Eq. (4). For our 
case, where we have a cylindrical portion excluded 
from a spherical sample, we can evaluate the contribu- 
tions from the cylindrical and spherical parts separately, 
and then subtract, taking into account the fractional 
volume of the cylinder. The contribution due to the 
sphere is M,(t) = M¢'[sinx/x*—cosx/x? ], where x=yGrt 
and r is the radius of the sphere. Mo is the total spin 
magnetization. The contribution from the cylinder is 
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M.(t)=MoL2J1(y)/y], where y=yGr't and r’ is the 
radius of the cylinder. Combining these two equations, 
and taking into account the fractional volume of the 
cylinder in the approximation r’<r, we obtain 


M(t) 1 (= =) | 


My, 1-—$a? es ax 


where a=r’/r. The factor in front of the square brackets 
normalizes M/Mp to unity at «=0. A plot of Eq. (5) is 
given by the solid line of Fig. 11. The circles are experi- 
mental points. In plotting the experimental points, 
correction has been made for the spot width of the 
oscilloscope trace. The only adjustable parameter here 


Fic. 12. (a) Photograph of the two echoes superimposed, in the 
90-180-180 scheme, showing decrease in amplitude of second 
echo. (b) Shape of echo produced by sample cell filled partially 
by liquid He’, partially by solid. The liquid is saturated in this 
case by repeating the 90-180-180 sequence several times a second. 
If we then wait until the liquid has recovered its full moment 
(5 min) the first echo thereafter has the shape shown in (a), 
although the sample is partly liquid and partly solid. Sweep speed 
in both pictures 100 usec/cm. 


is the gradient, G, which is determined by the require- 
ment that yGrt have the value 3.73 at the observed 
time of the minimum. G thus has the value for this run 
of 4.12 gauss/cm, which agrees well with that obtained 
by direct measurement of the field using a proton reso- 
ance probe. The agreement between theoretical and 
experimental line shapes is considered satisfactory, 
especially in view of the following: I. The gradient has 
been assumed to be uniform over the volume of the 
sample. The gradient actually varies from point to 
point in the magnet, ranging from 3.0 to 4.2 gauss/cm, 
depending on the particular position the sample cell 
occupies. The gradient is redetermined upon each 
reassembly or other mechanical change in the appa- 
ratus. II. The ratio a=r2/r; is 0.340 for the particular 
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Fic. 13. Diffusion coefficient D vs temperature for various 
pressures in pure liquid He’. The curves are obtained by plotting 
logD vs T at constant density, using the extensive tables of 
R. H. Sherman and F. J. Edeskuty (private communication). The 
points are systematically higher than the curves at the lowest 
temperatures, but the data are fitted to within experimental 
accuracy by the equation shown between ~1.5°K and 4°K. No 
theoretical significance is to be attached to the form of the 
equation. 


sample cell used, and hence the neglected volume is not 
inappreciable. III. The rf magnetic field (H,) is not 
exactly uniform over the volume of the sample, since 
the filling tube ends just past the edge of the sphere. 
The measurements of D, 7), and T; do not depend on 
the echo shape. 

The echo shape is the same [Fig. 12(a) ] when the 
cell is filled with solid as it is with liquid, but a cell 
partially filled with solid He* can be made to yield the 
different shape of Fig. 12(b), as follows: One saturates 
the liquid (7,~ 100 sec) and then looks at the echo due 
to the solid (7:~0.3 sec). This echo shape is the 
Fourier transform of a different spin population n(w), 
since the solid is presumed to be in the form of a rather 
thin shell. It will be seen also that the echo shape shows 
that the spin relaxation is a bulk phenomenon. 

Figure 10 shows the decay of h2/h; predominantly 
due to diffusion damping as expressed by Eq. (3). The 
good fit is typical of our liquid results. The same 
gradient in the solid gives damping due to diffusion 
only at much longer times, at which the 7; decay is 
already appreciable. 


V. RESULTS AND DISCUSSION 
A. Pure He? Liquid 


The plots of D as a function of specific volume at 
constant temperature 7, and of D and 7; as a function 
of temperature at constant pressure, are shown in 
Figs. 13-18 for pure He’ and for He’ in dilute solution 
in He‘. There has been some question whether the spin 
relaxation might be a surface effect, due to oxygen gas 
condensed onto the surface of the sample cavity. That 
this cannot be so is shown by the fact that the echo 
shape is preserved even at short times after saturation, 
for instance, at 0.017), when observed with scope sen- 
sitivity increased one hundred times, whereas if the 
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Fic. 14. Specific volume vs diffusion coefficient ; 
logV vs logD. 


relaxation occurred at the surface, the short-time echo 
would be more wiggly (shell-like). One might say then 
that the wall relaxation was weak, requiring many 
cross-sample diffusings of a He* atom for it to be relaxed 
—that this is not true is easily shown by observing that 
with the measured D we can compute the time required 
for diffusion of an atom to the surface and find it con- 
siderably longer for pure He® than the measured 7), 
indicating that most of the atoms cannot reach the 
surface even once in a relaxation time, a forteriori not 
the many times required to have surface relaxation 
masquerade as a homogeneous relaxation under the 
vigorous test of inspection of the shape of the 1% 
recovered echo. The decay time for the lowest space 
mode in a sphere whose surface is black for spin relaxa- 
tion is, of course, r= R?/x’D with R the sphere radius 
and D the diffusion coefficient. For our geometry the 
ratio of the decay time for the lowest mode to that for 
a sphere was determined by measuring the cooling 
curve for a billiard ball immersed in running water, 
then the cooling curve for the same sphere with a hole 
bored through it. For our geometry we find, then, 
r= R*/30D=3.3X 10-*/D seconds. 
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Fic. 15. Relaxation time 7; in pure liquid He* vs temperature, 
pressure as a parameter. 
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Thus for D<3X10~- in Fig. 13, relaxation by the 
walls cannot be faster than 110 seconds. For such D’s 
the 7;’s are all smaller than this value, and we see that 
our nuclear spin relaxation is at least in part a bulk 
phenomenon. On the other hand, Careri® has found 
that surface relaxation may be induced by packing a 
sample cell with glass fibers, in the same temperature 
and pressure range that we operate, although at a 
different magnetic field. The quality of our relaxation 
data is markedly inferior to that of the diffusion data, 
presumably because of the accidental presence of 
oxygen, which has a well-known strong paramagnetic 
relaxation effect. 

From Fig. 13 it is apparent that the diffusion of 
atoms in He’ liquid is not a thermally-activated process 
—in fact all the curves are very flat below 1.5°K and 
extrapolate to 0°K at finite (and large) diffusion coeffi- 
cients. Thus the diffusion process is not that of an 
ordinary liquid in which the probability of occurrence 
on a single atom of energy sufficient to surmount the 
average potential barrier between neighbors (and also 
the probability of occurrence of low barrier heights) 
goes like exp(—AE/kt) where AE is the “activation 
energy” for diffusion. Nor is the diffusion like that of 
a gas, for which the diffusion coefficient (D=)i/3) 
according to kinetic theory goes like 7}, the independent 
molecules scattering each other with a cross section 
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Fic. 16. LogD vs 1/T for 1% and 2% solutions of He? in Het. 
The data for 2 atmos and 2% mixture are not shown, for clarity, 


but they can be fitted by the same form (see Fig. 17). Values of 
A, B, and A are given in the text. 


*3 Careri, Modena and Santini, Nuovo cimento (to be pub- 
lished). 
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more or less independent of energy while the mean 
velocity of the molecules is 7?. On the other hand, the 
diffusion of He* can be qualitatively understood in 
terms of the zero-point energy of the individual particles 
and the tunneling of the atoms through the potential 
barriers. These are single-particle approximations and 
surely do not allow an accurate picture of the diffusion 
mechanism, but the accurate experimental results 
should provide a test of the validity of the more 
sophisticated theories, allowing an interpretation at 
low temperatures as a scattering of the atoms near the 
Fermi surface by the few thermally excited holes and 
He’ atoms. 

The classic treatment (6) of spin relaxation gives 
approximately the following results which are essential 
to an understanding of our experimental relaxation 
times: 


2r 
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5 (6) 
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where vy is the gyromagnetic ratio, b the average inter- 
spin distance, and 7, the “correlation time” of the 
motion. wo is the precession frequency in the field 
Ho, (wo=yHo). This formula is obtained by calculating 
the amplitude of Fourier component of the perturbing 
field at wo. For wor-<K1 this yields approximately 


1/7, =y'h?b-*r = 1/T». (7) 
For wr.>1 we have from (6) 


1/T=y'h°b-*(1/2u0?r.), 
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Fic. 17. LogD vs logT for various concentrations and pressures, 
showing the 1/T behavior at low temperatures, where the diffusion 
is limited by He*-He’ scattering. 
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Fic. 18. Relaxation time 7, for various concentrations of He’ in 
He‘ at the same pressures as in Fig. 17. 


and 
4 T2=0.5y7hb c= 5 (2wo?r-/T1) oF (9) 


The authors of reference 6 have also specialized 
their treatment to the case of liquids with diffusion 
coefficient D, in which case for atoms at distance r, 


r-(r)=r2/12D (10) 


and the average over all r from the atom size, a, to 
infinity gives 

1/T,=y'hNo/aD, (11) 
with No the number of magnetic moments per unit 
volume, for the usual case with wor-1. In our case, of 
course, we always have the de Broglie wavelength, \, 
much larger than the radius of the atom, so that (11) 
becomes 1/7, =y*h?No/AD. 

Since we have in the present experiment measured D 
directly, we can calculate the order of magnitude for 
T; from (11) in the case that D is high enough for 
wor <K1. If we take for He* at reasonable pressure 
No=2X10” cm and D=2X10-' cm? sec then we 
calculate (with a=3X10-% cm), 7;~200 seconds, in 
reasonably good agreement with the measured value. 
A comparison of Fig. 15 with Fig. 13 shows that the 
diffusion coefficient is a much stronger function of 
temperature than is the relaxation time 7, and in fact 
varies with temperature in the opposite direction, 
although the discrepancy is not serious since the value 
of (a) in (11) also changes with temperature. The 
qualitative agreement of the computed value for 7; 
with the measured value must be regarded as satis- 
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factory and (10) indicates that r-¢ 10~" sec so that at 
our wo=3X10" we certainly have wor<K1. Thus one 
expects from (7) that 7x7, and both should be 
independent of magnetic field. 


B. Pure He’ Solid 


We shall present our results on the solid in a later 
publication, but it should be noted that the diffusion 
coefficient is 10~* that of the liquid at the melting 
pressure even at high temperatures and decreases 
rapidly with decreasing temperature, leading to 
shortened and presumably field-dependent 7; at the 
lower temperatures. 


C. 2% He’ in He‘ Liquid 


Figures 16 and 17 show the rapid rise in diffusion 
coefficient for 2% He’® in He‘ with decreasing tempera- 
ture below the A point. This lends credence to our 
regarding the He‘ at low temperatures as analogous to 
the vacuum of quantum electrodynamics, its function 
being only to dissolve He’ and to allow us to work with 
a He* gas of variable density and thus of variable 
degeneracy temperature. There are, of course, excita- 
tions possible in the He‘ liquid and in Fig. 16 we see 
the freezing out of one type of these excitations (the 
rotons) with an activation energy near 13.5°K. In 
sufficiently dilute solutions the He’® diffusion at low 
temperature would be limited by He*-phonon scatter- 
ing, predicted® to give D« T-*, but the D« (1/T) region 
has much too small a D to be caused by phonon scatter- 
ing and must be ascribed to a He*— He’ transport cross 
section at 0.5°K of ~3X10-' cm’, the corresponding 
transport mean free path being about 6 He*’— He’ 
distances in the mixture. A temperature-independent 
scattering cross section in a nondegenerate gas would 
correspond to D« T! as computed also by Abrikosov and 
Khalatnikov® and Zharkov and Khalatnikov,™ but the 
degeneracy temperature of an ideal Fermi gas of 
number density corresponding to 2% He’ in He‘ and of 
mass 3 is 0.6°K, so that we may very easily be seeing 
here the diffusion coefficient of a partially degenerate 
gas, which point will be resolved by measurements at 
lower temperatures to be obtained with adiabatic de- 
magnetization. At lower temperatures the diffusion 
coefficient is expected to rise as the longer He* wave- 
length averages over more He’ scatterers. 

It is not clear from Fig. 16 whether D is continuous at 
T=T) or not. This is of some interest and will be 
pursued further. Our value for D at 1.5°K is about a 
factor 10 below that of Beenakker,”* but this is success- 
fully accounted for if we extrapolate our results to 
atmospheric pressure. Our activation energy for ro- 

*“V.N. Zharkov and I. M. Khalatnikov, Doklady Akad. 
Nauk S.S.S.R. 93, 1007 (1953); I. M. Khalatnikov and V. N. 
Zharkov, J. Exptl. Theoret. Phys. U.S.S.R. 32, 1108 (1957) 
[translation: Soviet Phys. JETP $ 905 (1958) ]. 

26 Beenakker, Taconis, Lynton, Dokoupil, and Van Soest, 
Physica 18, 433 (1952). 


L. GARWIN AND H. A. 


REICH 


tons is 50% higher than the value usually obtained 
from thermal data, while Meyer and Reif®® find 8.3°K 
for the activation energy of scatterers of positive ions 
in He II. Their data are of accuracy comparable to ours 
and the cause of the discrepancy is not known to us. 
At 1.5°K our D for 2% He? is 2X 10 cm?/sec while the 
Einstein relation D=(kT/e)u gives for Meyer and 
Reif’s positive ions D=0.5X 10~ cm?/sec, a factor four 
smaller than our value. This large scatter cross section 
is perhaps an indication of the large size of the ion 
complex as compared with the small size of the He’. 
If one plots the Khalatnikov kinetic coefficients B and 
C*’ derived by Bendt et al.28 by dividing measured values 
of viscosity and thermal conductivity by integrals over 
the experimental excitation spectrum of He II, one 
finds “activation energies” of 12.1° and 13.5°K for the 
mean excitation scattering involved in viscosity and 
thermal conductivity respectively. There is thus im- 
plied both by our He*-roton results and by the viscosity 
and thermal conductivity measurements a consistent 
but large variation of roton scattering cross section 
with temperature. 

The relaxation data of Fig. 18 should in principle be 
computable from the diffusion coefficient of Figs. 16 
and 17 by the use of Eqs. (7), (8), (9), and (11). Taking 
an independent-particle diffusion model we can calcu- 
late from (10) the correlation time as at most 5X 10-! 
sec near 7), decreasing rapidly below 7). Then (11) 
should give 7; « D whereas a comparison of Figs. 18 and 
16 shows 7; decreasing rapidly as D increases. This be- 
havior would be expected if the correlation time de- 
creased as expected with increased diffusion coefficient, 
but the absolute magnitude of correlation time were 
very long just below 7), so that wor.>1. Such a case 
can be reconciled with the large diffusion coefficient 
only by assuming clustering of He*® atoms. 7, would 
then vary inversely as magnetic field, and 7; from (9) 
would necessarily be much less than (71/w)', ie. 
T2<5X10-* sec, whereas T2 is observed to be much 
longer than 0.2 second. Thus there is some question as 
to the cause of the spin relaxation in the roton range 
in dilute solutions of He* in He’. 

Since at 0.5°K the time required for the decay of the 
lowest space mode of the saturated spin system would 
be ~1 second if the walls were perfectly black, the 
observed relaxation thus would be obtained if the wall 
uniformly required ~ 108 collisions to relax a He’ spin 
(see Appendix A). Our present signal-to-noise ratio is 
not sufficient to allow us to discriminate against wall 
relaxation which has been demonstrated for certain 
surfaces by Fairbank” and Careri. Akhiezer and 
Aleksin® compute the spin relaxation time 7; for dilute 


261. Meyer and F. Reif, Phys. Rev. 110, 279 (1958). 

277, M. Khalatnikov, Uspekhi. Fiz. Nauk 59, 673 (1956). 

28 Bendt, Cowan, and Yarnell, Phys. Rev. 113, 1386 (1959). 

2 R. H. Romer and W. M. Fairbank, Bull. Am. Phys. Soc. 4, 
150 (1959). 

% A. Akhiezer and V. Aleksin, Doklady Akad. Nauk S.S.S.R. 
92, 259 (1953) (National Science Foundation NSF-tr-183). 
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solutions of He* in He‘ and obtain a relaxation time 
T,« T-4 well above the degeneracy temperature and 
T,«T~ for the fully degenerate solution. Their high- 
temperature computation is certainly wrong since they 
considered the He’ as an ideal gas, whereas the relaxa- 
tion rate is very strongly enhanced by the frequent 
collisions with rotons which serve to increase greatly 
the correlation time. Their calculation gives for the 
fully degenerate gas at 0.5°K, 7,;=16 hours, whereas 
we find T7\-~30 sec. 

We have also measured D in 1% He’ in Het at 19 
atmos and in 2% He’ in He‘ at 2 atmos to see whether 
(a) the D«1/T behavior at low temperatures is 
He’— He’ scattering and (b) whether the 13.5° ‘“‘activa- 
tion energy” is concentration dependent. The diffusion 
data may be fitted with an equation of the form 
D“=AT+[B exp(4/T)}". For the 2% mixture at 
19 atmos, A=719, B=4.17X 10-8, and A=13.7°K. For 
the 1% mixture at the same pressure, A=382, 
B=4.5X 10-8, and A= 12.9°K, thus demonstrating that 
the 1/T behavior below 1°K is indeed He*— He’ scatter- 
ing in the He‘ “vacuum.” The 2-atmos curve for 2% 
He’ is similar to that at 19 atmos, except for the 
pressure shift in the A point. The difference between 
the A’s for the 1% and 2% mixtures is not regarded as 
significant. 

VI. WORK IN PROGRESS 


Measurements are being made on the susceptibility 
of dilute solutions and on the solid, on the diffusion 
coefficient and relaxation times 7, and JT, to 0.01°K, 
and on the field dependence of 7}, especially in the 
solid. With the onset of complete degeneracy in the 
solid and in dilute solutions, and with the disappearance 
of excitations in the pure liquid, interesting qualitative 
results should be obtainable. 
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APPENDIX A 


The relationship between macroscopic ‘‘wall relaxa- 
tion” and the microscopic nature of the wall collisions 
is an interesting one. For example, let us take a case in 
which the mean free path of the fluid atoms is large 
compared to the range of magnetic interaction of 
particles with the wall. If the probability of relaxation 
due to paramagnetic adsorbents, etc, in a single 
collision with the wall is e, then we may estimate the 
expected relaxation time 7. Suppose the sample cell to 
be compact, of volume V and surface S. Let there be n 
atoms per cc with thermal velocity v and mean-free- 
path A. Then there will be ~v/4 atoms per second 


NUCLEAR RELAXATION IN 


He? 





Fic. 19. r vs eshow- 
ing relaxation limited 
by (A) wall collisions 
and (B) by diffusion. 


T, Seconds 











incident on each cm? of wall; i.e., nvS/4 wall collisions 
per unit time. If the wall relaxation is very weak—i.e., 
¢— 0, then this means that in a completely saturated 
sample nvSe/4 spins are relaxed per second, giving 


nV 1 4V 
cnieenataiecman (A) 
nvSe/4 ev S 


This can only be true, however, if the 7 so calculated 
is long compared with the decay time rp for the lowest 
space mode of the diffusion equation [rp~1/D(V/S)*]. 
For €>€crit, Where €crit=4S5D/V», the relaxation time 
r will become constant at rrp. The relaxation will be 
slightly nonexponential, most strikingly in that the 
relaxation rate at late times will extrapolate to a 
nonzero moment at zero time. Thus the relation be- 
tween 7 and e is that shown in Fig. 19, which has 
been calculated for V=1 cm’, S=3 cm?, v= 10 cm/sec, 
\=3X1077 cm. 

In case one finds experimentally that r=rp, one can 
distinguish between wall relaxation and bulk relaxation 
because a perfectly “black wall” evidently will produce 
an initial rate of change of total magnetization propor- 
tional to ¢-4, and a total recovered magnetization which 
is proportional to /*, as is characteristic of heated bodies 
subjected to sudden jumps in surface temperature. 
With e=1 this behavior will indeed be observed, but 
in cases where 1>¢€>10~°, all of which give r=7p, the 
evidence would not be so clear. In this case the rate of 
flow of relaxation into the sample is 


where R is the fraction of total magnetic moment re- 
covered, averaged over the volume, and r is the local 
value at the surface. For e<1, r=0 at ‘=0 and we have 


: envS 
R(0)=— 
4nV 
Thus 
' ; wS 4V\? Vv € 
7R(0)~rpR(0)~— ( )~ sii 
4VD‘ S ADS écrit 


(B) 


and we see that the initial rate of recovery of a sample 
relaxed by gray walls is larger by a factor €/écrit than 
that which corresponds to exponential recovery with 
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Fic, 20. Diagram showing the departure from 
exponential recovery, exaggerated. 


7r=rTp. Indeed, for €/€-ri:>>1, this is such a large relaxa- 
tion rate at the wall that the diffusion away of the 
relaxed atoms cannot remove all this relaxation and 
r— 1 very shortly. In fact, in time @ all atoms to a 
depth d, where d’=@6D, are affected reasonably uni- 
formly, since they are mixed by diffusion. In time @ 
then, relaxation R=¢6/éritrp has diffused in (for 
1>€> écrit). The initial high relaxation rate thus con- 
tinues until r~1, i.e., until 


(€/€crit) (8/7) 1 
TT 








r= 


The high relaxation rate continues until 0= 7p (€crit/€)? 


ANID H.-A. 


a 


REICH 


after which time r~1 near the wall and the relaxation 
becomes indistinguishable from that due to a black 
wall. Thus R will behave as in Fig. 20. 

To estimate the possibilities of detecting such uniform 
wall relaxation effects, we note from the above that 
relaxation at the initial rate continues until €¢rit/e€ of 
the sample has been relaxed, after which the recovery 
is slower, like ¢#, until the diffusion front covers a con- 
siderable part of the sample, say to time rp/2, after 
which the recovery is exponential with characteristic 
time rp, but with a considerable recovered moment 
extrapolated to zero time. So we see that for most of 
the range of ¢ for which wall relaxation is independent 
of «, even the details of the relaxation vs time are 
e-independent, and are just about what one would 
expect from a black wall. To distinguish ¢ from unity 
requires investigation of the first (€cri/€)7 seconds of 
the recovery. On the other hand, uniform wall relaxa- 
tion with r= rp has a much greater initial recovery rate 
than the bulk exponential and so is clearly distinguish- 
able from it. 

Of course, the relevance of this simple example is 
much diminished by the formation of “liquid crystals” 
near the wall, and the concomitant increased relaxation 
rate seen even near an inert wall, and by any magnetic 
nonuniformity of the wall itself. However, the analysis 
may be of some general interest. 
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The tight-binding (LCAO) method is a convenient, if qualitative, way of comparing energy bands in the 
zincblende and wurtzite structures. Using this method it is shown that wurtzite states along [—I” (“c’’ 
direction in the zone) can be obtained by perturbing corresponding zincblende states along '—A ([111] 
zone direction). The perturbation is the difference of crystal potential, V’= V(ZB)—V (Wur), which takes 
the zincblende structure into the wurtzite and results in a splitting and shifting of these corresponding states. 
For k perpendicular to these directions the correspondence of k vectors and of states is not so clear, although 
some comparison can still be made. 

The discussion of corresponding zincblende and wurtzite states is helpful in understanding the nature of 
the energy states in mixed crystals: (wurtzite structure [111] twinned on zincblende), and in faulted 
crystals: [111] stacking faults. We can show that barriers (discontinuities in energy surfaces) exist for 
electron propagation (current) parallel to the “‘c’’ axis of a twinned or faulted crystal due to two effects: a 
symmetry effect and a polar effect. The second effect is simply illustrated for rotation-twinned zincblende. 
For randomly faulted crystals, band gaps depend on a, the probability of faulting. Quantitative theories of 





these effects remain to be developed. 


1. INTRODUCTION 


HE discussion to be presented in this paper is an 

outgrowth of work specifically connected with 
the cellular calculation of the band structure of ZnS 
in zincblende (ZB) and wurtzite (Wur) structures. 
Although the writer’s primary interest is in the band 
structure of ZnS, the “simplified LCAO method” to be 
discussed here can equally well be applied to other 
materials which are dimorphic with zincblende and 
wurtzite structures and which have valence and con- 
duction bands “‘s’’- and “‘p”-like, for example SiC, 
CdS, CdSe, etc.! While this method does allow certain 
general conclusions to be drawn about band structures 
in the two crystalline forms it must be emphasized that 
only specific knowledge for each material is reliable, and 
hence no general statements can replace detailed calcu- 
lations and experiment for each particular compound. 
On the other hand, the general conclusions one can 
draw in the LCAO (linear combination of atomic 
orbitals) framework are free of the particular assump- 
tions needed for detailed cellular? or OPW (orthogonal- 
ized plane waves), or PW (plane waves), or variational 
type calculations, such as sphericity of potential, or 
number of terms kept in Fourier expansions, etc., and 
therefore may be helpful as a framework. The LCAO 
method? has an additional advantage for this discussion, 
namely that its formalism enables one to make direct 


* Most of the material of this paper was presented at the 
American Physical Society Meeting, March, 1958, Chicago, 
Illinois [Bull. Am. Phys. Soc. II, 3, 121 (1958) ]. 

1 Although the entire discussion in this paper is (except where 
noted) of general applicability to all compounds with zincblende 
and wurtzite structures, where it is necessary to make an illustra- 
tion, we shall refer to ZnS specifically, as a typical compound (this 
is particularly true in the figures). 

Birman, Phys. Rev. 109, 810 (1958); C. Shakin and J. 
Birman, Phys. Rev. 109, 818 (1958). See also F. Hund and 
B. Mrowka, Ber. Sich. Akad. (Math. Phys. Klasse) 87, 185, 325 
(1935) for an earlier treatment of the zincblende-wurtzite problem 
by a cellular type method. 

3 J. Slater and G. Koster, Phys. Rev. 94, 1498 (1954). 


use of the similarity of first and second neighbor arrange- 
ments in zincblende and wurtzite. Not only can we com- 
pare the (structurally) pure zincblende and wurtzite 
energy states, but also we shall examine the effect on the 
band structure of a particular stacking fault (the rota- 
tion twin in zincblende), and of the existence of [111] 
twin crystals (zincblende regions twinned on wurtzite). 
Again, although particularly aimed at ZnS, the dis- 
cussion will apply as well to other dimorphic 
materials.‘ 


2. COORDINATE SYSTEMS: SYMMETRY ELEMENTS 
IN CRYSTAL SPACE 


It is usually convenient® to describe atomic positions 
in zincblende (space group 7,*) in terms of a Cartesian 
set of axes coinciding with the cubic x, y, z axes; in 
wurtzite (space group Cs‘) hexagonal axes are intro- 
duced in a similarly conventional manner. The unit 
cells on these axes are shown in Fig. 1. However, use of 
these different axis systems in each structure obscures 
similarities long since known to crystallographers.® 
Hence, we define a new set of Cartesian axes in 
zincblende: 


i’ 0 1/v2 —1/v2 j[i 
j’ ae —2//6 1/\/6 1/\/6 j ’ (1) 
k’ 1/v3 1/v3 1, ‘v3 k 


where 1, j, k are the old Cartesian set. With this choice 
of directions k’ or z’ in zincblende is along the cube 
body diagonal ([111] direction) or three fold rotation 
axis 3, and the y axis is on a mirror plane oq. This is 
shown in Fig. 2. The same Cartesian axes will be used 
in wurtzite, and Fig. 2 also shows their orientation with 
respect to the conventional hexagonal axes a), a2. The 

4 Jeffrey, Parry, and Mozzi, J. Chem. Phys. 25, 1024 (1956), 
Table I. 

5 W. Zachariasen, Theory of X-ray Diffraction in Crystals (John 


Wiley & Sons, Inc., New York, 1945), Table 2.6, p. 51. 
6G. Aminoff and A. Broomé, Z. Krist. 80, 355 (1931). 
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Fic. 1. Unit cells of zincblende and wurtzite on conventional 
Cartesian or hexagonal axes. (Sulfur atom is at the origin.) Base 
plane (g=0) projection is shown in the lower half of the figure. 
In this, and the following figures sulfur atom is shown as the 
larger, open circle, zinc as the smaller blackened circle [see 
footnote (1) ]. 


x’y’ plane contains three different types of sites, labeled 
A, B,C, in Fig. 2; only one of these types is occupied 
in any given plane, giving rise to a close packed net 
(i.e., a net such that the nuclei are in the close packed 
position—no implication as to atom/ion sizes is meant). 
In both structures Zn and S close packed nets alternate, 
and in both after one Zn-S double layer another follows, 
in a displaced position. This is illustrated in Fig. 3 in 
terms of a “stacking diagram” or [110] section through 
zincblende and wurtzite. 














"Je 
®ve fio) 

Fic. 2. Base plane projection of the wurtzite structure (only 
sulfur atoms are shown); conventional hexagonal axes are shown 
aS @, a2. Sites A, B, C are indicated; the origin is at C, along 
the six fold screw axis. The x’ direction is along a; (the glide 
plane, m4, contains x’) the y’ direction along the mirror plane, 
oa. The magnitude of vector x’=a, that of vector y’=8 [see 
Eq. (2)]. The twelve second neighbors of sulfur atom (1) are 
the six labeled A, in the plane, and three B above and three below 
the plane. In zincblende the sites C are occupied by S (and Zn) 
atoms, hence the axis along c is no longer 64, and the x’ no longer 
on a m;. The (11-0) planes are perpendicular to the [11-0] direc- 
tion indicated. 


BIRMAN 


In Table I the coordinates of the atoms of the base 
are given with a variety of choices of origin, in terms of 
the conventional zincblende and wurtzite axis systems. 
If a is the distance between neighbors in the close 
packed plane (Fig. 2) it is convenient to measure dis- 
tances along (x’, y’,2’) in units of (a,8,), respec- 
tively, where’ 


a=a/2, B=Vv3a/2, y=}(§)4e=c/2. (2) 


In Table II first and second neighbors of the S atom 
at the origin in zincblende, and of the two S atoms of 
the base in wurtzite, are given in conventional units 
and in units of (a, 8, y). The identity of first neighbor 
shells, and the close similarity of second neighbor 
shells in the two structures, can be obtained from the 
coordinates in Table II. In Fig. 4 a perspective drawing 
of the first and second neighbor arrangements in zinc- 
blende and wurtzite is shown. 
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avce’acrea ac 

Fic. 3. Stacking diagram or [11-0] section through zincblende 
and wurtzite. Note the 3 layer zincblende repeat, and the 2 layer 
wurtzite repeat. The primed sites (A’, etc.) are on (11-0) planes 
displaced from the origin (see Fig. 2). Hence 2 of the 4 Zn neigh- 
bors are on the same (11-0) plane as the S atom indicated, while 
the other 2 are one each in front and behind the plane of the 
drawing. 


It will later be convenient for us to have available 
symmetry elements for the space groups 7,° and Ce‘. 
These are given in Table III, with the operations ex- 
pressed in terms of the primed Cartesian coordinate 
system (1) illustrated in Fig. 2. We shall choose our 
origin of coordinates in zincblende at the site C, which 
is occupied by an S atom, and at the same point in 
wurtzite, which is on the six-fold screw axis, (not 
occupied by any atom). It is of interest to note that the 
site symmetry in zincblende is T7g=43m, while in 
wurtzite it is only C;,=3m; that is, although in the 
ideal wurtzite structure each S is tetrahedrally sur- 
rounded by 4 first neighbor Zn, the four are not equiva- 
lent. The symmetry elements given in Table III are 
needed in formally reducing the £ integrals which arise 
in the LCAO® method and in specifying G(k), the group 


7M. Miasek, Phys. Rev. 107, 92 (1957). 
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TABLE I. Coordinates of the base atoms in zincblende (ZB) and wurtzite (Wur). Column (a): coordinates in terms of the usual 
Cartesian axes [see J. Birman, Phys. Rev. 109, 810 (1958), Table I]; (6), (d), (/), (4): Coordinates in primed Cartesian axes (a, 8, y) 
[see text Eqs. (1) and (2)]. We have used w=3/8 here; (c), (e), (g): C oordinates in conventional hexagonal axes. Note atoms labelled 
® ® @ @ and see Figs. 2 and 3. Columns (a) and (6) are for ZB, Columns (c), (d), (e), (4), (g), (4) for Wur. 





Struc- 
ture 


ZB 
A, B, or C 


Wur* 
B 


(b) (c) (d) 


S @ (0,0,0) (0,0, 0) (0, 0,0) (0,0, 0) 

@® (2/3, 1/3, 1/2) (1, 1/3, 1) 
Zn@ (1/2,1/2,1/2) (0,0,3/4) (0,0, u) (0, 0, 3/4) 

( (2/3, 1/3, u+1/2) (1, 1/3, 7/4) 


Origin at (a) 


® u =3/8 in the “‘ideal’’ wurtzite structure. 


of the wave vector at various points of interest in the 
Brillouin zones. 


3. ZONES AND CORRESPONDING k VECTORS 


To facilitate the comparison between zincblende and 
wurtzite zones and k vectors, it is now necessary to 
take into account the difference in base (2 atoms in 
zincblende, 4 in wurtzite): Table I. We introduce the 
Jones or energy zone*® for the two structures and sup- 
pose that we deal with a zincblende crystal, and a 
wurtzite crystal (grundgebiet)® each containing the 
same number of atoms, and hence half as many primi- 
tive cells in wurtzite as in zincblende. The Jones zone 
of wurtzite is twice the volume of the reduced zone of 
wurtzite, while the Jones zone of zincblende is identical 
with the reduced zone (the familiar truncated octa- 
hedron). Both Jones zones contain the same number of 


(e) (f) (g) (h) 


(0, 0,0) (1/3, 2/3, 0) (0, 2/3, 0) 
(0,2/3,1) (2/3, 1/3, 1/2) (1, 1/3, 1) 
(0,0,3/4) (1/3, 2/3, u) (0, 2/3, 3/4) 
(0,2/3,7/4) (2/3, 1/3,u+1/2) (1, 1/3, 7/4) 


(0, 0, 0) 

(1/3, 2/3, 1/2) 
(O, O, «) 

(1/3, 2/3, u+1/2) 


states per atom. These zones are shown in Fig. 5. A 
number of points of interest in the two zones are 
marked. The median section (k,’k,’ plane) through the 
zones is shown in Fig. 6. In Table IV the coordinates 
of various points in both zones are given in terms of? 


n=Bk,, ¢= (3) 


where ky, ky, ke are the Cartesian components of the 
k vector, parallel to the primed system (1), and (a, 8, y) 
have been defined in (2). We now discuss k vectors 
along the polar (‘‘c” or [111]) directions: [—A(ZB) 
and I'—I’(Wur) and perpendicular to them: '— K and 
I'—M in both structures. 

Now, to each k along [—A(ZB), there corresponds 
an advatieal k along '—I’(Wur), irons k=T to k=A 
or I’. Thus, considering plane wave propagation, we 
may choose plane waves of exactly corresponding wave 


yk, 


t=ak,, 


coordinates in terms 


TABLE II. First and second neighbors of the base atoms, zincblende - and wurtzite (Wur). Column (a): 
: Coordinates in con- 


of the usual Cartesian axes; (6), (d), (f): Coordinates in terms of (a, 6, y) [see Text Eq. (1) and (2)]; (©), (e): 
ventional hexagonal axes. Columns (a) and (b) are for ZB, columns (c), (d), (e), (f) for Wur. 


Wur> 
(2/3, 1/3, 1/2) =(1, 1/3, 1) (1/3, 2/3, 0) = (0, 2/3, 0) 
© @ (e) (f) 


Structure ZB* 


Base atom (0, 0, 0) 


First N eighbors 


(2/3, 1/3, 7/8) 
(1/3, —1/3, 3/8) 
(1/3, 2/3, 3/8) 
(4/3, 2/3, 3/8) 


(0, 2/3, 3/4) 

(0, 4/3, —1/4) 
(—1, 1/3, —1/4) 
(1, 1/3, —1/4) 


(1, 1/3, 7/4) 
(1, —1/3, 3/4) 
(0, 2/3, 3/4) 
(2, 2/3, 3/4) 


(1/3, 2/3, 3/8) 
(2/3, 4/3, —1/8) 
(—1/3, 1/3, —1/8) 
(2/3, 1/3, —1/8) 


(172, 2/2.1/2) 
(1/2, —1/2, —1/2) 
(—1/2, —1/2, ee 
(—1/2, 1/2, =) 


(0, 0, 3/4) 

(0, —2/3, —1/4) 
(—1, 1/3, —1/4) 
(1, 1/3, —1/4) 


Second Neighbors 


(4/3, 2/3, 1) 
(—1/3, —2/3, 1/2) 
(5/3, 4/3, 1) 

(1/3, 2/3, 0) (0, 2/3, 0) 
(1/3, —1/3, 1) (1, —1/3, 2) 
(2/3, —2/3, 1/2) (2, —2/3, 1) 
(2/3, 4/3, 1/2) (0, 4/3, 1) 
(4/3, 2/3, 0) (2, 2/3, 0) 
(2/3, 2/3, 4) (0, 2/3, 2) 
(5/3, 1/3, 1/2) (3, 1/31) 
(—1/3, 1/3, 1/2) 1, 173,14) 
(1/3, —1/3, 0) (1, —1/3, 0) 


(—1, 1/3, —1) 
(—1, —1/3, 0) 
(—1, 1/3, 1) 
(1, —1/3, 0) 
(1, 1/3, —1) 
(1, 1/3, 1) 

(0, 4/3, —1) 
(2, 2/3, 0) 

(0, 4/3, 1) 

(1, 5/3, 0) 

2, 2/3, 0) 
1, 5/3, 0) 


(—1/3, 1/3, —1/2) 
(—2/3, —1/3, 0) 
(—1/3, 1/3, +1/2) 
(1/3, —1/3, 0) 
(2/3, 1/3, —1/2) 
(2/3, 1/3, +1/2) 
(2/3, 4/3, —1/2) 
(4/3, 2/3, 0) 

(2/3, 4/3, +1/2) 
(4/3, 5/3, 0) 
(—2/3, 2/3, 0) (- 
(1/3, 5/3, 0) ( 


(4:2@) 

(1, —1, 0) 
(—1, 1, 0) 
(—1, —1, 0) 


(1, —1/3, 1) 
(—1, —1, 0) 
(1, 1, 0) 

(—1, 1/3, —1) 
(1, 0, 1) (—1, —1/3, 1) 
(1, 0, —1) (1, —1, 0) 
(—1, 0, 1) (4.4.0) 
(—1,0, —1) (1, 1/3, —1) 
(0, 1, 1) (0, 2/3, 1) 

(0, 1, —1) (2, 0, 0) 

(0, —1, 1) (—2, 0, 0) 

(0, -—1, -— (0, —2/3, 1) 


(2.2/3; 2) 
(0, —2/3, 1) 
(2, 4/3, 1) 


* Origin at a sulfur site. 
b Origin at site C (see Table I). 


32; A. H. Wilson, Theory of Metals (Cambridge 


8 J. R. Reitz, Solid State Physics (Academic Press, New York, 
ot Press, Cambridge, 1953), second edition, p. 90. 
Slater, "Handbuch der Physik (Springer- Verlag, Berlin, 1956), Vol. 
Pipe “(Verlag Julius Springer, Berlin, 1933), Vol. 24, part 2, p. 373. 


1955), Vol. 1, p. 


19, p. 13; A. Sommerfeld and H. Bethe, Handbuch der 
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Fic. 4. First and second neighbors in zincblende and wurtzite. 
Large circles are S atoms, small ones Zn. Open circles are in the 
same plane. Comparing the two structures, we note that only 
three of the twelve second neighbors differ, and even these are 
disposed symmetrically. These are shown as the 3 atoms ‘“‘above”’ 
and are rotated by x/3 in comparing zincblende and wurtzite. 


lengths in both structures. The planes of constant phase 
(close packed nets) are, of course, the same in both 
structures, alternating Zn and S as discussed above. 
The minimum wavelength (k=kmax) is X= 2d111 where 
di, is the Zn-Zn or S-S interplanar spacing. This is 
illustrated in Fig. 7. 

The situation along [—K and I'—M is not so 
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clear. As shown in Fig. 6, the distance '—K(ZB) 
>I'—K(Wur). In fact, as Herring has shown” a wave 
vector along the prolongation of [— K(Wur), is equiva- 
lent (differs by a lattice vector in the Fourier Space) 
to a vector along the line K’—M’. The point in the 
wurtzite zone equivalent to K(ZB) is shown as “en- 
circled K” in Fig. 6. In words this means that we can- 
not propagate a single plane wave (chosen from within 
or on the energy zone) in wurtzite in the direction [— K 
with wavelength equal to the wavelength of the vector 
K(ZB), but we must superimpose 2 plane waves: “en- 
circled K” plus (—b,+b:). Similarly M(Wur) does not 
coincide with M(ZB); the point equivalent to M (ZB) 
is shown as “encircled M” in Fig. 6. This lack of identity 
of zincblende and wurtzite points K and M makes for 
ambiguity in the comparison of states along '— K and 
I'— M. We may either (a) compare states at the same k 
thus necessitating folding of certain wurtzite k vectors 
as discussed above, or (b) compare states at the same 
fraction of kmax along these directions. A third alterna- 
tive, possible since the two energy zones contain the 
same number of states, would be to shift all zincblende 
k’s which are outside the corresponding wurtzite vol- 
ume in such a way as to duplicate this volume, and then 
to identify wurtzite states and shifted zincblende 
states. In Fig. 8 we illustrate the plane wave propaga- 
tion at points K and M in zincblende and wurtzite, 
showing the shortest waves which may occur in these 
directions. 

While '— X is a principal symmetry direction in zinc- 
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Fic. 5. Jones zones for zincblende and wurtzite. Each zone has volume = z*v2/a*. a is defined in Eq. (2). Now |[—A|=|I'—T’| 
= |Kmax|. For the section perpendicular to c see Fig. 6. 


1 C, Herring, J. Franklin Inst. 233, 525 (1942). 
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blende, it is clear from Fig. 5 that no principal direction 
in the wurtzite zone corresponds to it. Consequently we 
shall not attempt to make a correspondence for k vec- 
tors, or states, along this direction, even though the 
zincblende secular determinant factors simply (see refer- 
ence 3, p. 1522). 


4. CRYSTAL POTENTIAL IN ZINCBLENDE 
AND WURTZITE 


In discussing the relationship between the crystal 
potential in zincblende: V(ZB), and in wurtzite: 


TaBLE III. Symmetry elements zincblende and wurtzite.*:» 





Zincblende 


—v3/2 0 
—1/2 0 
0 1 


0 
1 
0 


0 0 

1/3 ) 

—4/3v2  —1/3 

—1/v3 —2//6 

—2/3 2/3v2 ) 
2/3v2 —1/3 


+1/(12)# —2//6 
5/6 2/(18)4 }: ' 
2/(18)* —1/3 - 
3/(12) 0 

—1/2 
0 
Wurtzite 


—v3/2 
64: 


my(x): 

/ 
my (Ox): 
my’" (62x) : 


oa(yz): 


1/2 


oa’ (65 yz) : v3/2 


* These elements are convenient for later discussions rather than a 
generating set, in the sense of Zachariasen.® 

b The elements are expressed in the primed Cartesian coordinate system, 
Eq. (1), except for those matrices indicated ¢: the latter are in terms of the 
original zincblende Cartesian system. 


V(Wur), one must distinguish two factors which enter: 
(a) the relationship between the bonding of a given 
compound in zincblende and ideal wurtzite structures, 
(b) the deviation of the wurtzite structure from 
ideality. 

It is rather unlikely, on the face of it, that there be 
no difference in bonding when a compound occurs in 
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Fic. 6. Composite section through the zincblende and wurtzite 
Jones zones perpendicular to c. The outer hexagon refers to zinc- 
blende, the inner to wurtzite. The vectors b,/2 and be/2 are half 
the wurtzite Fourier (reciprocal) lattice vectors, hence points 
K, K’, K” in the wurtzite zone are equivalent. The point K (ZB), 
considered as a point in the wurtzite zone, is equivalent to “en- 
circled K,” and similarly M(ZB) and “encircled M” are equiva- 
lent. The point K’(ZB), [on the prolongation of !— K’(Wur)] is 
not equivalent to K (ZB). 


either zincblende or wurtzite structures." The nature 
of the bond may be characterized by specifying the 
crystal charge density p(ZB) and p(Wur) and these are 
amenable to experimental determination by refined x- 
ray Fourier synthesis techniques. To date no work has 
been reported on this important problem, although some 
recent x-ray work has gone on in dealing with certain 
zincblende and wurtzite structures separately.‘ From 
somewhat general arguments it seems likely that the 
chemical bond of the same compound in wurtzite struc- 
ture should be more ionic than in zincblende structure." 
For example, this may mean that the “effective charges” 
on the ions are somewhat greater in the wurtzite then 

in the zincblende structure. 
Closely connected with this problem of bonding is that 
of the ideality of the wurtzite structure. We may take 
ZINC BLENDE WURTZITE 


7 7 
he — (in ke ool) 
Zn) «+22 00 

















Fic. 7. Plane wave propagation at the edge of the zone in 
[‘‘c] direction, at k=A(ZB) and k=I'’(Wur). Note that these 
shortest wavelength waves change phase by x on progressing from 
one close-packed plane to the next identical one, in either struc- 
ture (i.e., from an S to an S plane). 

See A. v. Hippel, Z. Physik 133, 158 (1952); reference 4; and 
also F. Keffer and A. M. Portis, J. Chem. Phys. 27, 675 (1957). 

2, A. Jumpertz, Z. Elektrochem. 59, 425 (1955). 
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IV. Coordinates of points in zincblende (ZB) 
and wurtzite (Wur) zones. 


k = (£,»,¢) See Eq. (3) 
Wur k 


(0, 0, 0) I (0, 0, 0) 

(0, 0, x) c (0, 0, x) 
(32/8, 92/8, 0) K (1/3, w, 0) 
(99/16, 9/16, 0) M (x/2, r/2, 0) 


as prima facie evidence for the essential identity of 
bonding in zincblende and wurtzite structures the 
ideality of the latter. Our argument is if those forces 
tending to make the wurtzite structure locally ideal 
[u=3/8, c/a=(8/3)'] win out over other steric and 
ionic forces which tend to produce the nonideal wurtz- 
ite, then the bonding is essentially determined by first 
neighbor interaction. Now these first neighbor inter- 
actions may be essentially responsible for the existence 
of a stable zincblende structure, and consequently it 
seems reasonable to infer that if a compound exists in 
zincblende and (essentially) ideal wurtzite structures, 
the bonding is substantially the same in both. It would 
then be natural to use the same valence charge densities 
per atom pair: p(ZB)=p(Wur)=p4tpz, where A and 
B are the two atoms/ions (i.e., Zn and S) of the com- 
pound AB, in constructing the crystal potentials V (ZB) 
and V(Wur). (This approach was used in the numerical 
work now under way for ZnS in zincblende and wurtzite 
structures. )* 

If the same atom/ion charge densities are placed on 
both zincblende and ideal wurtzite lattices, then for 
the crystal potentials we may take 

V(ZB)=V(Wur)+V’, (4) 
where V’ is a small perturbation. If the wurtzite struc- 
ture differs from ideality (w#3/8), then V’ will be the 
larger, as will the matrix elements computed from V’. 
In any event we shall use (4) in establishing various 
relationships between zincblende and wurtzite energy 


states. 


u(w) 


(zB) A 
ap-n a8 


ap-ort 
8 


Fic. 8. Plane wave propagation at A and M in zincblende and 
wurtzite. Note the different phase changes A@ on progression 
through an identical distance (i.e., different wavelengths) making 
for ambiguity in comparison of states in zincblende and wurtzite 
along this direction. (See Text.) 
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5. CORRESPONDENCE BETWEEN STATES IN 
ZINCBLENDE AND WURTZITE 
In Table V we give, for various k vectors of interest 
in zincblende® and, wurtzite“: (a) the symmetry ele- 


TasLE V. LCAO Bloch functions® for zincblende 
and wurtzite. 


Zincblende 
lr: G(l)=Ta=43m 


I, =as,+bs; 
aI,’ =a(—2/(6)4y1+1/v321) +b(—2/ (6)4y3+1/v323) 
A: G(A)=C3,=3m 
A, =as, +02: +¢s3+dz; 
DA3;=ax,+bx; 
K: G(K)=C,,=m’ 


v1 v1 21 c 1 3 q 
Kie™05, +0) ——+-—t+— [+-—] -—-—— 
v2 (6)! v3 v2 v2 (6) 
Xs 3 83 ji xs 3 
+dss+e] ——+——+ + a ee 
v2 (6)t v3 v2 v2 (6)! 
atv x1 M1 221 b X3 V3 223 
= pt —— TH 1 
v2 v2 (6)! V3 v2 v2 (6) V3 
M: G(M)=C,,=m", 
x MN ; yO 
M,.=as,+b 4 --—+ —---+— 
v2 (6)t v3 (6)* v3 
X3 v3 3 f X3 V3 2z3 
+dsste _ ——-—+—], 
v2 : v2 v2 (6)! v3 
x, v3 
M,,=a] —-—y 
> ie 


Pr: G(L) =Cev=6ymy 


Wurtzite 
I’, = (a/v2) (5; +52) + (b/v2) (21 +22) 
+ (c/V2) (s3+s3) + (d/v2) (z3+24), 
DI 5= (a/V2) (x; +42) + (b/v2) (x3 +454), 
K: G(K) =C3y= (64)?my, 
K,= (a/v2) (s1+52) + (0/v2) (21 +22) 
+ (c/V2) (ss3+54) + (d/v2) (23+24), 
)K3= (a/v2){ (—4$a1— (V3 /2) 91) +[—4.x2— (V3 /2) yo J} 
+ (b/v2){[—4x3— (v3/2) 93 J+ [—$a1— (v3 /2) ya}, 
M: G(M) =Cov= (64)8aa’ (65yz) my” (6x) 
M,= (a/v2) (81 +52) + (b/v2) (21 +22) 
+ (c/v2) (s3+s4) + (d/V2) (ss+24) 
M 3=al[ (x1/2—V3 91/2) + (42/2 —v3yo/2) ] 
+b[ (x3/2—Vv3.y3/2) + (44/2—V3y4/2) ] 
M y=al(v3x1/2+41/2) + (V3x2/2+-42/2) ] 
4 BL (V3x3/2+42/2) + (v3x4/2+94/2)] 
® All functions (Bloch sums) are expressed in terms of the primed axes 
Eq. (1). Where degeneracy exists the degree is indicated by a number ina 
circle, preceding the single partner which is given. For more conventional 
listing of the functions for ZB see D. Bell, Rev. Modern Phys. 26, 311 


(1954). The elements of various G(k) are explicitly given in Table III. In 
the functions, a, 6, c, d, e, and f are constants. 


81). Bell, Revs. Modern Phys. 26, 311 (1954); R. Parmenter, 
Phys. Rev. 100, 573 (1955); G. Dresselhaus, Phys. Rev. 100, 580 
(1955). 

4G. Dresselhaus, Phys, Rev. 105, 135 (1957); W. J. O’Sullivan, 
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ments (see also Table III) contained in G(k), the group 
of the wave vector k, (b) the LCAO wave functions for 
the different states [irreducible representations of G(k) ] 
considered. A function such as x; (j7=1, 3 forzincblende; 
j=1, 2, 3, 4 for wurtzite; see Table I) is an ortho- 
normalized Bloch sum of “x’’-like Léwdin functions,’ 
quantized on the primed Cartesian axes defined by 
Eq. (1), and centered on a lattice of sites of the jth 
sort (j=1,2 are S sites; 7=3,4 are Zn sites: see 
Table I). In Table VIa, VIb we give matrix components 
for s and p functions taking account only of first 
neighbor interactions for zincblende and wurtzite 
structures, in terms of the £ integrals.* (We also worked 


TABLE VI. Matrix components of energy for 
zincblende and wurtzite. 


a. Zincblende.* ro = (0, 0, 3/4) in units of (a, 8B, y)—Eaq. (2). 
(é, n, ¢) are defined in Eq. (3) of text. 
R on (0) (eset ‘Le it m2 cost cos (n/3) +cos(2n/3)) 
+i(2 cosé sin (n/3) —sin(2n/3))]}, 
(s/z)13= Eg (70) {E4S/4— AeA... YY 
(s/x)13=[47/ (6)* JE. (70) singe®(1/3-1/4) , 
(s/y)1s= — (2V2/3) Ez (70) [e7* 218+ 5/9) — coset (n/3-$/4) ], 
(s/s)13= Ezz (ro) e5/4+ (1/9) [8E,2 (70) +Eze(70) Je AL + © J, 
(x/x)13= Ezz (70) [635 /4+- et 2n/3t 14) 7 
+4 cost[ (4/3) Ezz (70) + (8/3) 
(y/y)13= Ezz (70) Le*t4+§ costei(n/s-t/0] 
+ (1/9) [Eee (40) +8E ee (10) {et@ns+ 90 4-2 costet(nis-t/0) 
(x/y)1s= (4V3i/9)[E,2(70) — Exx (70) ] singe’(1/3-1/) 
(x/2)13= (4V3i/9V2) [Ezz (70) — Ezz (70) ] singe*(9/3-1/) 
(y/z)13= (4/9V2) [Eee (10) — Ez (70) ]{e7*20/8+8/ — cosgei(a/3-£/4) }. 
b. Wurtzite.» ro =(0, 0, 3/4); r1=(0, —2/3, —1/4) in units of | 
(a, B, y)—Eq. (2). (é, ” 5) are defined in n E a. (3) of text. 
(n/m)13= (n/m) 24== Enm (10) eit! . ; 
(n/m) 14= Enm(71)[(2 cost cos (n/3) +cos(2n/3)) 
+i(2 cosé sin (n/3) —sin (2n/3)) Je“! 4, 
(n/m) 23= Enm(71)((2 cosé cos (n/3) +cos(2n/3)) 
—i(2 cost sin(n/3) —sin (2n/3)) Je“*#/4, 
(n/x)14=1V3 Eny (71) singe®(1/3-$/4), 
(n/x)23= —ivV3 Eny (11) singe *(9/3+ 5/4), 
(n/y)14= Eny (11) [e*@n+8 — costet(nis-t/9 
(n/y)o3= — Eny (171) Let @r8-5/ —coske t(n/stt/4) 7}. 
(p/p)is= (p/p) a= Ezz (roe! 
(p/p) 14= Exe (rile @v3tt/9) +2 costE rs (riei/s-t/), 
(p/p)os= Ezz (rie*@V8-0) +2 costE zz (rien tt), 
wae ha at (p/q)13= (p/q) a= sssbinatii pen 





(s/)u™ 


Ezz(to) Jest, 





® The [--+] in the expressions for (s/z)13 and (z/z)1s for zincblende is the 
same function of (£, 7, ¢) as given inside the square brackets in the expres- 
sion for (s/s)13. 

bm, m can be (s and 2); p, g can be (x and y). m and m may be equal; 
p “~ 


J. Chem. Phys. 30, 379 (1959); M. L. Glasser, J. Phys. Chem. 
Solids (in press); R. Parmenter (unpublished notes); F. W. 
Quelle, Jr. Quarterly Progress Reports, Solid State and Molecular 
Theory Group, Massachusetts Institute of Technology, Cam- 
bridge, Mass. July 15, 1958 (unpublished) p. 28; R. C. Casella, 
Phys. Rev. 114, 1514 (1959); J. L. Birman, Phys. Rev. 114, 1490 
(1959). I am indebted to Dr. Glasser, Dr. Parmenter, and Dr. 
Casella for making preprints of their papers available. 
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TaBLe VII. Two-center sienna and perturbation poseenatan. 


Zincblende 
Exus(10) = Ess(— 10) =ssa (70), 
—E,,(170) =spo(ro), 
=ppr(ro), 
ppa (ro). 





E.2(t0) = 
Ezz (10) = Exz(—70) 
E.2(10) = E22(—70) = 
Wurtzite 
E4:(t0) =spa (ro), 

Exz(t1) = —4spo(r1), 

E2zs(r0) = — Esz(10) = —Spo(ro), 

Ezs(t1) = —Ese(11) = 45po (71), 

Evy (11) = — Eya(71) = — 3V2spo (71), 

Ezz (10) = Exr(—70) = ppr(ro), 

71) =ppr(r1), 

E.2(10) = Ezs(—70) = ppa (ro), 
E.2(71) = (1/9) ppo (11) + (8/9) ppm (71) = ppw(71)* 
Ey2 (11) = (2V2/9) [ppo (11) — pp (71) ]=0" 


Ega(T0) =ssa (70), 


Exs(71) =sso(71), 


Ezz (11) = Ez:(— 


Perturbation Parameters 
sso (79; ZB) =ssa(79; Wur), 
Spa (ro; ZB) =spo(ro; Wur), 
ppo (ro; ZB) = ppo(ro; Wur), 

sso(71; Wur) =sso (79; ZB) +4), 
Spa(71; Wur) =spo(7r0; ZB) +62, 
ppo(r1; Wur) = ppmr(7,; Wur)* 


= ppa(ro; ZB) +63, 
= ppr(ro; ZB) +54. 


& These equalities follow from the fact that Ezz(r1; Wur) = Eyy(71; Wur), 
i.e., the apparent Co symmetry of the Emn integrals in Wur. 


out the results including second neighbor interactions 
but these would only unnecessarily complicate, without 
really clarifying, the relationships of interest. Of 
course, an accurate LCAO numerical calculation or 
interpolation should include these and any other 
necessary shells of neighbors.) Reduction of the EF 
integrals to the independent ones shown, has been 
carried out by using the elements of crystal symmetry 
(Table III). It is of interest to note that although the 
site symmetry in wurtzite is only C;,=3m the E in- 
tegrals have the higher symmetry characteristic of 
point group Cs,=6m. This is due to the “rotational” 
part of the crystal symmetry operations 6, and m,. In 
Table VII is given the reduction of the E integrals to 
two center integrals and also the definition of the four 
perturbation elements 6;, which are the matrix elements 
of the perturbation V’ of equation (4), in terms of which 
we shall obtain the relationship between certain zinc- 
blende and wurtzite energy states. 

Consider states along T—A(ZB) and ['—I'’(Wur). 
Using the information in Tables IV and VII, we have 
set up the secular determinants for energies of states 
at the end points'® (I! and A and I”), and these are 

15 Tf the wurtzite reduced zone is used, '(Wur) =I’ (Wur), and 
the states ',’(Wur) and I’s’(Wur) are respectively I's and Ig (see 
Dresselhaus, reference 14). 
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Fic. 9. Secular determinants k= (0, 0,0) zincblende and wurtzite. Two center, nearest neighbor approximation. For definition of the 
matrix elements see Tables VI and VII. Note that when all 6; > 0 [V’ in Eq. (3)], 0:(Wur) =1, (ZB) +1,’ (ZB), 1';(Wur) =I,’ (ZB). 


shown in Figs. 9 and 11 in terms of the two center 
integrals and the 6; defined in Table VIIT. In Figs. 10 
and 12 we indicate reasonable separations of the various 
states in zincblende and wurtzite, assuming these occur 
in normal order (as seems to be the case in ZnS, accord- 
ing to our most recent numerical results at these points). 
Note the effect of the perturbation V’ in mixing and 
shifting zincblende states to yield the corresponding 
wurtzite states. Since the relative change in magnitude 
of the corresponding matrix elements is greater at I 
than at A, I’, we conclude that the shift of correspond- 
ing states is greater at I’ than at the end points. Further, 
since within this simplified LCAO theory (first neighbor, 
2 center) the change in matrix elements varies con- 
tinuously from k=T to A, I’ we conclude that the shift 
of corresponding states varies continuously as illustrated 


WURTZITE 


ZINC BLENDE 





ince 





Fic. 10. States at k= (0, 0,0), illustrating shift and splitting 
of corresponding states. States are shown in “normal” order and 
separated by roughly the amounts indicated in the ZnS calcula 
tion now in progress 


in Fig. 13. Now this is certainly an oversimplification, 
since one would hardly expect the simple ¢(k) curves 
shown to apply to a real material. However, in the 
absence of exact information, this approach enables 
certain conclusions to be drawn with varying degrees 
of accuracy. Hence, for "—A, !—I”’ propagation: 


1. States (as well as k vectors) show a correspondence 
along this direction. 

2. The perturbation (shift and splitting)'® of corre- 
sponding states depends on k. For simple model (nor- 
mal order of states, band extrema at I’), it appears that 
the perturbation is greatest at I’, diminishing as k 
approaches the zone edge. It must be pointed out that 
G(k) in wurtzite=6,m, for k along ['—I’(Wur), 
G(I'(ZB))=43m, and G(k)=3m in zincblende for 
other points along '—A. Hence in our approximation, 
G(T(Wur)) is effectively a subgroup of G(I'(ZB)), 
while along '—I’(Wur) and '—A(ZB), G(k(Wur)) 
and G(k(ZB)) are effectively isomorphic. 

3. At fixed k the perturbation is the greater, the 
larger is V’. The factors influencing V’ are the change 
in bond type (ion charge) in going from zincblende to 
wurtzite, as well as the departure from ideality of the 
wurtzite structure. Another factor affecting V’ will be 
mentioned in discussing faulted structures: the proba- 
bility of faulting, a. 

4. As 6; 0 (wurtzite — zincblende) : 


r:(Wur)=1;(ZB) +1 (ZB) | 
I'5(Wur) =I'y’(ZB) 
I';(Wur)=A;(ZB) |at other points along T—A(ZB); 
1')(Wur) =A3(ZB) Jand '—T’(Wur). 


| at k= (000), 


For '—M and '—K propagation the situation is 
more complicated. First, it is not clear how to compare 


16 See, e.g., E. P. Wigner, Gruppentheorie (Edwards Brothers, 
Ann Arbor, Michigan, 1944), p, 128, 
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Fic. 11. Secular determinants: at k=A, k=I” for zincblende and wurtzite, respectively. 
Note that as 6; + 0; ';’(Wur) =A; (ZB), I's’(Wur) =A3(ZB). 


k vectors: three choices have been mentioned above. 
In addition the G(k) of zincblende and wurtzite along 
these directions are not simply related. For example 
G(k) for "— K(ZB) is Ci, (see Table III) where m’ is 
a mirror plane containing '— K(ZB) but at an angle to 
the x’y’ zone section of Fig. 5, while the G(k) for 
I'—K(Wur), except at K, is isomorphic to Ci, but the 
“mirror” plane is the glide plane m,. In addition, at K 
wurtzite the G(k) is isomorphic to C;,=3m (see Tables 
III and V). This comes about because the points in 
Fig. 5 labelled K, K’, K’’ are equivalent (differ by a 
lattice vector in the Fourier space for wurtzite). This 
is not true for the corresponding zincblende points, 
since the lattice vectors in the zincblende Fourier 
space are not in the plane of Fig. 5 nor does any com- 
bination of such vectors send K(ZB) into a vector 
rotated by 27/3. Consequently at K(Wur) there is a 
degeneracy: representation K;(Wur) is double de- 
generate. In Fig. 14 secular determinants at K(Wur) 
= k(ZB) = (1/3, 7,0) are shown and in Fig. 15 states 
are listed in a ‘‘normal” order. Consequently, a simple 
perturbation argument does not hold in this direction, 
for zincblende-wurtzite states. Evidently the same con- 
clusion holds for zincblende-wurtzite states along !— M. 


6. BAND STRUCTURE OF A MIXED CRYSTAL; 
EFFECT OF FAULTING 


The correspondences established above can help us 
understand the band structure of mixed and faulted 
crystals after we have established some geometric- 
crystallographic preliminaries. 

Actual synthetic crystals of ZnS,’ SiC, and related 
materials often show structural imperfections: stacking 
faults (random and periodic) and twinning. In fact 
structurally homogenous ZnS (either zincblende or 
wurtzite pure structure) is an extreme rarity in our ex- 

17H. Miiller, Neues Jahr. Mineral. Abhandl. 84, 1, 43 (1952); 
L. W. Strock and V. Brophy, Am. Mineralogist 40, 94 (1955). 


perience. A typical single crystal needle of ZnS is 
shown in Fig. 16. This crystal was grown in this labora- 
tory by H. Samelson, and shows (x-ray and optical) 
features characteristic of [111] faulting and twinning. 
In terms of the stacking diagram, Fig. 17(a), 17(b) 
shows a mixed crystal (wurtzite twinned on zincblende) 
and a particular kind of stacking fault: the rotation 
twin in zincblende. The latter kind of fault is helpful in 
understanding what we shall call the “polar effect” in 
producing a barrier. 

(a) Polar effect—Consider a crystal consisting of 
many regions of zincblende rotation-twinned with re- 
spect to each other. The polar axis (unique “c’’) has 
the same direction throughout the crystal. At each twin 
plane however, the local stacking is characteristic of 3 
layers of wurtzite structure [i.e., ---BCB--- in Fig. 
17(a) |. The atoms in the twin plane region are subject 
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Fic. 12. States at k=A and [”, in “normal” order and separated 
as indicated in the ZnS calculation now in progress. 
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Fic. 13. Shift of corresponding states along T—A(ZB) and 
I'—I’(Wur) (schematic). Heavy line is for zincblende, light one 
for wurtzite. This illustrates (in simple if exaggerated fashion) the 
conclusion that the shift of corresponding states is k dependent, 
and apparently greater at the origin than at the end of the zones, 
for the simplified model treated. The states I’,’(ZB) (upper valence 
state) and (ZB) (lower conduction state) and corresponding 
wurtzite states are illustrated. For Wur’ (faulted wurtzite-like 
structure) separations at each k depend upon a, the probability of 
faulting. 


to unbalanced forces causing departure from normal 
positions. The local (uniaxial) strain gives rise to a 
sheet of polarized material'* (piezoelectric plus pyro- 
electric effects) included in the zincblende. This thin 
sheet of a dipole layer is equivalent to a charge double 
layer, and it is well known that the potential exhibits 
a discontinuity on passing from one to another side of 
such a charge double layer.” The discontinuity of 
potential is a barrier (see Fig. 18). We call this a polar 
effect since the locked-in strain and polarization arises 
largely because of the polar (partial ionic) character of 
the chemical bond in this material. Quantitative work 
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on the magnitude of locked-in strain and potential dis- 
continuity still remains to be done. 

(b) Symmetry effect—Consider the mixed crystal in 
Fig. 17(b) and let us ignore distortions and polarizations 
in the boundary layer, and assume unchanged [111 ] 
interplanar spacings in the twinned crystal. Hence our 
mixed crystal consists of ideal zincblende-wurtzite re- 
gions in series for propagation parallel to [111], and in 
parallel (for propagation perpendicular to [111 ]). Be- 
cause of the identity of corresponding zincblende and 
wurtzite k vectors along [—A and '—I”, respectively, 
we see that a plane wave propagating parallel to [111] 
can pass undistorted from zincblende to wurtzite sides 
of the boundary for all k parallel to [111]. Otherwise 
stated we may preserve k on either side of the boundary. 
Now assume that in the zincblende region the crystal 
potential is V(ZB), and in wurtzite it is V(Wur). Then 
we may use Fig. 13 to determine relative location of 
energy levels of the appropriate states on either side of 
the boundary. For example, at k= (0, 0,0) zincblende 
states are shifted and split to yield corresponding 
wurtzite states as shown in Fig. 9. The result is indicated 
in Fig. 19, where the existence of a barrier (energy dis- 
continuity in both valence and conduction bands) is 
shown schematically. We emphasize that this barrier 
arises solely because of change in the symmetry of the 
crystal potential V(r) by calling this the symmetry effect. 
Further, the barrier height (discontinuity in energy at 
a fixed k) depends on k (i.e., electron energy) and 
should diminish as k approaches the zone edge if Fig. 12 
applies. Note that one may introduce a common refer- 
ence level of potential and also that the vertical separa- 
tion of the bands at fixed k, labelled AEzn and AEw, is 
characteristic of each of the structurally pure zincblende 
or wurtzite regions. Hence a given mixed crystal will 
show optical evidence of the simultaneous existence of 
both band gaps. The wavelength dependence of the 
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Fic. 14. Secular determinants at K (Wur) = (2/3, x, 0) =k(ZB). 


18 J. F. Nye, Physical Properties of Crystals (Oxford University Press, Oxford, 1957), appendix E;. J. Birman, Domains of Polariza- 


tion and Anomalous Birefringence in Twinned Zincblende (Sylvania report, 1957, unpublished). 
#9 J. A. Stratton, Electromagnetic Theory (McGraw-Hill Book Company, New York, 1941), p. 189. 
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anomalous photovoltage in twinned and faulted crys- 
tals of ZnS is probably related to this property.” The 
existence of barriers for conduction parallel to [111] 
has already been noted experimentally for faulted ZnS 
single crystals, and in fact there seems to be some 
evidence of more ohmic behavior at higher field strengths 
(higher electron energy and therefore k near the zone 
edge) which would corroborate the model of lower 
barrier height at larger k.?! 

Of course the actual barriers exist because of both 
polar and symmetry effects and much more work needs 
to be done before these are unscrambled. Conduction 
perpendicular to “‘c” involves carrier transport in zinc- 
blende regions, wurtzite regions, and within the barrier 
region. Quantitative analysis of conduction in this 
direction would involve averages over the [—M,T—K 
directions in both zincblende and wurtzite, and lacking 
precise experimental or theoretical information on band 
structures would be premature at this time. 

(c) Randomly faulted structure: Wur'.—Consider a 
region of a crystal which in terms of the stacking dia- 
gram is neither 2 layer nor 3 layer but rather where the 
stacking shows one dimensional stacking disorder. That 
is, we may characterize the structure by the proba- 
bility of faulting a.’’ The local potential in such a region 
of random faulting will fluctuate but it seems reasonable 
to relate it to the pure zincblende and wurtzite struc- 
tures’ potential on the average by 


V (ZB) =V(Wur’)+V'(a)=V(Wur)+V’, (5) 


where Wur’ means the randomly faulted, wurtzite-like 
structure, and the perturbation V’(a) depends upon a. 
The exact nature of this dependence is of great im- 
portance, but is a problem beyond the scope of this 
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Fic. 15. States at K(Wur) =k(ZB) = (#/3, x, 0) in a “normal” 
order. No “simple’’ correspondence between states exists here as 
in the cases of propagation along T—A; '—I’. 


2G. Cheroff and S. P. Keller, Phys. Rev. 111, 
A. Lempicki, Phys. Rev. 113, 1204 (1959). 
21 Lempicki, Frankl, and Brophy, Phys. Rev. 107, 1238 (1957). 
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LCAO METHOD 


FOR WURTZITE 


Fic. 16. Photomicrographs of a synthetic ZnS crystal needle. 
Upper: using unpolarized white light ; lower: under crossed Nicols 
using convergent plane polarized white light (the crystal has been 
rotated about its c axis in the lower photograph). The ¢ axis is 
directed along the needle, perpendicular to the striations and in 
the plane of the photograph. Note the alternation of optically iso- 
tropic bands (zincblende) and birefringent bands (pure wurtzite, 
or faulted wurtzite: Wur’(a), regions). Photograph courtesy of 
H. Samelson and L. Ankerson. 


paper. For example, we might define a so that a=0 
(perfect structure) means wurtzite structure and then 
V’(a=0) = V’, while a= 1 means zincblende structure so 
that V’(a=1)=0. If (5) holds and we assume that we 
can always choose a k parallel to the “c’’ axis, ranging 
from k=(0,0,0) to k= (0,0, 2) (see Table IV) (i.e., 
that there is always a plane wave of wavelength A= 2di11 
which can be propagated), then we may carry out the 
LCAO treatment relating the Wur’ structure to the 
zincblende as we did for the zincblende-wurtzite struc- 
tures. Now, however, at any k (parallel to ‘“c’’) the 
shift and splitting of ‘‘corresponding” Wur’ states will 
depend on a. It is of interest that the correspondence 
between states still seems to apply for propagation 
parallel to ‘‘c’”. For propagation perpendicular to “c’”’ 
we may relate Wur’ states to wurtzite states, by using 
(5). (It should be pointed out, however, that a randomly 
faulted Wur’ structure will no longer belong to space 
group Cs,': the 6; and m, operations will be lost and the 
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site symmetry at A, B,C sites would be C;3,=3m. This 
would not affect the simplified LCAO approach as 
used here.) 

In any event, if the definition of a given above holds, 
it would follow that the extremes of optical band gap 
(vertical energy band separations at some k) should be 
for pure zincblende and wurtzite structures, the gap of 
a crystal with random faulting lying always between 
these extremes. Experimental work to test this hy- 
pothesis would be of considerable interest. 





zB 
REGION 


CRYSTAL 


i b= W_REGION 














POTENTIAL 





OISTANCE ALONG “C* oR {i11]—> 


Fic. 18. Band structure: rotation-twinned zincblende illustrat- 
ing the barrier caused by a “polar’’ effect. This is due to a thin 
layer of locked-in wurtzite and hence polarization (due to pyro 
electric and piezoelectric effects), or an inner “double layer,” at 
the twin plane. 
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Fic. 17. (a) Stack- 
ing diagram of rota- 
tion-twinned = zinc- 
blende. Compare Fig. 
4. The dotted and 
shaded circles show 
stacking for the con- 
tinuation of the pure 
zincblende structure. 

| Note the 3 layers of 
| C wuRTZITE wurtzite structure at 

rg the rotation twin 

ets: “w plane: B, C, B and 
7/8 the probable atom/ 
displacements 
from ‘ideal’ posi- 
tions due to unbal- 
anced forces in the 
twin plane (arrows). 
/8 This locked in strain 
gives rise to a “po- 
lar” effect on the 
band structure for 
propagation (k) 
along ‘‘c.” (b) zinc- 
blende-wurtzite mix- 
ed crystal. Regions 
of zincblende may 
alternate with re- 
gions of wurtzite in 
a real crystal and 
within each region 
the potential may 
show substantially the 
zincblende or wurtz- 
ite site symmetry, as 
illustrated by optical 
properties shown in 
Fig. 16. 
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7. CONCLUSION AND SUMMARY 


The relationship between energy bands of a com- 
pound in zincblende and wurtzite structures was dis- 
cussed in the simplified LCAO framework. For propaga- 
tion parallel to the “c” axis ((—A or '—T”’) wurtzite 
states may be regarded as perturbed zincblende states. 
The magnitude of the perturbation V’ in Eq. (4) de- 
pends on (a) difference in bond type (e.g., effective 
charge) in the two structures, and (b) departure of the 
wurtzite structure from ideality. Consequently, the 
amount of the perturbation of corresponding zincblende 
and pure wurtzite states depends on these too. Along 
I'—M and l'—K in both zincblende and wurtzite it is 
possible to compare k vectors but the states along these 
lines are not simply related. This follows because there 
is not a simple relation between the group of the wave 
vector G(k) along this line for zincblende and wurtzite 
structures, as there is for propagation along ‘‘c’’ in the 
two structures, where G(k) for wurtzite is either a sub- 
group of, or isomorphic to, the corresponding G(k) for 
zincblende. 

Experimental results on ZnS and SiC band gaps in 
zincblende and wurtzite structures” suggest that some 
general relationship exists between the band gaps 
(relative displacements of various conduction and 
valence band states) of a compound in zincblende and 

2H. R. Phillip, Phys. Rev. 111, 440 (1958); F. A. Kroeger, 
Physica 7, 1 (1940). 
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wurtzite structures. We have in Fig. 12 a hint of such a 
general relationship, although Fig. 12 applies only toa 
simplified case within the already simplified LCAO 
method. In particular we would like to know: do band 
extremes occur at corresponding (as we have defined 
them) points or lines in zincblende and wurtzite? If 
not then what determines a shift of the extremes when 
going from one to another structure? 

For a mixed crystal of zincblende twinned on wurtzite 
we find energy (k) dependent barriers for propagation 
parallel to the “‘c’”’ axis (the same k can be used on either 
side of the boundary plane). We identify two effects as 
producing these barriers: (a) a polar effect, (b) a sym- 
metry effect. The polar effect is illustrated by rotation- 
twinned zincblende where the boundary (twin) layer 
gives rise to a locked-in double layer across which the 
potential is discontinuous. The symmetry effect is due 
to change in symmetry of the crystal potential on pass- 
ing from wurtzite to zincblende regions and gives rise 
to the k dependence of barrier height. A mixed crystal 
should show evidence of both zincblende and wurtzite 
gaps in its optical absorption; the wavelength depend- 
ence of the anomalous photovoltage may be related 
to this.” 

For the randomly faulted wurtzite-like structure: 
Wur’ (a), we expect a smooth progression of gap width 
from pure zincblende to the pure wurtzite depending on 
a (probability of faulting) which controls V’(a), in Eq. 
(5). Hence assuming that all k are allowed from 
k= (0,0,0) to k= (0,0, 2) (I’ or A), (see Table IV), i-e., 
that plane waves of wavelength as short as A= 2di1 can 
be propagated parallel to “‘c,” the effective gap at any k 
(along ‘‘c’”) will depend upon a in that local region. 
Similarly, the barrier height (symmetry effect) on 
transition from a zincblende region to a Wur’ region 
will depend upon the a@ of the latter. Propagation per- 
pendicular to “‘c” in such a region will involve carrier 
motion on the close-packed sheets ([111] planes) 
within zincblende, wurtzite, Wur’ (a), or barrier regions. 
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Fic. 19. Band structure of a mixed crystal illustrating the 
“symmetry effect” or production of a barrier due to the shift and 
splitting of corresponding states. The situation shown is for 
constant k propagation on both sides of the barrier, k parallel to 
“c.” Barrier heights (discontinuities) as well as absolute magni- 
tudes of energy separations (AEzn, AEw) are k dependent. To 
obtain a proper “symmetry” effect band structure at a different 
k one uses Fig. 13 (or its correct equivalent) and draws a vertical 
line at the appropriate k. The figure is schematic, as the magni- 
tudes of barriers, splittings, and shifts of bands with respect to 
the common reference level are not known for any particular 
material. For example, it is entirely possible that the valence and 
conduction bands (for some particular material) are all shifted 
away from the common reference level, on passing from the zinc- 
blende to the wurtzite parts of the mixed crystal. 


As a final word, the simplicity of the arguments used 
in this paper must be emphasized. A really self- 
consistent dynamical theory of electron propagation 
and band structure analogous to the work on x-ray and 
electron diffraction in faulted materials now under 
way,” remains for the future. 
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The saturation of the paramagnetic resonance of the (halogen)2~ complex (V x center) in the alkali halides 
has been studied. The saturation of the absorption signal (x’’H) versus H, is obtained over a 60-db power 
range for KCI, KBr, and LiF at 78°K. Portis’ theory of inhomogeneous saturation has been generalized by 
omitting the assumption that the individual spin packet width is very much smaller than the envelope width. 
Methods are developed to determine independently from a given experimental saturation curve the spin- 
packet width 1/7:, the spin-lattice relaxation time 7), and the product 7,72. For KCl and LiF values of 7», 
T, and 7;T: are determined for the different hyperfine lines of the Vx center spectrum. For KBr only the 
product 7,7: could be obtained. From the results it is concluded that the spin-packet width is not limited 
by 7;. For KCl at 78°K, 7;~77>2; for LiF, T2 is two orders of magnitude or more less than 7 and depends 
in a complicated manner on the external magnetic field and the angle between the Vx center axis and the 


external field. 


INTRODUCTION 


XPERIMENTALLY measured electron  spin- 

lattice relaxation times have not generally been 
in quantitative agreement with calculated spin-lattice 
relaxation times for electron spins in metals, semi- 
conductors, and insulators. In some cases the agreement 
is sufficiently poor that it is uncertain what is the 
dominant mechanism responsible for the coupling 
between the spins and the phonons. A particularly 
interesting system for spin-lattice relaxation studies is 
the Vx center,! the first hole-type color center in alkali 
halides discovered by paramagnetic resonance tech- 
niques. There are several reasons why this center is 
more attractive for the study of paramagnetic relaxation 
than many other paramagnetic systems: (1) The 
electronic structure of the Vx center is known quite 
accurately.2 The wave function (mostly p orbital), the 
principal g values, the hyperfine coupling constants, 
and the electronic excited states are well known for 
this center in LiF, KC], KBr, and NaCl. These parame- 
ters are better known than for iron group paramagnetic 
ions (d orbitals) or rare earth paramagnetic ions (f 
orbitals). In addition, by forming the Vx centers in 
different alkali halides, these parameters can be varied 
which is a helpful method for sorting out relaxation 
mechanisms. (2) These centers are located in a simple 
cubic lattice. Moreover, extensive work has been done 
on the lattice dynamics of the alkali halides*—much 
more than on the often studied salts containing water 
of hydration. Furthermore, the Vx center (trapped 
hole) is a smaller perturbation on the lattice than other 


* Sponsored in part by the Alfred P. Sloan Foundation; it is 
part of a thesis submitted in partial fulfillment of the requirements 
for the degree of Doctor of Philosophy in Physics at the University 
of Illinois. 

+ Present address: General Electric Research Laboratory, 
Schenectady, New York. 

1W. Kinzig, Phys. Rev. 99, 1890 (1955). 

2T. G. Castner and W. Kinzig, J. Phys. Chem. Solids 3, 178 
(1957); T. O. Woodruff and W. Kinzig, J. Phys. Chem. Solids 5, 
268 (1958). 

3M. Born and K. Huang, Dynamical Theory of Crystal Lattices 
(Clarendon Press, Oxford, 1954), Chap. II. 


paramagnetic defects involving vacancies or inter- 
stitials. (3) The Vx center has an axis of symmetry and 
strongly anisotropic spin-orbit and hyperfine inter- 
actions. This makes it possible to check the orientation 
dependence of the spin-lattice relaxation times, another 
useful method for sorting out mechanisms. In addition, 
because of the well resolved hyperfine spectrum, 
relaxation times can be determined for different hyper- 
fine lines. (4) Since the Vx centers can be formed by 
x-irradiation at 78°K, their concentration can be 
changed in a controlled manner. This fact allows one 
to investigate possible concentration-dependent spin- 
lattice relaxation effects in the same sample. (5) It is a 
one-electron-type defect. 

The F center, on the other hand, is the best known 
paramagnetic color center and has also received the 
most attention. However, there are several reasons 
why the Vx center is more attractive for paramagnetic 
relaxation studies. The much greater localization of the 
wave function for the Vx center allows a simpler 
theoretical model for spin-lattice relaxation calculations. 
The lattice accommodating the trapped hole contains 
no vacancies. The narrow resonance line widths for the 
Vx center insure better signal to noise than for the 
F-center resonance and greatly reduce the problem of 
modulation fast passage effects. The details of the 
resonance spectrum of the Vx center may allow more 
information to be measured about relaxation processes, 
although double-resonance techniques for the F center* 
might offset this advantage. Nevertheless, it will be 
interesting to compare the Vx-center saturation be- 
havior with the known results for the F center. 

The F-center resonance static line width has been 
successfully attributed to the hyperfine interaction of 
the electron spin with nuclei surrounding the F-center 
halogen ion vacancy.® This broadening is of the in- 
homogeneous type due to the interaction of the electron 
spin with something outside the F-center electron spin 


4G. Feher, Phys. Rev. 105, 1122 (1957). 
5 Kip, Kittel, Levy, and Portis, Phys. Rev. 91, 1066 (1953). 
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system. Portis’ investigated the saturation behavior 
of the F center at room temperature showing that the 
dispersion component, x’,.of the susceptibility did not 
saturate at all. The absorption component, x’’, of the 
susceptibility saturated, but in a different manner than 
a homogeneously broadened line, the susceptibility 
falling off only as 1/H, rather than as 1/H,° for the 
homogeneous case. 

The problem of inhomogeneous saturation was first 
treated by Bloembergen, Purcell, and Pound’ for an 
inhomogeneous external magnetic field. Portis® de- 
veloped the theory of saturation due to inhomogeneous 
broadening for both the dispersion and the absorption 
and successfully explained the experimental saturation 
behavior of the F center including the breakdown of 
the Kramers-Kronig relations connecting x” and x’. 
Portis, from his F-center absorption saturation curve, 
determined the product of the longitudinal spin-lattice 
relaxation time, 7), and the transverse or spin- 
dephasing time, 7). However, he could not independ- 
ently determine 7, or 7, without a further assumption 
about the spin-packet width. A spin packet is composed 
of all spins having the same static field to within a field 
interval determined by the dephasing time 72. In any 
case, for the F center the inhomogeneity broadening 
producing the static width is several orders of magni- 
tude larger than the individual spin-packet width. 

The Vx center differs markedly from the single 
broad unresolved F-center resonance. The trapped 
hole resonance consists of a well-resolved hyperfine 
spectrum containing a large number of quite narrow 
lines. Analysis reveals the spectrum can be accounted 
for by assuming that it arises from a halogen molecule 
ion (e.g., Cleo in KCl, Fz in LiF). A complete dis- 
cussion of the electronic structure of the Vx center has 
been made by K dinzig and the author,” hereafter referred 
to as “C and K.” Their discussion reported that even 
the narrow Vx center lines (1.35 gauss in KCl) were 
Gaussian and likely inhomogeneously broadened by 
the hyperfine interaction of the hole with nuclei neigh- 
boring the molecule ion. The work below reports 
absorption saturation measurements on the Vx center 
electron-spin resonance (ESR) transitions and the 
determination of the individual spin-packet width 
(1/yT2) and the longitudinal spin-lattice relaxation 
times at 78°K. Because of the narrow lines and the use 
of a high-power klystron, saturation curves could be 
obtained over a 60-db range in power. These curves 
for large power bent down like a homogeneously 
saturated resonance. Portis’ saturation theory for 
inhomogeneously broadened lines has been extended 
to find a general expression for x’’ without making the 
assumption that the individual spin-packet width is 
very much less than the envelope width due to the 
inhomogeneous broadening. Making use of the general 
expression for x”’ and curve fitting techniques, values 

© A. M. Portis, Phys. Rev. 91, 1071 (1953). 

7 Bloembergen, Purcell, and Pound, Phys. Rev. 73, 679 (1948). 
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of the spin-packet width and 7, were determined. It 
had been the bias of the author and others® to assume 
that the spin packet-width must be limited by 7}. 
However, the experimental saturation curves could 
only be explained by making 7, longer than TJ». This 
work concludes that the spin-packet width is not 
determined by spin-lattice relaxation at 78°K. In 
addition, the results show that steady-state saturation 
methods may be used to extract the spin-packet width 
and spin-lattice relaxation of inhomogeneously broad- 
ened ESR transitions if the lines are narrow enough 
and sufficient microwave power is available. It is noted 
that the inhomogeneous saturation is a quite common 
and general phenomenon, and there are a large number 
of examples of inhomogeneously broadened ESR lines.® 


Saturation of Homogeneously Broadened Lines 


Saturation of homogeneously broadened lines is 
characterized by a uniform decrease of the suscepti- 
bility of the entire line independent of where the power 
is applied in the line. The line-broadening mechanism 
ensures that the microwave energy is absorbed by all 
the spins. Bloembergen"” treats this case, and Portis® 
reviews it in his F-center saturation work. Following 
Bloembergen, we consider the case with the spin S=}, 
and let » be the surplus number of spins, n= N_—N,, 
where NV_ is the number of spins with M,= —4 and N, 
is the number with M,=+4. Then n has a thermal 
equilibrium value 29= (N4+N_)g8H/2kT , where g is 
the spectroscopic splitting factor, 8 is the Bohr mag- 
neton, H is the applied magnetic field, and 7, is the 
lattice temperature. The magnitude of is determined 
by the competition between the applied power tending 
to saturate the ESR (making n=0) and the spin-lattice 
relaxation tending to restore the excess spin population 
to the thermal equilibrium value mo. Thus 


(dn/dt) = —wyH\?g(w—wo)n, 
(dn/dt)su= —(n—Mo)/T,, 


where y is the gyromagnetic ratio, H, is the rotating 
component of the microwave magnetic field inducing 
transitions (AM,=+1), g(w—wo) is the normalized 
shape function for the transition, and 7, is the spin- 


8A. M. Portis, Phys. Rev. 104, 584 (1956); A. G. Redfield, 
Phys. Rev. 98, 1787 (1955); C. P. Slichter (private discussion). 
More recently, Bloembergen, Shapiro, Pershan, and Artman, 
Phys. Rev. 114, 445 (1959), discuss the problem of saturation of 
inhomogeneously broadened lines. ‘They consider a spin-packet 
width 1/yT.2 and relaxation time 7, and introduce a cross- 
relaxation time 72; determined by multiple spin flips or by random 
diffusion in frequency space. The time required for a spin to 
diffuse across a distribution of width 1/y72* is found to be 
T4/T2*, 

9G. Feher, Phys. Rev. 114, 1219 (1959); W. Kinzig and T. O. 
Woodruff, J. Phys. Chem. Solids 9, 70 (1958) ; Kanzig finds that the 
// center saturates in a way indicating inhomogeneous broadening. 
Other paramagnetic color centers also show inhomogeneous 
saturation. See also J. L. Burkhardt, Phys. Rev. Letters 2, 149 
(1959). 

10N. Bloembergen, Ph.D. thesis, Harvard 
(Drukkerij Fa. Schotanus & Jens, Utrecht, 1948). 
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lattice relaxation time. For the stationary condition 
these two rates are equal, and the surplus number of 
spins m is given by 


1 
oa" ‘ao (3) 
1+77 Hig(w— wo)T1 1 


The rate at which power is absorbed from the micro- 
wave field is P=}hw(dn/dt),+, the 4 appearing because 
one spin flip changes n by 2. Combining this with the 
relationship between the power absorbed by the sample 
and the absorption susceptibility (i.e., P= }wx’H,’), 
the absorption susceptibility resulting is 


, 3 . (4) 
1+y°H;’ *2(w— wT: 


where the static susceptibility xo is equal to 
g’8?(N,+N_)/4kT 1. If T2 for the homogeneous line is 
defined as in Portis’ work; i.e., g(0)=T2/x, this 
expression assumes the familiar Bloch form for the line 


center, 
1 
x’ ‘= $y wo! (- : ——~}) (5) 
1+7y°H? T: T; 


The X-band spectrometer described below measures 
a signal Vp « x” H;. Below saturation the signal is just 
proportional to the microwave field H,, while for strong 
saturation of the homogeneously broadened line the 
signal V»« 1/H,. In this case the power absorbed from 
the rf field by the ESR transition becomes independent 
of H,. The relative signal measured by the spectrometer 
has the form 
Vr=X 


where YX = yH,(T,T:2)'. From a homogeneous saturation 
curve the value 7,7; may be experimentally deter- 
mined, and knowledge of the homogeneous width 
(1/yT2) allows one to find the spin-lattice relaxation 
time 7,. We now will compare this homogeneous 
saturation with inhomogeneous saturation. 


(1+ X°), (6) 


Inhomogeneous Saturation 


Inhomogeneous broadening of the ESR transition 
comes from those interactions outside the electron spin 
system which vary slowly over the time required for 
spin transitions. Only those spins with Larmor fre- 
quencies within 1/7, or yH,, whichever is larger, of 
the Larmor frequency w= 7H satisfying the resonance 
condition will be saturated with sufficient microwave 
power. The spin packets precessing at different Larmor 
frequencies interact very weakly with each other, and 
spin diffusion" is extremely slow. Thus it is possible to 
“dig a hole”’ in the inhomogeneous line; that is, saturate 

1 P. W. Anderson, Phys. Rev. 109, 1492 (1958). Calculation of 
the spacial spin diffusion for the Vx center in concentrations of 


10'7/cc using Anderson’s result give a spin diffusion of order 0.1 
second. 
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only a narrow portion of the line, namely the width 
determined by H, or T>2. 

In treating inhomogeneous saturation we must now 
distinguish between two frequency distributions; one 
g(w—w’), the line-shape function of the individual spin 
packet which is homogeneously broadened and very 
narrow, and two, the distribution of static fields 
h(w—wo) (centered about wo) providing the shape 
function or envelope of the distribution of spin packets. 
These are both normalized so that 


f g(w—w’)dw’=1, f h(w’ —w»)dw'=1. (7) 
0 0 
Following Portis? and implicitly assuming that the 


spin packets behave independently of each other and 
making use of (4), the absorption susceptibility becomes 


(8) 





; * are’ g(w—w’)h(w—wo)dw’ 
(a) =tro f ‘e 
0 1+nry’H PT ig(w—w’) 
Portis has shown the dispersion susceptibility to be 
2 wh(w’ —w)dw’ 
x'(w) = bof 
0 1t+9yHYTig(w—’) 
® 201""¢(w"’ —w ‘)do" 
| 
0 
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19 9 
ee 


Portis then shows, for the case in which the over-all 
broadening is very large in comparison with the width 
of the spin packets, that these expressions become 


1g (w—w’ dw’ 


1+rH? 


© 2ey’*h(w—wo)dw’ 
x!) =bxe ff, 
0 


, (10) 





"(w)= _ 5xawh (w— oof 


Tig(w—w’) 


- (11) 
w?—w? 

Several important features should be noted. The 
expression for the dispersion is independent of the 
microwave field, Hi, and does not saturate. The ab- 
sorption line shape is that of the envelope and is 
independent of the degree of saturation. The way in 
which x” saturates will depend on the line shape 
function of the individual spin packets. In particular, 
if the line shape is Lorentzian, the shape function will be 


Ts 1 


sererretunnmenestoorirsm (12) 
wr 1+T?(w— w’)? 


g(w—w’) = 


Performing the integration the absorption suscepti- 
bility will be given by 


if 
(1+7°HYT iT: 


x” = 3xawoh (w— wo) 





(13) 
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and the spectrometer absorption signal becomes 
Vax x” Myx X/(1+X*)}, (14) 


where X=7H,(7,T:2)!. In this case the absorption 
signal increases linearly below saturation and then just 
flattens out for X¥>1. 

In the saturation of the F-center resonance, Portis 
found that the dispersion signal did not saturate at 
all over a range of 50 or more in power. The absorption 
signal increased linearly and then flattened out in 
agreement with (14). In addition, Portis determined 
the value of 7,72 at room temperature. His results 
confirm that the individual spin-packet width is 
extremely narrow in comparison with the inhomo- 
geneous broadening. In addition, the individual spin- 
packet line shape is closely Lorentzian. Portis empha- 
sizes that if the spin-packet line shape fell off any more 
rapidly in the wings than a Lorentzian line, the absorp- 
tion signal for H)>1/y(Ti1T2)! (X>1) would decrease 
instead of flattening out. That the individual spin- 
packet line shape is Lorentzian will be the major 
assumption in extending Portis’ theory. 

The Vx center resonance lines are Gaussian and are 
inhomogeneously broadened, but the spin-packet width 
may be only an order of magnitude or so less than the 
inhomogeneous width, and it is no longer a good 
approximation to assume that h(w—wo) is slowly 
varying over a spin packet. From the results on the F 
center and the nature of the individual spin packet, 
one can conclude that the Lorentzian assumption seems 
a good one for the line shape of the individual spin 
packet. Hence, the normalized distributions are 


: (15) 


1 w’—wo\? 
h(w' — a9) =— — exp| - (-“— ) | (16) 
1 Awg Awg 


where Awg is the Gaussian width of the inhomogeneous 
broadening. (The width between points of maximum 
slope of the absorption, x”, is AH max slope= V2Awe/.) 
The 7, of the homogeneous Lorentzian line is defined 
by the condition Aw,7.=1, where Aw, is the frequency 
width for half maximum of g(w) for the individual spin 
packet. The absorption susceptibility becomes, from 
(8), 
2X0 
tiAw Awg 
w’ exp — (w’ —wo)?/ (Awa)? Jdw’ 


penile all - -. (17) 
J 1+[(w—w')?/Aws]+ (YH 2T)/Awr) 





In general, this integral is difficult to evaluate, but it 
can be solved for w=wo; that is, the susceptibility may 
be calculated at the line center. This has been done in 
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Fic. 1. Inhomo- 
geneous saturation 
curves showing the 
spectrometer absorp- 
tion signal Vz versus 
the reduced micro- 
wave field H,/Hj, 
where Hy=1/y 
(T:T2)* for various 
values of a. The 
quantity a is the 
ratio of the spin- 
acket width to in- 
omogeneous enve- 
lope width. These 
curves are a plot of 
(19). 


COMPLETE INHOMOGENEOUS CASE 








Appendix A. The result is 


—{ i —(at) ], 


wo ) exp(a’/*) (18) 


x’ (wo) = boni( 


AWG 


where ®(at) is the error function, a is a parameter which 
measures the degree of inhomogeneous broadening, 
namely the ratio Aw, /Awg of the Lorentzian spin-packet 
width to the inhomogeneous Gaussian width, and 
t= (1+7?HT,T;)! is a saturation factor. The relative 
spectrometer absorption signal is proportional to x’’H; 
and can be written 


x {1—#(a(14+X%)!)} 
Ve —— exp(a?X*)- — 
[1—®(a)] 





= (14x9) e 


As the parameter a approaches zero, the expression for 
Vr approaches Eq. (14) corresponding to the F-center 
case where the individual spin-packet width is negligible 
compared to the inhomogeneous broadening. Figure 1 
shows the spectrometer signal Vez plotted versus X or 
H,/H, (Hy=1/y(T1T:2)' where Hy is the value of the 
microwave field H; which makes the ordinary saturation 
parameter 3. Vp is plotted for various values of a. For 
the F center an estimate of a places it in the range 
1.6X10~ to 10%, indicating why the saturation curve 
of the F center is quite flat for Y>>1. Having an experi- 
mental saturation curve intermediate between the 
homogeneous and complete inhomogeneous case, it is 
now possible to relate the shape of the saturation curve 
to the individual spin-packet width AH. 


Determination of T,, (7;7:)', and 7; 


Combining curve-fitting techniques with the results 
above, one can determine the spin-packet width and 
the spin-lattice relaxation time. Figure 2 shows a 
typical absorption signal saturation curve. We introduce 
now the quantities Vz } tower ANd Va 4 upper Where V py 
is } the maximum Vz for a given saturation curve. If 
we consider the separation between Vp 4 upper and 
Vr 4 ower AS & Measure of the width of the saturation 
curve it is evident from Fig. 1 that the width increases 
as the parameter a decreases. By taking the various 
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Fic. 2. Typical inhomogeneous saturation curve. The experi- 
mental ratio of Hi(V pr 4 upper) to Hi(Ve 4 tower) determines a or 
AH ,/AHe and hence T2. H, is H, at the onset of saturation and 
determines (7\T:)*. The ratio of xus"F to xs"; for Hi>Hy, 
(H,;>H,T2/T2*) determines T; when T;* is known. 


values of a in Fig. 1 and computing the ratio of H, 
(Ve 4 upper) to Hi (Ve 4 tower), the curve shown in Fig. 
3 is determined. From the value A (Va 4 upper)/ 
H, (Vr 4 tower) for a given experimental curve, it is 
possible with the help of Fig. 3 to obtain a and the 
individual spin-packet width. An appealing feature of 
this 7, determination is that it does not require an 
absolute determination of H,. 

The evaluation of H; and consequently (7,72)! can 
now be done once a has been found. The intersection 
of the linear signal below saturation with the horizontal 
line drawn tangent to the maximum of the saturation 
curve locates what we shall call Hj uncorrected. The 
multiplicative correction factor which relates it to the 
true H;, is a known function of a (see Fig. 1) and ranges 
from 1 for a equal zero to 2 for the completely homo- 
geneous case (for the homogeneous saturation case H, 
corresponds to Vr max). We have now determined nearly 
independently 7, and (7;72)!. 

For H\>H, and a<1 it is possible to determine 7; 
independently by a method herewith called the 7:* 
method. Here T;*(Awg/V2)=1 if we note that Awg/v2 
is just the rms static width of the inhomogeneous line. 
It can be shown (see Appendix B) that the ratio of the 
saturated absorption susceptibility to the unsaturated 
susceptibility is given by 


x2’ (wo) (-) 1 
Xue’ (wo) T Hi, 


where 1/7°H°7,T* looks like an ordinary saturation 
parameter (y°H°7\T2*>1) in which 7, has been re- 
placed by 7;* representing the static line width. We 
can say that Eq. (20) represents physically the satu- 
ration of the entire Gaussian distribution of spin 
packets, and the individual spin packet has lost its 


(1+ correction terms), (20) 


identity. However, for (20) to be valid it is necessary 
that H,>H;(T2/T2*), which is a more restrictive con- 
dition than for a homogeneously broadened line of width 
T,* since we also require a1 for the correction terms 
to be small. 

The picture of inhomogeneous saturation of x” is as 
follows. The onset of saturation of the individual spin 
packet occurs when H, approaches H,. As H, increases 
above H;, the absorption susceptibility of the spin 
packet falls off as 1/H,’. However, as H, increases it is 
also covering more and more spin packets; in fact, the 
number of spin packets covered is closely proportional 
to H,. Hence the susceptibility falls off as 1/H, instead 
of as 1/H,;. This 1/H, falloff is not quite correct 
because, instead of having a constant distribution of 
spin packets, we have a Gaussian distribution of spin 
packets. The number of distinct spin packets is approxi- 
mately AH¢/AH, or 1/a. A word of caution should be 
injected about the spin-packet concept. If the spin- 
packet width arises from time-varying magnetic fields 
due to the reorientation of nuclear magnetic moments 
in the immediate vicinity of .a center, two or more 
centers may belong to the same spin packet but still 
be essentially noninteracting. Finally, when H, ap- 
proaches (AHg/AH,)H;, enough power is available to 
saturate all the spin packets in the distribution, and 
the entire line then saturates homogeneously like a line 
of the static width. 

Thus we can independently determine 72, (71T>), 
and 7, for an inhomogeneously broadened saturation 
curve, provided the saturation curve extends over a 
sufficiently large range of microwave power. The 72 
or spin-packet width is determined by the parameter a 
which is a function of the width of the absorption signal 
saturation curve. The value of (7,7.)! results from 
H, which measures the onset of saturation of the 
individual spin packet. It should be emphasized that 
each individual spin packet saturates homogeneously. 
Finally, by the ““7,* method” where all the spin packets 
are saturated together much like a line of width 
Awg=v2/T2*, we can determine 7;. These three ap- 
proaches will be applied to the saturation of Vx center 
ESR transitions. For the F-center case only the value 
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Fic. 3. The ratio of Hi(Vr 4 upper) to Hi(VR 4 tower) versus a 
or AH,,/AH«, based on curves shown in Fig. 1. 
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(T,T 2)! or H, could be determined, because not enough 
of the saturation curve could be obtained. 


The Experimental Apparatus and Techniques 


The experiments reported here were done with an 
X-band spectrometer of the balanced-mixer type 
featuring a closed loop and separate sample cavity and 
detector magic-tee bridges. A block diagram of the 
spectrometer is shown in Fig. 4. An excellent analysis 
of various types of spectrometers is given by Feher.” 
This type of balanced-mixer closed-loop spectrometer 
has two significant advantages. First, klystron ampli- 
tude noise can be balanced out to first order by the 
use of matched detectors. Second, and of importance 
for saturation work, one can keep a constant microwave 
bias power on the detectors while the power incident 
on the sample cavity can be varied from maximum 
power to 100 db below maximum power. The conversion 
gain of the detectors remains constant while the power 
incident on the sample cavity is varied over many 
orders of magnitude. The klystron employed was a 
V58 with a maximum power output of 600-700 milli- 
watts for the high-power mode. The klystron was 
frequency-stabilized with a modified Pound I. F. 
Stabilizer.’ For detectors selected Sperry 821 Bolome- 
ters were used. For independent microwave phase 
adjustment and microwave power attenuation a 
Hewlett Packard X885A Phase Shifter and a Hewlett 
Packard X382A Precision Variable attenuator were 
employed. The narrow-band amplifier was a Liston 
Becker 8-cycle breaker amplifier modified as follows. 
The input breaker was disconnected converting the 
amplifier into an 8-cycle ac amplifier. The input breaker 
served as the reference for the 8-cycle modulation of 
the magnetic field. The output breaker acted as a 
synchronous detector. 

The sample cavity was operated as close to match 
as possible (|I"| <0.01 was readily obtainable) meaning 
that the cavity bridge was nearly balanced. The bridge 
unbalance was monitored with a crystal connected to 
a sensitive galvanometer which could detect an un- 
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Fic. 4. The balanced-mixer spectrometer featuring a closed 
loop. Matched bolometers are used as detectors. Isolation between 
the detector bridge and the sample cavity bridge is important for 
low-power operation. 


2G. Feher, Bell System Tech. J. 32, 449 (1957). 
'S R. H. Dicke and R. H. Romer, Rev. Sci. Instr. 26, 915 (1955). 


RESONANCE 


OF A V CENTER 1511 
balance of 0.01 microwatt. Because the power incident 
on the sample was varied over a 70-db range from 275 
milliwatts down to 0.027 microwatt, it was necessary 
to have about 50 db isolation between the sample cavity 
and the microwave bias power (20 milliwatts) incident 
on the bolometers. The phase shifter was used to select 
x” or x’. For strong saturation of x’’ the ratio of x’ to 
x” may be greater than 20 db, requiring accurate phase 
settings to better than 1°. This was achieved by modu- 
lating (temporarily) the klystron frequency at 8 
cycles/sec after carefully tuning the klystron to the 
frequency of the sample cavity. Looking at the 8-cycle 
signal output of the Liston Becker Amplifier on a scope 
and tuning the phase shifter for a null of the 8-cycle 
signal, the phase can be accurately set for x’’, making 
use of the full sensitivity of the narrow-band detection 
scheme. 

To determine the microwave field H; required the 
measurement of the microwave power and the Q of the 
cavity. The power incident on the sample cavity was 
measured with a hp 430C Power Meter. The Q of the 
matched cavity was measured by placing the cavity 
reflection curve on the scope and measuring the 
frequency difference of the  half-power points 
(O_=0/Av). For the matched cavity 20,=Qo, where 
Qo is the unloaded Q. The experimental Qo’s varied 
from 4000 to 4500. The relationship yielding Hj for 
the TE, mode and a rectangular cavity of length C, 
height A, and volume V, is 


H?V, C\"77 QoP. 

lk ure 

83 A/T 1 2nv 

where P, is the power incident on the matched cavity 

and vo is the microwave frequency. The maximum 

error in the H, determination is about 15% coming 

primarily for the Q measurement. With the V 58 

Klystron the maximum H, obtainable at full power 
was 0.7 gauss. 

The samples were 0.060 in. thick and covered the 
end wall of the aluminum cavity. The samples were 
irradiated while at 78°K through the cavity end wall 
0.020 in. thick with a 50-kv x-ray tube with tungsten 
target. The irradiation times varied from 8 to 20 hours 
producing 10'* to 2 or 3X10"? Vx centers. Preliminary 
experiments with crystals of various thicknesses 
showed that the filtering of the x-rays by the cavity 
end wall was sufficient to assure a homogeneous 
concentration of Vx centers. 

Because of the field modulation combined with 
synchronous detection the spectrometer measured 
dx"’/dH, i.e., the derivative of x’’ as the dc field was 
swept slowly. If transitions within the individual spin 
packet are adiabatic and if no fast passage effects 
occur, then the derivative at its maximum will be pro- 
portional to x’’(wo). The condition most difficult to 
meet is the achievement of slow passage due to the 
modulation of the magnetic field. The slow-passage 
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Fic. 5. Absorption 
saturation curves of the 
Vx center at 78°K in 
KCl, KBr, and LiF. 
(Vr«x”"H;.) Hy is 
give by 1/y(7:T:)* and 
measures the onset of 
saturation. @ is the angle 
between the external 
field and the Vx center 
axis, and M, is the mag- 
netic quantum number 
of the two nuclei making 
up the center. 








condition is expressed by 


mH m<H1/T, (22) 


where w» is the angular modulation frequency, and 
H,, is the modulation amplitude. However, fast-passage 
effects can only occur when the spin packet is saturated 
[H,>1/y(T:T:2)']. Consequently, fast-passage effects 
will not occur if wmHm<1/y7T\(7iT2)'. For the Vx 
center in KCI typical values of JT, and T; at 78°K are 
1 wsec and 7 ywsec. With w»=50, slow passage requires 
H»<50 gauss, which is easily satisfied. It was possible 
in all the experiments at 78°K to avoid fast-passage 
conditions. 

For a rectangular cavity and the TE, mode, A, 
varies sinusoidally over the end wall. As a result, 
different centers different H,’s and different 
parts of the crystal will begin to saturate at different 
powers. One should then integrate the signal Vr over 
the values of H,; to which the Vx centers are exposed. 
Watkins" has shown that 


ae ” 
see 


” 


x =x" (wo, 1?) wv) 


X? ((A(H)?) Pw 


| as) 


1— roe 
(1+X*)?  ((H)?)m)? 


where X*=(H7*),/H? and H?=(H),t+AH,. The 
angular brackets indicate the average over the volume 
of the sample. Qualitatively one notices that the effect 
of a nonuniform //, is to flatten the saturation curve 
slightly at the peak. The effect then falls off for X¥<1 
and for X>>1. In these experiments a 3-in. diameter 
shield was used during irradiation, restricting Vx 
center formation to the center two-thirds of the cavity. 
In this case the second term in the bracket for Y=1 is 
about 4%. 


Experimental Results and Discussion 
Steady-state ESR Vx center absorption saturation 
curves were obtained at 78°K in KCl, KBr, NaCl, and 
LiF with concentrations close to 10!’ centers/cc. 
Because of the larger static line width and slower 
coloring rate not enough Vx centers could be obtained 
in NaCl to get a satisfactory signal-to-noise ratio over 


"G. Watkins, Ph.D. thesis, Harvard University (unpublished). 
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a large enough power range to obtain good saturation 
curves. The NaCl data will not be presented. 

Figure 5 shows the absorption signal saturation 
curves for KCl, KBr, and LiF at 78°K. The (@=45°, 
M,=1, LF) lines are shown for KCl and KBr while 
the (@=45°, M;=0) line is shown for LiF. @ is the angle 
between the Vx center axis and the external magnetic 
field, My is the nuclear magnetic quantum number of 
the two nuclei making up the Vx center, and LF means 
lower external field than for the center (M;=0) line 
in contrast to HF for higher external fields than for 
the center line. The curves indicate that in KBr the 
absorption susceptibility starts to saturate with an H, 
an order of magnitude larger than required for satu- 
ration in KCI or LiF. The value of H, is slightly smaller 
for KCI than for LiF, indicating a slightly larger value 
of (7,T:)' for KCl than for LiF. However, the LiF 
absorption signal decreases only a small amount for 
H,> H, while for KCI the absorption signal bends down 
approaching a slope of —1 where x”«1/H;? and 
Vre«1/H;. Qualitatively this implies that the Vx 
center in LiF has a smaller value of @ or a smaller ratio 
of spin packet to static Gaussian width than in KCl. 
The LiF saturation curve does not remain as flat for 
H,> Hy, as it does for the F center because the Vx 
center resonance is less inhomogeneously broadened 
than the F-center resonance. Unfortunately, not enough 
microwave power was available to measure the satu- 
ration curve for KBr over a range that was sufficient 
to obtain JT, and 7; separately. Only the product 7:T:2 
could be obtained for KBr. 

Saturation curves were obtained for different hyper- 
fine lines and for different values of 6. The detailed 
results will be presented first for KC] and KBr because 
of similar behavior and then for LiF because of its more 
complex behavior. 

Figure 6 shows the absorption saturation curves of 
the Vx center in KC! in which both nuclei were Cl*® 
and a Cl*-Cl*? center. In addition, the F-center ab- 
sorption saturation curve is shown for comparison. The 
graph shows that the saturation curves for the 35-35 
center and the 35-37 center are identical if we disregard 
the absolute intensity of the signal. In fact, data taken 
for all the lines in the hyperfine pattern for one @ have 
the same values of a and H; to within experimental 
errors (15%). Thus, within experimental error the 


Fic. 6. Vx _ center 
absorption _ saturation 
curves for a Cl*-C]* 
center and a C]§-C]37 
center at 78°K. An F- 
center absorption satu- 
ration curve at 78°K is 
also shown. 
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TABLE I. Vx center saturation parameters at 78°K 


From a 


AHL T2 
gauss 





@=45° 
6=90° 
KBr 

6=45° 
6=90° 


0.052 


0.049 1.0 


lines, arising from different isotope combinations but 
corresponding to the same @, have the same spin-packet 
width, and the same spin-lattice relaxation time. In 
KBr only H; could be determined from the saturation 
curves, but (as in KCl) it was the same to within the 
experimental error for all the hyperfine lines of a given 
6 for Br®-Br®!, Br*!-Br7®, and Br?°-Br?® centers. Table I 
shows the results for KCl and KBr. 

The Table indicates that 7; is a factor of about six 
larger than 72 for KCI at 78°K. The Table also indicates 
that the three determinations T2, T;, and (7172)! are 
consistent with one another to within the experimental 
error. It should be re-emphasized that the author was 
biased by previous work which suggested that the 
spin-packet width was 7)-limited. The data were first 
analyzed assuming 7,= 7». This led to an inconsistency 
between the a method and H; method. It will be shown 
below that in LiF the inconsistency is an order of 
magnitude greater. In addition, when the data were 
first analyzed, the 7,* method for directly determining 
T; had not been considered. It was only after assuming 
T,%T, that all the data could be satisfactorily ex- 
plained. One also observes that the value (7,7»)! has 
a @ orientation dependence. For KC] at 6=90°, (7,T»2)! 
is about half the value measured at 06=45°. Before 
attempting a complete understanding of the atomic 
processes contributing to 7; and T2, more data should 
be obtained for other values of 6 and over a large 
temperature range. In LiF more extensive data were 
taken for different values of 0. 

For the LiF Vx center, each different hyperfine line 
for a given @ behaves differently and we must explicitly 
specify the lines we are talking about. Figure 7 shows 
saturation curves for the @2=45°, M;=1 low-field (LF) 
line and the @=45°, M;= —1 high-field (HF) line. The 


Fic. 7. Vx center 
absorption — satura- 
tion curves in LiF at 
78°K. Shown are 
high-field M,=—1 
and low-field My, 
=+1 lines for 
6=45°. 
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From Hy From T2* 
Hy (T1T2)4 Z T1 
gauss sec sec usec 
0.019 : : 6.2 
0.035 coe cee 
0.28 
>0.39 


high-field line begins to saturate first; nevertheless, 
above saturation it does not bend down as quickly as 
the low-field line. Figure 7 also shows the saturation 
curve shapes, given H, and assuming 7;=7>2. The 
dashed extensions of the curves show what the assump- 
tion 7;= 7, would do to the saturation curves. Here, 
even more dramatically than in KCl, the assumption 
T,=Ty? is at variance with the experimental results. 

Figure 8 shows more saturation curves for different 
6 and different hyperfine lines. The 6=90° doublet in 
LiF refers to the two small lines on the low-field side 
of the center line. There are two separate features of 
the saturation curves for LiF. First, for a given @ the 
LF lines always saturate less readily than the HF lines. 
In addition, the difference in saturation behavior of the 
LF and HF lines is large when these lines are far apart 
in field (@=0°) and small when these lines are close 
(8=90° doublet). Secondly, there is also a complicated 
variation of the saturation behavior with @. The onset 
of saturation occurs at lowest power for @=0°, and at 
highest power for 6=60°. For 6=90° the saturation 
starts at a lower power than other angles except @=0°. 
Table II shows all the results and the saturation 
parameters. 

Table II indicates spin-packet widths varying from 
5 gauss down to } of a gauss corresponding to T»’s 
varying from 0.01 usec to 0.32 usec, while the T)’s vary 
from about 15 to 45 usec. For the Lif Vx center at 
78°K the 7,’s are much longer than the spin packet 
T»’s, the ratios varying from sixty to several thousand. 
The significant feature of the results is that 7, depends 
in a complicated way on the dc magnetic field and on 
the angle 6 between the field and the Vx center sym- 
metry axis. 7, values vary over a range of 30 while 7, 
only varies over a factor of about 3. The difference in 
the magnitude of inhomogeneous broadening for the 
LF lines (a>0.1) and the HF lines (a<0.1) is apparent 
from the Table. The interpretation of these results for 
the individual spin-packet width (1/y72) and for the 
spin-relaxation time 7, using atomic processes will be 
undertaken in a future paper. 

We can say the energy level pattern is different for 
high and low fields or for various @. It is possible that 
nuclear spin mixing may be faster for one field or 
orientation than another. Abragam and Proctor have 
found that spin mixing depends on field.'® The large 


16 A. Abragam and W. G. Proctor, Phys. Rev. 109, 1441 (1958). 
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TABLE II. LiF Vx-center relaxation parameters at 78°K. 


a 
4H 
Run and line a gauss 


6=0°, M,;=—1HF 0.021 0.18 
6=0°, M;=1 LF 0.16 1.4 

6=45°, M;=—1 HF 0.018 0.16 
6=45°, M;=1 LF 0.14 1.3 

6=45°, M;=0 0.030 0.28 
6=60°, M;=—1 HF 0.085 0.74 
6=60°, M;=1 LF 0.56 49 

6=60°, M;=0 0.063 0.55 
6=90°, M;=0 HF 0.041 0.36 
6=90°, M;=1 LF 0.058 0.51 
6=90°, M;=0 HF ose ree 


0.10 


hyperfine interaction for the LiF Vx center and its 
large static-line width indicate that the nuclei neigh- 
boring the Vx center “see” quite different local fields, 
depending upon which hyperfine line, HF or LF, is 
being observed. It is further noted a spin-spin inter- 
action between Vx centers cannot account for a 7» 
which depends on field and angle @. 

Since completion of this paper, measurements have 
been made at the General Electric Research Laboratory 
with W. Kinzig extending the data to 48°K, 20°K, and 
11.5°K for KCI and LiF.'* Experiments by pulse tech- 
niques have also been carried out in the liquid He range. 
In this forthcoming paper a development of the calcu- 
lation of spin-lattice relaxation for the Vx center due 
to spin-orbit coupling, hyperfine coupling, etc., will be 
considered. Because the electronic structural properties 
and parameters of the center are well known (see C 
and K), it is felt that a reasonable comparison between 
the experimental relaxation times and calculated 
relaxation times can be attempted. 


CONCLUSIONS 


It is possible to treat the saturation of inhomo- 
geneously broadened lines for the case in which the 
variation of the envelope distribution of spin packets 
over a single spin packet cannot be neglected. When 
the saturation curve can be measured over a sufficiently 
large power range, it is possible to determine separately 
the individual spin-packet width or T2, (71T:2)!, and 
the spin-lattice relaxation time, 7). Saturation curves 


Fic. 8 Vx _ center 
absorption _ saturation 
curves in LiF at 78°K. 
The high-field and low 
field lines for @=60°, 
6=0°, and 6=90° are 
shown. 
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16 T. G. Castner, Bull. Am. Phys. Soc. 4, 21 (1959). 





Method 
Ay 
Hy (T1T2)4 
gauss usec 





0.014 3.6 
0.048 ; 
0.020 

0.038 

0.027 

0.048 

0.076 

0.046 

0.028 

0.031 

0.020 


for the Vx center at 78°K indicate in KC] and LiF that 
the spin-packet width is not determined by 7;. 7; may 
be much longer than 7». For KCI all the hyperfine lines 
for a given @ have the same values of JT: and 7; to 
within experimental error. For KBr the product 7,72 
is the same for all hyperfine lines of the same @. In 
LiF 7, is different for different hyperfine lines de- 
pending on both @ and H. In LiF 7; depends on 6 and 
may have a small H» dependence. 
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APPENDIX A. EVALUATION OF THE 
INHOMOGENEOUS ABSORPTION 
SUSCEPTIBILITY INTEGRAL 


The integral (17) is evaluated in the following way. 
One first introduces a set of reduced frequency variables 
Xp=wo/Aw,, and x’=w’/Aw, and, in addition, defines 
a saturation parameter s such that =7yH/?7T,T>. (17) 
becomes 

x’ (wo) =— (24) 
V7 


3Xvd f x’ exp[—a?(x’— x»)? ]dx’ 
0 


1+ (x’—29)?+3s? 
Making the substitution y=x’—.» and (since x9> 10*) 
replacing the lower limit —x 9 by — , one obtains 
sxod f°” (y+2x0) exp(—a’y")dy 
VT 4» et ar 


The first term in the integral vanishes because it is odd, 
leaving an integral, which will be designated J(a,t), 


of the form 
” exp(—a’y’)dy 
a/)=2f a —, 


re (26) 
fy" 





PARAMAGNETIC 


where 2= 1+. This can now be written as 
* exp[—a?(?+y") |dy 
I(a,t)=2 exp (att) f ao as, 
0 Py? 


Defining the integral in (27) as J(a,t) and differenti- 
ating J(a,t) with respect to a and evaluating the 
integral, one has 


dJ (a,t)/da= — exp(—aF). (28) 


Next, integrating with respect to a, one obtains 


Ha) =J(0)+ f [—x exp(—a’?’) Jda. (29) 


This integral has the form 


Tv 
J (a,t)=—[1—(at) ], (30) 
2t 


where ®(a/)=2/x! /* exp(—y*)dy, the usual error 
function. Combining (30) and (27) with (25), we 
obtain 


wo \, exp(a’?) 
i ai 


x") = bron! — —_ —[1—(at) ]. 


Wa t 


APPENDIX B. SATURATION OF THE ENTIRE 
ENVELOPE OF SPIN PACKETS 


The absorption susceptibility may be written in the 
form 


Wo exp(a’/’) ‘ 

x’ (wo) =x0 ~~) ——— f exp(—.x")dx. (32) 
Awg l at 

If the substitution y’=.«?—a’? is made, (32) will take 


the form 
wo f*% yexp(—y*)dy 
”/ 7 
x (on) =xe—t f . 
0 


. (33) 
Awe [y?+a7r ]}} 


If a’??>1 the denominator may be expanded in a 
Taylor’s series, and the integral becomes 


1 wn 1 y 2 3 y 4 
—f yexp(—y)] 1 (~) + (~) 
al “9 cE 2 wae 8\ al 
5 y 6 
es () ae fay (34) 
16\ at 
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Integrating term by term, the saturated susceptibility 
becomes for a> 1 


a Wo 1 1 
Xs (w0) _ tyy~— —f{i- 
Awg al? 2a7t? 4a‘t4 
It is now convenient to evaluate x’’(wo) when there 
is no saturation at all (7,;=0, yielding ?=1). We start 
with 


Wo S 
Xus’ (wo) =xo— expla’) f exp(—.x*)dx. (36) 


WG 


This can be written as 


: wr Wo ; 2 ba 
Xue’ (wo) =—xo— explo?) - ne f exp(— “as| 
2 0 i 


Z Awg T 
(37) 


Expanding this for a<1 one obtains 


r! wo 2 4a’ 
Xue’ (wo) = ol )}) == ‘iaeeiale”s as ee | (38) 
9 


y Awe T 3m? 
One now takes the ratio of the unsaturated suscepti- 
bility xus’’(wo) found just above to the saturated 
susceptibility x,’’ (wo), getting 
Xus (wo) |  (1—2a/e'+---) 
———_——-=7'al’ 
x’’ (wo) 1 
| ina deees 
\ 2a*l? 
But 2>1 and hence 2~7yH271T2 and a= Aw,/Awe 
= T»«/V2T2; consequently (39) becomes 


v7 h 
Xus (wo) it 
=—"HT;T;* 
7 
Xs (wo) 


(1+ correction terms). (40) 


If the correction terms are small, this expression is 
nearly independent of the spin-packet width 1/7». 
Consequently, if the experimental conditions a<1 and 
alt>1 can be satisfied, expression (40) may be used 
with an experimental saturation curve to determine 
T; directly. For KCI at 78°K the parameters for the 
Vx center of a~0.05 and @Pinax~6 correspond to 
HA max~0.7 gauss and the correction term will be about 
3%. 





PHYSICAL REVIEW VOLUME 


813d, 


NUMBER 6 SEPTEMBER 15, 1959 


Low-Temperature Ultrasonic Attenuation in Tin and Aluminum*t 


DanreL H. Fitsont 
Department of Physics, University of California, Los Angeles, California 
(Received April 14, 1959) 


According to theory based on an ideal metal, the ultrasonic attenuation should be proportional to the 
square of the frequency and to the electrical conductivity of the metal. Experiments were performed to 
compare theoretical and experimental values of attenuation in a frequency range from 100 kc/sec to 1 


Mc/sec. 


A long wire sample coupled to a barium titanate transducer was suspended in a chamber, which in turn 
was placed in a liquid helium bath. A short train of sine waves was sent through the wire and the amplitude 
of successive reflections was measured as a function of temperature and frequency. The electrical conduc- 


tivity of the sample was also measured. 


Two samples were investigated. High-purity tin yielded experimental results in excellent agreement with 
theory. The attenuation in high-purity aluminum was proportional to the electrical conductivity but 


averaged 45% higher than theoretically predicted. 


INTRODUCTION 


T has been clearly established that at very low 

temperatures, and for very pure metals, the inter- 
action of the conduction electrons with the lattice may 
become the dominant factor in causing ultrasonic 
attenuation. Theories for the loss have been presented 
that are in agreement with each other at low frequencies, 
where the acoustic wavelength exceeds the electron 
mean free path. In this region it is predicted that the 
attenuation will be proportional to an electronic 
relaxation time. Steinberg’ points out that the relaxa- 
tion time appropriate to the acoustic loss may 
differ from that found from electrical conductivity 
measurements. 

Although the acoustic loss has been measured as a 
function of temperature on many occasions, the 
electrical conductivity in the same sample has been 
measured only at 4.2°K in the region governed by 
impurity scattering.‘ Therefore, there have been 
published no measured comparisons of the “acoustic” 
and electrical relaxation times as a function of 
temperature. 

Theory also predicts that for wavelengths greater 
than the mean free path, the attenuation will vary with 
the square of the frequency. All verifications of this 
have previously been carried out at 10 Mc/sec and 
above. 

This paper will describe some results found in tin 
and aluminum by measuring the acoustic loss and 
electrical conductivity in the same sample, as a function 
of temperature, with a view to establishing the relation 


* Based on a dissertation submitted in partial fulfillment of the 
Ph.D. requirements at the University of California, Los Angeles, 
1958. 

t This work was supported in part by the Office of Naval 
Research. 

t Now at the Department of Physics, U. S. Naval Postgraduate 
School, Monterey, California. 

1 W. P. Mason, Phys. Rev. 97, 557 (1955). 

2 R. W. Morse, Phys. Rev. 97, 1716 (1955). 

5M. S. Steinberg, Phys. Rev. 109, 1486 (1958). 

‘W. P. Mason and H. E. Bémmel, J. Acoust. Soc. Am. 28, 930 
(1956). 


between the acoustic and electrical relaxation times. 
The frequency range for the acoustic measurement was 
100 kc/sec to 1 Mc/sec. 


THEORY 


In order to propagate a longitudinal wave, the sample 
must be many wavelengths in diameter. At low fre- 
quencies, this becomes impractical. However, when the 
sample is only a fraction of a wavelength in diameter, 
a Young’s modulus wave may be propagated. This is a 
compressional wave characterized by lateral as well as 
longitudinal motion. 

The theory of ultrasonic attenuation due to electron- 
lattice interaction has been developed for longitudinal 
waves. Using either Mason’s model! of a viscous 
electron gas, or Morse’s model* based on a relaxation 
process, the theory may be simply extended to apply 
to a Young’s modulus wave. A complex Young’s 
modulus is found which may be written 


V*=Votj(8/15)wN Eora(1+»)?, (1) 


where w is the angular frequency, N the effective 
number of electrons, Ey the Fermi energy, 74 the 
acoustic relaxation time, and v Poisson’s ratio. Yo is 
the usual real elastic modulus. 

The attentuation then takes the form 


9 


4 
a=— — NEora(1+»)?, (2) 
15 pC* 


where p is the density of the sample, and C is the phase 
velocity of a Young’s modulus wave. If 7. is assumed 
equal to the relaxation time appropriate to electrical 
conductivity, then we may rewrite Eq. (2) and obtain 


(3) 


4 mo 
a=— —E,(1+)? 


15 pC? e 


where m is the effective electron mass, ¢ is the electronic 
charge, and o the electrical conductivity. 
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The velocity of such a wave is dependent upon the 
ratio of sample radius to wavelength, a/A. A formula 
for computing the phase velocity applicable for small 
values of a/d is given by Rayleigh® as 


C=C 1+7v?(a/r)*]; (4) 


Co is the velocity measured at a wavelength sufficiently 
large that no correction is needed. At the highest 
frequencies used, in the experiment to be described, 
a/X was less than 0.2. The resultant correction to C as 
obtained by (3), using this ratio of a/A, leads to a 10% 
change in the theoretical value of a. 


EXPERIMENTAL TECHNIQUES 


The pulse-echo method has been in common use for 
the measurement of the attenuation of high-frequency 
elastic waves. The same method may also be utilized 
at lower frequencies and requires somewhat simpler 
electronic circuitry. A block diagram of the circuit is 
shown in Fig. 1. A positive pulse triggered at 30 cps is 
applied to the plate of a one-tube Hartley oscillator. 
The duration of the pulse determines the length of the 
sine burst from the oscillator. This signal is then fed 
directly to an end-polarized, cylindrical barium titanate 
transducer, 0.20 cm in diameter, and 0.64 cm in length. 
The single transducer was used both as a transmitter of 


longitudinal waves, and as a receiver, after these waves 
were reflected from the far end of the specimen. It 
operated well over the entire temperature range. The 
receiver range. The received signal was then fed through 
a 20-db broadband amplifier to an oscilloscope and 


displayed without rectification. As many as one 
hundred successive echoes were visible at low attenua- 
tions. The display was either photographed and the 
attenuation measured at a later time, or an ex- 
ponentially decaying pulse was superimposed on the 
received pattern and the time for this pulse to decay 
6 db measured with an electronic counter. 

Two metals were investigated; the first, aluminum,® 
was cut from a rectangular rod of 99.995% purity. The 
rod was quartered, etched, and then drawn through 
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Fic. 1. Circuit used for acoustic measurements. 
5Lord Rayleigh, Theory of Sound (Dover Publications, Inc., 
New York, 1945), first edition, Vol. 1, p. 252. 
6A gift from Atomics International, obtained originally from 
Alcoa. 
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Fic. 2. Arrangement for mounting and supporting sample. 


stainless steel dies, until a wire of 0.177-cm diameter 
and of over a meter in length was obtained. 

The second metal used was tin, originally in the form 
of a 99.999% pure cast cylindrical rod. A wire of 
0.116-cm diameter and of over one meter in length was 
extruded by forcing the tin rod through a steel die. 

The ends of each sample were polished flat and 
perpendicular to the sample axis. Each sample was 
coiled into a loose helix and annealed. Coiling did not 
produce an adverse effect on the wave-train envelope 
or on the attenuation, for coil diameters greater than 
5 cm. If the coil diameter was reduced much below 5 
cm, the received pattern became distorted, and in- 
dividual echoes were lost in random reflected waves. 
Large increases in attenuation were noticed after 
coiling, but annealing always restored the attenuation 
to a value within 10% of that found in the straight 
sample. 

The sample to be measured was mounted in the 
sample holder as shown in Fig. 2. A point near one end 
of the sample was fixed by three pins. Fine wire was 
spiraled about three rods to form a helix of triangular 
cross-section concentric with the sample. Each turn of 
the sample was thereby supported by a corresponding 
triangle of wire. The barium titanate transducer was 
pressed against the sample by means of a spring washer. 
Non-Aq stopcock grease was used as a coupling film 
between the sample and transducer. 

The sample holder was mounted in an evacuated 
chamber which was placed within a helium Dewar 
system. Temperatures as low as 1.05°K were achieved 
by lowering the pressure above the liquid helium. An 
Allen-Bradley 100-ohm }-watt carbon resistor was 
used to measure temperatures above 4.2°K. 

After completion of the acoustic measurements, the 
resistivity of the sample was measured by clamping 
it at both ends in another sample holder, passing 
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Fic. 3. Measured attenuation at 575 kc/sec as a function of 
temperature in tin, with electrical conductivity in arbitrary units, 
represented by the solid line, superimposed. 


current up to 10 amp through it, and reading the 
potential drop on a White potentiometer. The potential 
drop was kept above 100 uv at all temperatures. 


EXPERIMENTAL RESULTS 


Two measurements of the acoustic loss vs tempera- 
ture were made in the tin sample. Three frequencies 
were utilized: 300 kc/sec, 575 kc/sec, and 1090 kc/sec. 
Most of the data were taken at 575 kc/sec. The attenua- 
tion at other frequencies was also measured, but only 
at 4.2°K and 1.05°K. A plot of attenuation measured 
at 575 kc/sec vs temperature is shown in Fig. 3. The 
electrical conductivity in arbitrary units, represented 
by the solid line, is superimposed upon the measured 
points. 

A magnetic field generated by a solenoid surrounding 
the Dewars restored the tin to the normal state below 
3.7°K, when desired. 

The residual attenuation was large, but whatever loss 
was present at 1.05°K in the superconducting state was 





Fic. 4. Measured attenuation at 540 kc/sec as a function of 
temperature in aluminum. Electrical conductivity is represented 
by the solid line. 
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considered as residual and subtracted from the loss 
found in the normal state. 

Both conductivity and acoustic loss were measured 
twice in the aluminum sample over a period of several 
weeks. Only two frequencies were utilized, 540 kc/sec 
and 1070 kc/sec. The magnitude of the residual 
attenuation was estimated from the shape of the 
attenuation curve. Figure 4 shows points plotted from 
attenuation measurements at 540 kc/sec. A solid line 
representing electrical conductivity in arbitrary units 
is superimposed. 

Table I summarizes the results in tin and aluminum. 
The variation in theoretical values of a/ f? for a given 
metal is due to the change in velocity with frequency. 


DISCUSSION 


Implicit in the formula for attenuation is the assump- 
tion that the Fermi surface is spherical. It is well known 
that tin does not have a simple Fermi surface. Neverthe- 
less, the value for a as found from Eq. (3) agrees well 
with the experimental results, assuming one free 


TaBLE I. Comparison of theoretical with experimental attenuation 
at 4.2°K. Residual attenuation has been subtracted. 


a/ f2(1075 db-sec?/cm) 
Experi- 
mental 


Frequency Theoretical 


Metal (108 sec) 


$51 

5.65 
16.6 
16.6 
16.8 
17.0 
17.4 
18.2 


540 
1070 
225 
300 
450 
575 
750 
1090 


Aluminum 


Tin 


electron per atom. This is probably because the sample 
was a small-grained polycrystal, and the measured 
attenuation represents an average over all crystal 
directions. Measurements by Bommel and Mason in 
single tin crystals show that the attenuation is strongly 
dependent on sample orientation.‘ 

It has been assumed that aluminum has a spherical 
Fermi surface with a density of three electrons per 
atom. Heine’ has shown that while most of the Fermi 
surface may be represented by spherical caps extending 
into the second zone, there are regions where the 
effective mass of the electrons is quite different from 
the free electron mass, and the Fermi surface may not be 
considered as spherical. The simple theory, based on 
free electrons, cannot be expected to be in quantitative 
agreement with experiments performed on polyvalent 
metals. The experimental results for aluminum average 
1.45 times the value found from Eq. (3) 

It may be concluded, however, that the attenuation 
found in tin and aluminum is proportional to the 
electrical conductivity and hence that the acoustical 


7V. Heine, Proc. Roy. Soc. (London) A240, 340 (1957). 
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relaxation time is proportional to the electrical relaxa- 
tion time. It is also apparent from Table I that the 
attenuation is proportional to the square of the fre- 
quency, at least in the case of tin.® 


8 Measurements at even lower frequencies by E. Lax show no 
deviation from the frequency-squared dependence of attenuation 
in aluminum. These measurements are to be published. 
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Dipolar Line Broadening and Enhanced Pseudo-Dipolar Moments 


RoBERT L. WHITE 
Hughes Research Laboratories, Culver City, California 


(Received April 24, 1959) 


The assumption of a strongly enhanced pseudo-dipolar moment in certain ferrimagnetic and antiferromag 
netic materials is shown to be incompatible with the paramagnetic resonance line widths in these materials. 
Why the concept of pseudo-dipolar moments is not applicable to line-width calculations is discussed. 


HE purpose of this communication is to point 

out that the assumption of a strongly enhanced 
pseudo-dipolar moment leads to erroneous predictions 
concerning the paramagnetic resonance line widths of 
certain ferrimagnetic and antiferromagnetic materials 
above their Curie temperatures. 

The concept of a strongly enhanced pseudo-dipolar 
magnetic moment was first introduced by Van Vleck! 
in a calculation of magnetic anisotropy in cubic ferro- 
magnets. The motivation was to thereby simulate, in 
various higher orders of perturbation, spin-lattice effects 
incompletely understood and therefore absent explicitly 
from a simplified Hamiltonian restricted to dipolar 
interactions. 

Anderson and Weiss? have shown that the dipolar 
contribution to line width, AH, of a paramagnetic 
absorption in the presence of exchange is given by 


AH = (10/3)(H,2/H.), (1) 


where H, is the broadening dipolar field and H, is the 
exchange field. For ordinary electronic dipoles, the 
dipolar contribution to H,” is given by 


H,2=5.1(g8n)2S(S+1), (2) 


where g is the Landé g-factor, 8 is the Bohr magneton, 
n is the number of spins per unit volume, and S is the 
spin quantum number of the dipole. For a pseudo- 
dipole, (g8n) of expression 2 will be replaced by (pg), 
where # is an enhancement factor, estimated variously 
by various authors for different situations, but generally 
about 50.!3 The exchange field H, of (1) may be calcu- 
lated from the Curie temperature of the material by 

1 J. H. Van Vleck, Phys. Rev. 52, 1178 (1937). 

? Pp. W. Anderson and P. R. Weiss, Revs. Modern Phys. 25, 269 
<o Suhl, Walker, and Anderson, J. Phys. Chem. Solids 
1, 129 (1956). 


Eqs. (28a) and (30) of reference 2. Note that H, is 
defined in this reference as the exchange field per 
exchange-coupled neighbor, not as total exchange field. 

The data on paramagnetic line widths of ferrimagnetic 
yttrium iron garnet (YIG), gadolinium iron garnet 
(GdIG), and antiferromagnetic maganese fluoride are 
plotted in Fig. 1. One observes that the line width 
becomes almost constant for temperatures well above 
the Curie temperature. Furthermore, as is shown in 
Table I, the observed line widths are not compatible 
with the assumption of a strongly enhanced pseudo- 
dipolar moment which in each case would lead to a 
theoretical line width from one to three orders of mag- 
nitude too great. The assumption of the intrinsic 
electronic moment leads to theoretical dipolar line 
widths somewhat smaller than the observed, reflecting 
probably the omission of crystalline field broadening 
and conductive losses from the line-width calculation. 














300 400 
TEMPERATURE, °C 


Fic. 1. Paramagnetic line widths above the Curie 
point for various materials. 
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TABLE I. Comparison of experimental and theoretical 
paramagnetic resonance line widths. 


AH (calc) 
Hy? Dipolar Dipolar 
Material H, Intrinsic Enhanced intrinsic enhanced AH (obs) 
5000- 350 
50 000 
12 500. 410 
125 000 
34 000. 465 
340 000 


YIG 7105 107 10%-10 50 
2.5 X10° 

2.5 K10" 

9.2 K104 107 = 10%10' 


GdIG 7X105 2.5 X10 


MnF; 


As Van Vieck! pointed out, the pseudo-dipolar inter- 
action was introduced to simulate in the ferromagnetic 
metals the combined effects of (1) spin-orbit coupling 
within each magnetic ion, and (2) the (incomplete) 
orbital quenching of each ion by purely electrostatic 
interaction with its neighbors. To be successful in 
simulating such a complicated situation, the trial func- 
tion had to depend (a) on spin state because of (1), 
and (b) on spatial configuration of near neighbors 
because of (2). The pseudo-dipolar interaction meets 
both of these requirements, and hence has been a useful 
concept. 

It is worthy of note, however, that effect (2) depends 
primarily on the position but not the spin state of 
neighbors, so one is inclined to suspect predictions 
arising from the pseudo-dipolar treatment which 
depend sensitively on spin states of neighbors. Predic- 
tions on line width fall in this category, and, as re- 
marked above, the paramagnetic resonance line width 
in (say) YIG is incompatible with the enhancement 
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factor (p~50) required to explain the magnetic anisot- 
ropy on a pseudo-dipolar model. The author also feels 
that the calculation of Clogston et al.’ of the scattering 
of a long-wavelength spin wave by fluctuations in the 
local pseudo-dipolar fields is based on a questionable 
premise. 

A further comment on the relation between pseudo- 
dipolar interactions and the role of near neighbors is 
afforded by the observation that in the garnets and 
ferrites the magnetic anisotropies observed are as large 
as or greater than the anisotropies observed in the ferro- 
magnetic metals, although the nearest neighbors in the 
insulating ferromagnet case are oxygens which play 
only an intermediary role in the superexchange 
interaction. 

Finally it may be noted that recent developments 
in the theory of the magnetic anisotropy of cubic 
insulating ferromagnets,‘ based explicitly on the crystal- 
line field splittings of the separate ion states, have 
rendered unnecessary the assumption of an enhanced 
pseudo-dipolar moment to explain any of the salient 
magnetic properties of these materials. 
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Nature of the Singularities in the Spectrum of a One-Dimensional Ionic Lattice* 
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It is known, by a theorem of van Hove, that regular lattices will have singularities in their frequency 
spectra, the type of singularity depending on the dimension. Although several studies of the one-dimensional 
lattice with Coulomb interactions have appeared, no one yet has succeeded in identifying the type of 
singularity that appears in addition to the inverse square root singularity, in the frequency spectrum. In 
this note we establish that for a one-dimensional crystal the singularity is of the form g(w)~A { (w—w;) 
XIn[1/(@—w;) ]}~+ where A is a constant and w, is the position of the singularity. 


OME years ago Broch,' and more recently Rosen- 
stock,’ obtained formal expressions for the fre- 
quency spectrum of a one-dimensional ionic lattice 
with both Coulomb interaction and nearest-neighbor 
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interactions. They found that the presence of Coulomb 
interactions introduces, besides the usual square root 
singularity, an additional singularity into the frequency 
spectrum. Neither of these authors was able to deter- 
mine the nature of the additional singularity. It is the 
purpose of this note to derive the form of the singularity 
by an application of Newton’s method. 

According to the results of Rosenstock we may 
expand the dispersion relation in the neighborhood of 
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the maximum frequency in the form 


pm (1—p)? In(1—p) J 
+0[(1—p)"], (1) 


=2— o[ 2&— 


2u?/w 1? 


where w, is the maximum frequency of a monatomic 
nonionic lattice, o=2e?/(aa*), where e is the charge, 
a is the force constant, a is the equilibrium interatomic 
distance, and § is a pure number. The frequency 
spectrum is given by the derivative relation 


wr dp 
g()=—- 2) 
dw do 
Since we wish to have w as the independent variable, 
the essential mathematical problem is to find a solution 
of the equation 


(1—p)? In(1— p) =A (w1—»), (3) 


which is valid in the limit ww; and p—1. The 
parameters A and w are defined, from Eq. (1), by 
A=8w/(om’w,?) and w:=w1,(1—a8s)}. 

As our first step we make the substitutions 


? 


1—p=e*", 


2A (w1— 


(4) 


w)=e" 
to simplify the form of Eq. (3). The resulting equation 
then reads 

set =e", (5) 


and we shall find an asymptotic solution valid in the 
limit and v— ». Taking logarithms of Eq. 


(5), we find 
Inx 
x—Inv -x(1- ) (6) 
x 


As x approaches infinity the term (In«)/x will approach 
zero; hence it is reasonable to choose as a zeroth 
approximation x»=v. We may derive higher approxi- 
mations by using Newton’s method, yielding 


Xn— In Xn v 


rary <x 


Xn+ (2a 
Vn 
= —(Inx,+v—1). 
a | 
In this way we find 


x,=0+1n0+y(v) 
=7+Inv+O(Inv/v), 
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where 
limy(v) =0 
v-7® 
In terms of the original parameters p and w, the solution 


of Eq. (8) reads 
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which leads to the expression 
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in the limit w — w;. Together with Eq. (2), this proves 
that for the frequency spectrum in the neighborhood 
of the singularity, 


p= 


9 
yr 
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introduces a weaker 
nearest neighbor 


interaction 
that due to 


i.e., the Coulomb 
singularity than 
interactions. 

Finally we note that if we substitute x, from Eq. 
(8) into Eq. (5) we have 


aye 71— e- =e *{ [14+ (Inv/v)+ (y(v)/v) Je 


which approaches zero as v becomes infinite demon- 
strating that x; is a solution in the asymptotic sense. 
It can also be shown by a more exact mathematical 
analysis that in the limit w — w), one can write 


d 2: A (w— w) } 
lim — 1s ] 
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as we have tacitly assumed in using the second line of 


Eq. (11). 
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Calorimetric optical absorption measurements at 4.2°K have been made on a representative series of a 
brasses over the wavelength range 0.23 to 4.0 microns using electropolished bulk specimens. Changes in 
the absorption spectrum below 6000 A are interpreted in the light of current theories concerning the band 
structure of noble metal alloys. The variation of the infrared absorptivity with residual resistivity shows 
that the impurity relaxation time for copper is anisotropic. This anisotropy increases on alloying and suggests 
that the Fermi surface becomes more distorted with increasing solute concentration. 


I. INTRODUCTION 


ELATIVELY little attention has been given to 
optical absorption measurements as a means of 
studying the electronic structure of alloys, particularly 
those involving the noble metals as solvents. Until quite 
recently, the only data relating to the latter were those 
given by early experiments on the copper-nickel,!” 
copper-zinc,’ and copper-aluminum‘ primary solid 
solutions. These measurements were all made on me- 
chanically polished surfaces, using the conventional 
polarimetric method of Drude® to obtain values of 
and k over a relatively restricted range (approximately 
0.4-1 «). Owing to the disturbed nature of the reflecting 
surfaces caused by mechanical polishing, the sensitivity 
of the Drude method to fine scratches on the samples 
and also owing to the fact that, at room temperature, 
the effects of lattice vibrations and the diffuseness of 
the Fermi distribution function smear out the transi- 
tions, the data gave very little detailed information on 
the changes in the absorption structure on alloying. 
According to Mott,® the main absorption edge in 
copper is due to electron transitions from the d-band 
of the crystal to unoccupied states at the top of the 
Fermi distribution, the minimum frequency of the edge 
corresponding to the energy separation between the 
two. Hence on alloying, the shift in the absorption edge 
should equal the associated displacement of the Fermi 
level, it being supposed there is very little change in 
the relatively low-lying d-band. The addition of zinc 
or aluminum, which contribute additional electrons to 
the conduction band, should therefore give a shift in 
the absorption edge to higher energy, i.e., shorter 
wavelengths. More recently, however, Friedel’ has pro- 
posed a modification of the Mott theory taking into 
account the screening by the conduction electrons of the 


! Lowery, Bor, and Wilkinson, Phil. Mag. 20, 390 (1935). 

2 Bor, Hobson, and Wood, Proc. Phys. Soc. (London) 51, 942 
(1939). 

’ Lowery, Wilkinson, and Smare, Proc. Phys. Soc. (London) 
49, 345 (1937). 

‘LL. McPherson, Proc. Phys. Soc. (London) 52, 210 (1940). 

® See, for example, M. P. Givens, in Solid State Physics, edited 
by F. Seitz and D. Turnbull (Academic Press, Inc., New York, 
1958), Vol. 6, p. 336. 

®N. F. Mott and H. Jones, The Theory of the Properties of 
Metals and Alloys (Clarendon Press, Oxford, 1936), p. 119. 

7 J. Friedel, Proc, Phys, Soc. (London) B65, 769 (1952). 


hole in the d-band. According to this view, the main 
absorption edge in copper should not differ sensibly 
from the excitation energy 3d’ — 3d*4s for the isolated 
Cut ion, the secondary structure in the region below 
3000 A® being associated with the excitation energy 
3d" — 3d*4p. Alloying should therefore not cause any 
appreciable shift in the absorption edge, apart from 
relatively small effects due to changes in Coulomb 
interaction between the screening charge and the lattice, 
exchange interactions, etc. 

As a result of the broadness of the absorption edges, 
the restricted frequency range covered, and the small 
number of alloy systems studied, the older work is not 
capable of discriminating between these theories in a 
satisfactory way or of providing useful information 
about the band structure of noble metal alloys in 
general. There is thus a great need for reliable optical 
absorption measurements, using the improved tech- 
niques of sample preparation and measurement now 
available, on an extensive series of alloy systems. The 
present work was undertaken to provide such data, 
and in this paper the results of experiments on the 
copper-zinc primary solid solutions, covering the wave- 
length range 0.23-4 yw, are presented. 

II. EXPERIMENTAL 

Measurements were made at liquid helium tempera- 
tures using the calorimetric technique developed for 
experiments? on the infrared absorptivity of copper and 
silver at 4.2°K. A schematic diagram of the apparatus 
is shown in Fig. 1. Light from a grating monochromator 
or suitable infrared filter enters the system through an 
electropolished stainless steel light pipe, which also 
serves to evacuate the inner assembly, thereby pre- 
venting significant gas conduction. A quartz lens focuses 
the incident energy at an angle of incidence of 15° onto 
the target, which is screwed into a copper stage contain- 
ing a carbon resistance thermometer and a 100-ohm 
manganin wire heater. The radiation is then reflected 
onto an absorber, which has a similar heater and 
thermometer and which is coated with palladium black 
so as to have substantially zero reflectance over the 
wavelength range of the present experiments. 

’ W. Meier, Ann. Physik 31, 1017 (1910). 

*M. A. Biondi, Phys. Rev. 102, 964 (1956). 
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Both the target and absorber stages are supported 
by thin walled stainless steel tubing connected to a 
heavy copper base in contact with the helium bath. 
This base also contains a heater and a thermometer, 
which are used in an electronic control circuit to 
maintain the base temperature constant to within 
about 0.0001°K. Fine copper wires provide the desired 
heat leaks from the stages to the base so that, with 
radiant energy incident on the system, temperature 
differences are set up across the heat leaks proportional 
to the power absorbed by the target and absorber. 
These temperature differences are measured by the 
carbon resistance thermometers. By reproducing the 
resistance changes by means of dc power applied 
through the respective heaters, the absolute absorp- 
tivity can be simply calculated from the formula, 
A=Pr/(Pr+Pa), where Pr and P, are the power 
required to reproduce the temperature rises of the 
target and the absorber, respectively. 

In applying this formula to calculate the absorptivity 
of the samples in the infrared, small corrections have 
to be applied to allow for the effects of scattered light 
leaking onto the target stage. Although this stray 
radiation is minimized by placing behind the target a 
blackened stop in good thermal contact with the base, 
the residual leakage still accounts for a few percent of 
the total absorption owing to the high reflectivity of the 
targets in the infrared. The appropriate corrections were 
determined by noting the apparent heat flux to the 
target stage with a dummy sample, thermally isolated 
from the stage, reflecting the incident radiation. 

To cover the wavelength range of these experiments, 
band pass interference filters were used in conjunction 
with a Nichrome glower for the region 3.0-4.0 microns. 
From 1-—2.5 microns, a Bausch & Lomb 250-mm mono- 
chromator with a grating blazed at 1 micron and a 
tungsten light source were employed. Below 1 micron 
a grating blazed at 0.3 micron was used, the tungsten 
source replaced by a high-pressure mercury arc lamp 
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Fic. 1. Schematic 
diagram of the op- 
tical absorption ap- 
paratus, which is im- 
mersed in the liquid 
helium bath. The 
symbols H and T 
refer to heaters and 
thermometers, __re- 
spectively. 
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Fic, 2. Optical absorptivity of a-brasses in the ultraviolet and 
visible wavelength regions. The ultraviolet absorption peaks 
move toward longer wavelengths (A-G) with increasing solute 
concentration. 


to cover the region from 0.23-0.6 micron. The intensity 
of the arc lamp was sufficient to enable the monochro- 
mator to be operated with good resolution (a window 
half-width of 60 A) over most of the visible and ultra- 
violet regions of the spectrum, while still maintaining 
adequate accuracy. Below about 0.28 micron some 
difficulty was encountered with scattered light in the 
monochromator. Although corrections were made for 
this effect, the absorptivity values in this region are of 
somewhat doubtful accuracy and should only be taken 
as indicating the trend of the actual behavior. 

The specimens used in these experiments were ma- 
chined from ingots, which were formed by induction 
melting appropriate quantities of high-purity copper 
and zinc in a helium atmosphere. These ingots were 
cast in graphite molds, cold worked and subsequently 
maintained at 800°C for 24 hours to remove coring and 
to assist in their homogenization. They were next sub- 
jected to further cold work in three mutually perpen- 
dicular directions (to prevent preferential grain growth) 
after which optical and residual resistance specimens 
were machined from them. The resulting samples were 
then sealed with brass shavings of the same composition 
in a Pyrex tube under a helium atmosphere and 
annealed. To prevent exaggerated grain growth, the 
duration and the temperature of this heat treatment 
were varied with solute concentration. By careful control 
of the annealing conditions, a grain size of about 1/100 
mm was obtained in all specimens, so that the measured 
properties can be taken with confidence to be charac- 
teristic of polycrystalline material of random orientation. 

The targets for the optical absorption measurements 
were all electropolished using an 80% orthophosphoric 


acid solution. By varying the voltage across the cell 


with changing solute concentration, satisfactory sur- 
faces could be obtained for all the specimens used 
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Fic. 3. Details of main absorption edge for a-brasses. The dashed 
lines indicate the extrapolation procedure used to determine the 
onset of the main absorption. 


in this work. After polishing, the specimens were washed 
in running hot and cold water to remove the phosphate 
film on the surface, rinsed and then immersed in 
distilled water, and finally transferred to a helium dry 
box where they were dried in a current of helium gas 
and mounted in the apparatus. The latter was evacuated 
and cooled to nitrogen temperature as soon as possible 
after the mounting operation was completed to mini- 
mize the danger of surface deterioration in the speci- 
mens. In this way quite reproducible absorptivity 
results were obtained, giving a reasonable assurance 
that properties measured were in fact those associated 
with the bulk material. 
Ill. RESULTS 

Absorptivity data for the various alloys in the visible 
and ultraviolet regions of the spectrum are shown in 
Fig. 2. The main absorption edges are shown in more 
detail in Fig. 3. Owing to the improved surface quality, 
the residual absorption beyond the edge is quite small 
even in the alloys with large zinc concentrations. This 
circumstance, together with the fact that the measure- 
ments were made at low temperatures, thereby elimi- 
nating the effects of lattice vibrations and the diffuse- 
ness of the Fermi distribution function,’ results in 
much sharper edges than had been previously reported. 
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Fic. 4. Infrared absorptivity of a-brasses. It should be noted 
that the absorption scale is one-tenth that used in the previous 
figure. 
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The infrared data for the alloys are shown in Fig. 4, 
which is drawn on a much enlarged scale compared to 
Figs. 2 and 3. It will be seen that the absorptivity drops 
quite sharply in the vicinity of 1 micron and then 
becomes substantially constant from 1.75 to 4 microns. 
If the latter values are plotted against the corresponding 
residual resistivities of the samples, the graph of Fig. 5 
results. It is of interest that our residual resistivity data 
are in good agreement with those of previous workers" 
as may be seen from Fig. 6. This agreement suggests 
that the metallurgical procedures used in the present 
work were such as to give reliable, well-annealed 
samples. 

IV. DISCUSSION 

(a) Absorptivity in the Visible and Ultraviolet 

Reference to Fig. 3 shows quite clearly that alloying 
moves the main absorption edge to shorter wavelengths. 
To determine the magnitude of the shift as a function 
of solute concentration, it is necessary to adopt a some- 
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Fic. 5. Variation of infrared absorptivity with residual resis- 
tivity of a-brasses. The dashed curve was calculated on the as- 
sumption of isotropic impurity scattering from the solute atoms 


what arbitrary procedure as a result of tailing in the 
data on the long-wavelength side of the edge. Accord- 
ingly, we have approximated the curves by straight 
lines after the manner shown by the dashed lines in 
Fig. 3 and have taken their point of intersection as the 
wavelength \» where interband transitions begin. The 
relative sharpness of the curves on the short-wavelength 
side of the edge, even for the more concentrated alloys, 
minimizes the errors involved in this procedure. 

Values of Ao as a function of solute concentration are 
given in Table I, from which the full curve of Fig. 7 is 
derived. The displacement of the edge, which is initially 
quite small, becomes linear with concentration for zinc 
contents in excess of three percent. Such behavior is 
not at all consistent with the original theory of Mott, 
according to which the shift of the edge 6 on alloying 
is equal to the change of the Fermi level, as computed 
from the rigid band model. In this case we have 


¢7dN\—! 
se=z f (—) dc, (1) 
o \dE 


1 Kemp, Klemens, Tainsh, and White, Acta Met. 5, 303 (1957). 
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for a concentration ¢ of solute having a valence Z 
relative to the solvent. Using the density of states data 
obtained from low-temperature heat capacity measure- 
ments on the a-brasses,' Eq. (1) gives for 6E the dashed 
line of Fig. 7. Clearly, it does not represent an adequate 
fit to the data. 

According to Friedel,'* the Fermi level in an alloy 
does not behave in the manner predicted by the rigid 
band model. Thus when zinc is added to copper, the 
additional valence electrons contributed by the solute 
are localized in the immediate vicinity of the zinc atoms 
in order to screen out their additional ionic charge. 
Because the screening distance is small, the copper 
atoms in dilute alloys see virtually the same potential 
as that in the pure metal. This is equivalent to saying 
that the Fermi level in dilute alloys does not alter 
appreciably. For more concentrated alloys there is a 
displacement of the Fermi level, but it is much smaller 
than that predicted by the older theory. Using the 
Thomas-Fermi model to treat the effects of screening, 
Friedel has obtained an expression for the shift in the 
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"1G. 6. Variation with solute concentration of 
residual resistivity for a-brasses. 


Fermi level 
Zq 
AE= . (2) 
gR coshgR—sinhgR 





where the screening parameter q is related to the density 
of states in the alloy by the relation 


g=4nr(dN/dE)o, (3) 
and where 
1/R=c/r,3, (4) 


r, being the radius of the atomic sphere for the solvent 
atoms. For copper, Eq. (3) gives g=1.12 (atomic 
units). 

If we retain the idea that the energy separation 
between the top of the d-band and the Fermi level 
corresponds directly to the quantum energy of the 
absorption edge, then the displacement of the latter 


12 J, A. Rayne, Phys. Rev. 108, 22 (1957). 
13 J. Friedel, in Advances in Physics, edited by N. F. Mott 
(Taylor and Francis, Ltd., London, 1954), Vol. 3, p. 461. 
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TABLE I. Values of Ao, the wavelength corresponding to the 
main absorption edge, in a-brasses. 


Solute concentration Xo Eo 6E 
at. % (A) (ev) (ev) 





0 5790 2.15 0 
2.44 5760 0.011 
4.96 5720 0.027 

10.00 5600 0.073 

14.40 5480 0.122 

21.00 5350 0.177 


should be given by Eq. (2). As may be seen from Fig. 7, 
which shows this relation plotted for g=1.12, there is a 
slightly better fit to the data in that the theoretical 
curve has zero slope as c— 0 and at least gives the 
right order of magnitude for the energy shift. No reason- 
able choice of g, however, gives complete agreement 
with the experimental results. Hence if the present 
picture is correct, the screening must differ considerably 
from that given by the simple Thomas-Fermi model. 
This, of course, is not unlikely, since the applicability 
of the Thomas-Fermi model to a stituation where the 
calculated screening length is considerably smaller than 
the ionic radius is somewhat doubtful. 

The above treatment does not take any account of 
the screening of the hole in d-band caused by the 
ejection of an electron to the Fermi level. Friedel’:!*:"4 
has considered this effect in some detail and has con- 
cluded that the main edge in pure copper essentially 
corresponds to the excitation energy 3d'° — 3d%4s for 
the isolated Cut ion, corrected for the Coulomb attrac- 
tion between the hole and the lattice, exchange effects, 
etc. Of these the most important appears to be the 
former, so that on alloying the displacement of the 
edge should be given by the change in the Coulomb 
interaction. Hence, using Eqs. (2) and (4), we have!® 


bE=(p+Zc)dAE/dc 
2(p+Zc)ZgR? (5) 
- 3c(qR— 1)? exp(gR) 
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4 J. Friedel, Ann. phys. 9, 158 (1954). 
16 Atomic units are used in these equations. 
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TABLE IT. Values of Zz, and AEz, for a-brasses.* 


Solute concentration bEZn AEZn 
(at. %) (ev) (ev) 


5 0 0 

10 0.048 0.058 
15 0.095 0.107 
20 0.143 0.147 


* Referred to 5% alloy. 


where p is the number of conduction electrons/atom 
of solvent. 

The magnitude of the screening constant g may 
differ from the Mott value given by Eq. (3), because 
of exchange effects and also because of the more diffuse 
nature of the screening of the hole in the d-band relative 
to that of the zinc ions. Taking g=1.64, given by 
Friedel as the screening parameter for copper if exchange 
effects are considered, we obtain the dashed curve 
shown in Fig. 7 for the variation of 6E with solute con- 
centration. Although the fit to the experimental data 
is better than that given by the Mott formula, Eq. (1), 
the concentration dependence does not appear to be 
correct. If we reduce g to take screening differences into 
account, no improvement is effected. It must therefore 
be concluded that the Friedel theory, although correct 
in a qualitative way, does not correctly describe the 
screening in an alloy for finite solute concentrations. 

It is of interest to compare the displacement of the 
optical absorption edge with the changes in the energy 
of solution for the copper-zinc system. We define 4, 
the energy of solution per atom A at a concentration 
c in the alloy A.wBa_.n, by the relation 


A w-iBa_epntA — AewBa_on-1+Bt+Ea, (6) 


N being a large number. Experimental data are usually 
interpreted in terms of a quantity E4’ given by 


Aun 1Bq ontaA ——? A wBa_ontEa’, (7) 


which is related to E, by the Duhem-Margules 


equation™ 
E,' “s Cc 
E,-—~ [ eid(—). (8) 
1—c 0 1—c 


Friedel has shown that the changes in energy of solution 
on alloying AFz,, should be the same as the displace- 
ments 6£z, of the optical edge, apart from differences 
in screening between a hole in the d-band and a zinc ion. 
Using the data of Herbenar et al.!® giving Ez,’, AEzn 
can easily be computed from Eq. (8). The values of 
AFz, relative to the alloy for c=0.05 are given in 
Table II, together with the associated values of 6/z,, 
taken from the smooth curve of Fig. 7. The agreement is 
very good, being within experimental error, and suggests 
that the above-mentioned differences in screening are 
small. 

Reference to Fig. 2, Curve A, shows that the absorp- 


16 Herbenar, Siebert, and Duffenback, J. Metals 188, 323 (1950). 
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tion in pure copper exhibits secondary structure at 
about 3200 A. It has been suggested by Friedel’ that 
this structure is associated with a screening of the hole 
in the d-band corresponding to the excitation energy 
3d — 3d°4p for the free Cut ion. One would thus 
expect relatively little change in the form of this part 
of the absorption curve on alloying, or at least a shift 
in the same direction as the main edge. Such is clearly 
not the case; the peaks‘of the absorption curves, which 
are designated by the letters A to G in Fig. 2, move 
from 2900 A in pure copper to 4300A for a thirty 
percent zinc alloy. It thus seems unlikely that Friedel’s 
theory is correct. 

The interpretation of the absorptivity data below 
the main edge is complicated by the fact that transi- 
tions can take place both from the d-band to the Fermi 
level and from the Fermi level to the next zone, pre- 
sumably in the (111) directions. There is consequently 
considerable uncertainty about the wavelength at which 
the latter process is initiated. In particular no quantita- 
tive information can be obtained regarding its change 
with alloying, except that it definitely shifts to longer 
wavelengths. 

The shape of the absorption curve for the 30% alloy 
suggests that both transitions begin simultaneously at 
about 4800 A, i.e., 2.6 ev. On the basis of this hypothesis, 
we can make an estimate of the energy gap for copper 
along the [111] direction. Let us first adopt the rigid 
band model and suppose that the Fermi surface in 
copper, although nonspherical, does not touch the zone 
boundary and that on alloying contact takes place at 
a zinc concentration of about 10%.'7 Once contact has 
occurred, transitions from the center of the zone face 
can no longer take place. However, one may reasonably 
suppose that, providing the contact area does not 
become too large, the minimum energy for excitation 
from the Fermi surface into the next zone will still 
approximate the gap in the [111] direction for pure 
copper. Thus, the value of the gap is roughly 2.6 ev. 
Various theoretical estimates of the energy gap are 
available but none of these can be taken very seriously 
as a result of the extreme sensitivity of the value to the 
starting ionic potential used in the calculations. Thus, 
for example, Howarth,'* using the cellular method, 
found a value of 0.24 ev with a Hartree Cut potential 
and 1.7 ev with a Hartree-Fock potential. The present 
figure is much lower than the value of 7.65 ev deduced 
by Pippard’® from his work on the anomalous skin effect 
in copper. 

If instead of the above model we adopt that of 
Cohen and Heine,” which assumes that contact between 


'7 At this value the variation of y with zinc concentration 
exhibits a marked change in slope. See reference 12. 

‘8D, J. Howarth, Proc. Roy. Soc. (London) A220, 513 (1953). 

1 A. B. Pippard, Phil. Trans. Roy. Soc. (London) 250, 325 
(1957). 

22M. H. Cohen and V. Heine, in Advances in Physics, edited 
by N. F. Mott (Taylor and Francis, Ltd., London, 1954), Vol. 7, 
p. 395. 
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the Fermi surface and the Brillouin zone already occurs 
in copper across the {111} faces, then the gap across 
these faces decreases on alloying. This model would 
thus predict that the resulting absorption edge moves 
to longer wavelengths with increasing zinc concentra- 
tion, in apparent agreement with the present work. 
Provided that contact between the zone face and the 
Fermi surface is maintained on alloying, the energy 
gap for the alloy with c=0.3 would be 2.6 ev according 
to this view. ‘To obtain the gap for pure copper, it is 
necessary to use the formula” 


Asp— (Agp) solvent 
e/a—1 
e/a 


For (A, p)sotute= 5.8 ev, (Acp)scivenr™ 3.8 ev, c=0.3, Eq. 
(9) gives 


1 
(14) Ca. )aame (Agp) solvent ]. (9) 


Asp— (Acp)cu= 0.9 ev. (10) 


Since the p level is presumed to lie below the s level, 
the gap for copper is thus estimated to be 3.5 ev. 

As will presently be demonstrated, however, the in- 
frared absorptivity data suggest that the Fermi surface 
must become more distorted on alloying. This would 
necessitate that the gap increase and hence that the 
corresponding absorption edge move to shorter wave- 
lengths. The present data are inconsistent with such a 
conclusion and hence we must depend on the rigid band 
model for an estimate of the gap. 


(b) Absorptivity in the Infrared 


From Fig. 4 it can be seen that the absorption data 
beyond the main absorption edge exhibit considerable 
tailing out to approximately 1.75 uw. In this region the 
curves all behave in a strikingly similar way, being 
roughly parallel to each other. It has been suggested 
by Biondi’ that, in the case of copper, the tailing may 
result from the effect of indirect transitions. Such a 
suggestion cannot be correct, since phonons having an 
energy of approximately 1 ev would be required in such 
processes and these cannot exist in a metal. Again, it 
has been suggested that the effects of electron-electron 
interactions could account for the observed behavior. 
This explanation also cannot be correct, since the fre- 
quency dependence of such an absorption process would 
in a simple theory be given by”! 


A electron-electron © @", (11) 


which exponent is far too small to account for the above 
data. It is equally impossible to ascribe the effects to 
collision damping, since for copper the plasma frequency 
w@»= (4rNe*/m)} corresponds to a wavelength of 1400 A. 
At the moment, no plausible explanation can be ad- 
vanced for the observed behavior. 

21 V. P. Silin, J. Exptl. Theoret. Phys. U.S. S. R. 34, 707 (1958) 
[translation: Soviet Phys. JETP 7, 486 (1958) ]. 
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Beyond 1.75 microns the absorption is constant, 
within experimental error, up to the wavelength limit 
of the present data; viz., 4 u. In this range, the absorp- 


tivity can be written as the sum of three contributions, 
A=A surface t 1 volume 4 1 impurity) (1 2) 


where the effects of these can be considered independ- 
ently provided that the scattering probabilities for each 
process are small. This condition is presumably satisfied 
for the systems studied here. 

The first term of Eq. (12) results from scattering of 
the electrons by the surface of the metal. It has been 
shown” that for the case of diffuse scattering, which 
assumption best fits the experimental data at microwave 
frequencies,” the surface absorption of a cubic metal is 
given by an expression 


(13) 


, Oe 
A surface = 4U/C, 


where 2 is an effective electron velocity defined in terms 
of an integral over the Fermi surface of the metal. For a 
nonspherical energy surface v is a function of the 
crystallographic direction, but if we consider poly- 
crystalline specimens, an equation of the form given 
by (13) will still apply if for the velocity we take a 
simple geometrical average, 3, given by 


o=( 1/4n) { o(a)aa, 


In order for Eq. (13) to hold, the condition 


(14) 


5;= (mc?/4nNe*)!, 


(15) 


must be fulfilled, w) being the frequency corresponding 
to the onset of the internal photoelectric absorption. It 
is easily shown that this equation is satisfied for the 


Wo>>wW>V, 6s ; 


wavelength region being considered. 

The volume absorption arises from the interaction of 
the electrons with the zero-point vibration of the lattice. 
This interaction has been considered by Holstein™ and 
also by Gurzhi.2® Using the usual approximations of 
conductivity theory, vz., spherical energy surfaces, 
acoustical isotropy, etc., Holstein was able to obtain 
an explicit expression for the volume absorption of the 
form 


A votume= (m*/aNe*)*(1/Tett), (16) 
where Tes iS an effective relaxation time related to the 
usual conductivity relaxation time 7 at a temperature 
T>¢ by the equation 


(17) 


6 being the Debye temperature. Although no closed ex- 


Tett= 31 7/8, 


2 T. Holstein, Phys. Rev. 88, 1427 (1952); M. I. Kaganov and 
V. Slezov, J. Exptl. Theoret. Phys. U. S. S. R. 32, 1496 (1957) 
[ translation: Soviet Phys. JETP 6, 1216 (1957) ]. 

*8R. G. Chambers, Proc. Roy. Soc. (London) A215, 481 (1952). 

*4'T, Holstein, Phys. Rev. 96, 535 (1954). 

25R, N. Gurzhi, J. Exptl. Theoret. Phys. U. S. S. R. 33, 451 
(1957) [translation: Soviet Phys. JETP 6, 352 (1958) ]. 
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pression for the volume absorption of a material with 
a nonspherical Fermi surface and arbitrary acoustical 
properties is available, one may anticipate that it is 
possible to define a suitably averaged relaxation time 
which will result in an equation similar to (16) for 
this case. 

An expression for the impurity absorptivity may be 
obtained by solving the Boltzmann equation for a 
periodic electromagnetic field assuming the existence of 
a relaxation time for impurity scattering. In the near 
infrared, where wr>>1, the absorptivity may be ex- 
pressed in the form (see Appendix) 

A impurity = (r){1/7)(Ne, ‘Ma)pimpurity, (18) 
where pimpurity is the residual resistivity due to impuri- 
ties and (7), (1/7) are averages of the impurity relaxa- 
tion time defined by the following integrals over the 
Fermi surface: 


vdS 
(r)= f red / f wdS; (1/r)= f — / f odS. (19) 
T 


From the Schwarz inequality it may readily be shown 
that 


(r)1/r)21 (20) 


’ 
the equality sign holding for isotropic impurity scatter- 
ing. The quantity 1/m, appearing in (18) is the usual 
inverse effective mass, which for cubic metals is given 
by the equation 


1 1 
— = —__—_ J vend, 
m, 12n°*Nh? 


N being the density of conduction electrons. 
In the region of interest, the absorptivity of the 
alloys may thus be expressed in the form 


A=A surface A volu met Bpimpurity, 


(21) 


(22) 
where 


B=(r){1/r)(Ne?/am,)}. (23) 


It may readily be shown that the first two terms change 
only slowly with solute concentration, so that the im- 
purity term dominates the variation of the absorptivity 
with residual resistivity. In particular, the initial slope 
of the curve of absorptivity versus residual resistivity 
should be essentially equal to 8, the value of 8 for 
pure copper. Using the value of m,=1.45m for copper 
derived from the room temperature measurements of 
Schulz*® and setting (r){1/r)=1, we obtain a slope 
from (22) which deviates markedly from the experi- 
mental one. No reasonable choice of m, effects a material 
improvement in the fit to the data, a value of 
m./m=0.61 being required to obtain complete agree- 
ment with experiment. Such a figure is clearly untenable 


26 L. G. Schulz, J. Opt. Soc. Am. 44, 540 (1954); J. Phil. Mag. 
6, 102 (1957). 
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and we must conclude that (7){1/7) is greater than 
unity, i.e., the impurity scattering relaxation time in 
copper is anisotropic. 

This result is of considerable importance, since it 
constitutes experimental proof that much theoretical 
work done in the past is based on an unjustified assump- 
tion. In particular, the analysis by Olsen and Rod- 
riguez”’ of their magnetoresistance data on copper at 
low temperatures cannot be correct since they assumed 
7 to be isotropic. Their conclusions relating the mag- 
netoresistance coefficients to the geometry of the Fermi 
surface in copper are consequently open to question. 
Furthermore, if the impurity relaxation time in copper 
is anisotropic, there is very little doubt that the phonon 
relaxation time at high temperatures is also anisotropic. 
Thus treatments of transport processes, which make 
the assumption of isotropy for phonon scattering, are 
probably not accurate and conclusions about the Fermi 
surfaces in metals derived from them must be viewed 
with some suspicion. 

As shown in the Appendix, 7 will be isotropic and the 
product (7){1/7) will be unity, if the Fermi surface is 
spherical and the scattering probability P(k,k’) depends 
only on the angle between k and k’. No general proof 
exists for the conditions under which the second require- 
ment will hold, but it seems reasonable to suppose that 
it will be true for spherical energy surfaces and that the 
product (7){1/r) is only unity in such cases. Further 
one may assume that this product is a monotonic func- 
tion of the deviation of the Fermi surface from sphericity 
and that the larger the distortion of the surface, the 
greater will be the value of (r){1/7). For pure copper 
our data indicate that (7){1/7)=1.30+0.05, which 
suggests a highly distorted Fermi surface, in agreement 
with the results of other experiments.” 

Now according to Cohen and Heine the Fermi surface 
in copper alloys becomes Jess distorted with increasing 
solute concentration. In particular, they claim that, 
at the limit of the a phase of systems such as the one 
studied here, the Fermi surface is essentially spherical 
and just touches the zone boundary. If this were true 
and our assumptions regarding the isotropy of 7 are 
correct, then the experimental data of Fig. 5 would tend 
towards the dashed curve”® defined by Eq. (22) with 
(r)(1/r)=1 and mg=m,=1.37m. The latter value corre- 
sponds to the effective mass parameter of the spherical 
parts of the Fermi surface in copper” as derived from 
cyclotron resonance measurements.” This behavior 
clearly does not take place, and we must conclude that 
the energy surface does not become more spherical on 
alloying. In fact the experimental data are consistent 
with a (r)(1/r) which increases on alloying, leading 


27 R. L. Olsen and R. Rodriguez, Phys. Rev. 108, 1212 (1957). 

28 The theoretical dashed curve has been corrected for changes 
iN A surface ANd A volume On alloying. 

2 Tt is assumed that the curvature of these sections of the surface 
does not change appreciably on alloying. 

%® Langenberg, Kip, and Rosenblum, Bull. Am. Phys. Soc. 
Ser. IT, 3, 416 (1958). 
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to the conclusion that the Fermi surface becomes more 
distorted with increasing zinc concentration. 

This result again raises the fundamental difficulty, 
_ pointed out by Cohen and Heine, of reconciling the 
Hume-Rothery rules and results of specific heat meas- 
urements on the a-brasses with the shape of the Fermi 
surface for copper derived by Pippard.’® If we rule out 
the possibility of an increase in the sphericity of the 
Fermi surface on alloying, then the energy gap in the 
[111] direction cannot decrease with increasing zinc 
concentration. The present data for the visible and 
ultraviolet show, however, that it cannot increase. Thus, 
we can only conclude that the gap remains essentially 
constant, i.e., the rigid band model still applies. In this 
situation the only possible way of obtaining consistency 
between the experiments is to suppose that the Fermi 
surface in copper is only close to the {111} faces of the 
first Brillouin zone, but does not actually touch them. 

This hypothesis also presents difficulties, since the 
measurements of Schulz have shown that for copper 
ma/m:>1, where the effective thermal mass m;, is 
given by the relation 


m,/m=7/Ytree- (24) 


7 is the coefficient of the linear term in the heat capacity 
of a metal at low temperatures, and Yfree is the corre- 
sponding coefficient for an electron gas of the same 
density. Cohen*! has demonstrated that a ratio greater 
than unity indicates a Fermi surface having considerable 
area of contact with the zone boundaries, which is 
contrary to our previous conclusion. It is possible of 
course that the value of m, for copper, deduced from 
Schulz’ measurements, is inaccurate because of the 
difficulty of making experiments on thin films. If such 
is not the case, one must entertain doubts either about 
our assumptions regarding the relation between the 
shape of the Fermi surface and the anisotropy of 7 or 
about the applicability of the one-electron theory to 
alloys at all. Before making such inferences, however, 
more experiments on other alloys (including those based 
on silver and gold) should be made. 


V. CONCLUSIONS 


From the results of optical absorption measurements 
on the a-brasses at 4.2°K, it is concluded that the main 
absorption edge in copper is due to electron transitions 
between the d-band and the Fermi level. The observed 
shift of the edge on alloying is found to be inconsistent 
with the original theory due to Mott. Although the 
Friedel theory gives a better fit to experiment, there is 
still quantitative disagreement regarding the depend- 
ence of the shift on solute concentration. From the 
secondary structure of the absorption in the ultraviolet, 
it is concluded that the gap across the {111} faces in 
copper is approximately 2.6 ev. The variation of infrared 


absorptivity with residual resistivity indicates that 


31M. H. Cohen, Phil. Mag. 3, 762 (1958). 
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anisotropy of the impurity relaxation time increases on 
alloying and that the Fermi surface becomes more dis- 


torted with increasing solute concentration. 
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APPENDIX 


Assuming the existence of a relaxation time 7, the 
general form of the Boltzmann equation is 


0 of com. 
hen = “(8+ ne) OW, a. 


al Or nh T 


(Al) 


For a plane electromagnetic wave polarized along the 
x direction this reduces to 


af Aa Of f—hfo 


: (A2) 
al “ Ok, T 


which has a periodic solution of the form 


a 


Using Eq. (A3), we find for the current 
2e 
=—— | frdk 
(29)8 J 
(A4) 


e Ofo &, 
Sb Ms 
4h? OEN Ok, 1+iwr 


For a cubic metal this reduces to 


f=fo-———R 
: h Ok, 


2 fe 


e &, 
Jen fi eyrltRe( a. (as) 
12nth? 1+iwr 


dSdE/|V,E|, 


e 6, 
fr ViE| Re (— — )as, (A6) 
4 nth? 1+iwr 


where the integration over E has been carried out. Thus 
we have 


e |V.E | rdS 
yi -(— -f- pene ~)6. 
122*h? 1+w?r? 
es | 


Since dk= (A5) may be written as 


(A7) 


| Viele} —) 
1+w?*r? al ; 
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Identifying the real and imaginary parts of the dielectric 
constant e= (n+7k)? with the coefficients of these terms 
we have, since k>>n in this region, 


e ) 7rd§ 
nkw=— -f | V.E|———_, 
69h? 1+w*r? 


(A8) 
rds 


e 
n?’—k?=1— — fivei— . 
39h? 1+??? 


For w7>>1, which condition holds for all alloys studied 
here, Eqs. (A8) reduce to 


e vdS 
nk=— f= 
Or hiw* T 


e 
Mhaicasse: f odS. 
39h 


At normal incidence we have 


(n—1)*+k? 4nk 
{=1-— - ~—, (A10) 
(n+1)*+k? 


so that from Eqs. (A9) 


1 wdS i 
A= f- - /(fes) . (A111) 
(e?/129°h)! T 


Since 
1 1 


(e?/122°h) 
f 7vdS 


Pimpurity = ’ 


Eq. (A10) may be written in the form 


ae \ i 
A impurity - (— ) ( fus) (r)(1/7)pimpurity, (A12) 
120'h 


where 


vdS 
(r)= f ras / f vas; a/n)= f - / fras. 
T 


Equation (A12) may further be reduced if we introduce 
the inverse effective mass defined for a cubic metal by 
the equation 


1 1 1 
— = _ f vende — — fos, (A13) 
m, 12n°*Nh? 127° Nh 
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N being the density of conduction electrons. Substi- 
tuting Eq. (A13) into Eq. (A12) we find 

A impurity ae (r){1/7) (Ne?/am,)! Pimpurity- 


The Schwarz inequality gives the result 


v 2 
f rd S f -dS2 ( f sds) ' 
. 


whence from Eq. (A12) 
(r)(1/7)21, 
the equality holding for isotropic scattering. 


If we write f= fot+@, the equation giving the rate 
of change of f due to collisions is 


(A14) 
(A15) 


(A16) 


. 
(“) ‘ f [4(k’)—#(k) JP(k,k’)ds’, (A17) 
ot 


collisions 
where 
dS 
P(k,k’)dS= | (k’| V| k)|*———.. 
4h | V.E| 

For spherical energy surfaces and for P(k,k’) depending 
only on the angle 6 between k and k’, we may write 
¢=k,c(E), assuming the field to be along the x axis. 
Equation (A17) then reduces to 


1 ee," 
—= fraw(: -—)as’, 
7(k) k, 


ani f P(6)(1—cos6) sinédé, 


0 


(A18) 


which depends only on the magnitude of &, i.e., 7 is 
isotropic. 

Since y,(r)=e'*'u,(r), the general form of the 
matrix element determining the scattering probability 
P(k,k’) is 


(k’| Vk)= fe ®-ruyt(Q)V Oun(adde. (A19) 


Clearly, the explicit dependence of this integral on k 
and k’ is determined by the nature of u(r) and “,-(r). 
As far as is known, there is no general proof that 
P(k,k’) will depend only on @ and that 7 will be isotropic 
if and only if the Fermi surface is spherical. Such an 
assumption, however, does not seem unreasonable and 
we suppose that the product (7){1/r) is a measure of 
the sphericity of the Fermi surface. 
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The superlattice representation is described. This representation provides occupation probabilities which 
form a well-defined non-negative definite analog of the phase space density. The superlattice basis functions 
are orthonormal wave packets characterized in general by three parameters: a band index, a wave vector, 
and a superlattice position vector. The crystal momentum and position are automatically coarse-grained 
so as to satisfy the Heisenberg uncertainty principle. The Bloch and Wannier representations are special 
cases of the superlattice representation for particular choices of the superlattice parameter. Joint functions of 
position and momentum, such as the local current density, can be represented as the expectation value of 


Hermitian operators through the use of this representation. 


1. INTRODUCTION 


HE Bloch representation forms an adequate 

foundation for most theoretical studies of the 
properties of metals in the one-electron approximation. 
A drawback of this representation is that in placing 
emphasis on the wave vector, it completely ignores 
spatial localization. A Bloch electron is uniformly dis- 
tributed throughout the entire crystal. By forming 
appropriate linear combinations of Bloch functions, it 
is possible to form a spatially localized set of basis 
functions, the Wannier functions.!? Unlike the atomic 
orbitals, the Wannier functions are an orthogonal set. 
These functions are characterized by lattice positions; 
there is no dependence upon wave vector. 

The superlattice representation bridges the inter- 
mediate region wherein both position and momentum 
play a role. If plane waves are used as the initial basis 
set instead of Bloch functions, the superlattice repre- 
sentation has no band index and the occupation prob- 
abilities of superlattice states then provide a quantum- 
mechanical analog of the phase space density. This 
representation allows a quantum-mechanical treatment 
of inhomogeneous processes such as thermal transport.’ 

The traditional approach to localizing momentum- 
dependent properties is related to the Wigner phase 
space density. For the current density and other joint 
functions of position and momentum, this approach 
gives expressions which are functionals of the wave 
function. The superlattice representation provides a new 
approach to the localization problem. Legitimate 


* Based on part of a thesis submitted by E. C. M. to the 
Graduate Faculty of Cornell University as partial fulfillment of 
the requirements for the degree of Ph.D. 

t Now at John Jay Hopkins Laboratory for Pure and Applied 
Science, General Atomic Division of General Dynamics Corpora- 
tion, San Diego, California. 

t Now at Scientific Laboratory, Ford Motor Company, Dear 
born, Michigan. 

'G. H. Wannier, Phys. Rev. 52, 191 (1937). 

2 J. C. Slater, Phys. Rev. 76, 1592 (1949), 

3E. C. Melrvine, following paper [Phys. Rev. 115, 
(1959) ]. 

4. Wigner, Phys. Rev, 40, 749 (1932). 
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quantum-mechanical operators are defined whose expec- 
tation values represent observable localized properties. 

Since many common operators have off-diagonal 
elements in the superlattice representation, the density 
matrix formalism is useful.*:* For a pure state repre- 
sented in terms of a set of basis functions {y,} by the 


wave function 
Vi=)'. ay" ny 


the density matrix p is given by 


(1.1) 


(n| p|n’)=an'an™. (1.2) 


In the case of an incoherent superposition of states W' 
with probability p,;, the density matrix is 


(n|p|n')=>d; pidniay™. 


Thus the diagonal elements of the density matrix give 
state occupation probabilities in the n-representation, 
while the off-diagonal elements yield information 
about the phase correlations between the expansion 
parameters. 

The Schrédinger equation for the density matrix is 


dp/dt= —ih- [Hp], (1.4) 


where the brackets denote the commutator. The ex- 
pectation value of any operator M is given by the 
trace of Mp: 


(M)=Tr(Mp)= > nw (n\p|n’)(n'|M!n). 


(1.3) 


(1.5) 


This may be considered a more fundamental definition 
of the density matrix than Eq. (1.3). 


2. THE SUPERLATTICE REPRESENTATION 


Since the superlattice representation is based on a 
unitary transformation similar to the Wannier trans- 
formation,' a brief review of the Wannier representa- 
tion will be given. 

®U. Fano, Revs. Modern Phys. 29, 74 (1957). 


®R.C. Tolman, Principles of Statistical Mechanics (Oxford Uni- 
versity Press, London, 1938), pp. 327 ff. 
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Consider a cubic crystal with side A=na, containing 

n® jattice cells of side a. The natural basis functions 

for a quantum-mechanical treatment of this crystal are 


the Bloch functions 


Yon (1) = Mx (4) exp(ik-r), (2.1) 


where k is an allowed wave vector in the fundamental 
Brillouin zone and 4,,, normalized in the crystal, has 
the periodicity of the crystal lattice. Periodic boundary 


¢(r—I1)= 


There exists a proof’:* that Wannier functions are the 
most localized orthonormal set which may be formed 


¢ 
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Fic. 1. Localization in position and momentum in three repre- 
sentations. (a) Localization in the Bloch representation: a fine- 
grained k, no knowledge of position in crystal. (b) Localization 
in the Wannier representation: a fine-grained I, no knowledge of 
crystal momentum. (c) Localization in the SL representation: a 
coarse-grained K and L, no knowledge of x and 2. 


from atomic functions. (The sequence of transforma- 
tions is atomic functions to Bloch functions to normal- 





—¢iK. 


ox.(r)=- 


AiNns 


Thus the SL functions constitute a set of orthonormal 
wave packets (Fig. 2). The wave vector characterizing 
the SL function is the coarse-grained K, an allowed 
wave vector inside one SL cell. The SL position vector 


7 B.C. Carlson and J. M. Keller, Phys. Rev. 105, 102 (1957). 

® See also W. Kohn and S. Michaelson, Proc. Phys. Soc. 
(London) 72, 301 (1958) which appeared after this work was 
completed, 


1 sin{w(x—1,)/a] sin[w(y—l,)/a] sin[w(s—/,)/a] 
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conditions are used. y is a band index. Use of Bloch 

functions gives a fine-grained knowledge of the crystal 

momentum fk by sacrificing all knowledge of the 

location of the electron in the crystal [Fig. 1(a) ]. 
Bloch functions are utilized to build Wannier func- 

tions in the following manner': 


¢,(r—l) =n? ye Wo (r) exp(—ik- I). 


For free electrons, retaining a ghost lattice, 


ain’ sin[ r(x— 1,)/A] sin[r(y—l,)/A] sin[a(s—1,)/A] 


(2.3) 





ized Bloch functions to Wannier functions.) Wannier 
functions sacrifice all knowledge of the crystal momen- 
tum to gain this localization to a lattice cell [ Fig. 1(b) ]. 

A superlattice (SL) containing V* cells of side A is 
now superimposed on the crystal lattice.’ The wave 
vector k is then broken up into a coarse-grained wave 
vector K and the residue x. While K is an allowed wave 
vector inside one SL cell, the residue x has components 
of magnitude not more than one-half the spacing 
between SL wave vectors [Fig. 1(c) ]. 


k=K-+x, 
K,;=24N;/A; 


(= 2nn,/A; 


NV,=0, #1, ---+(n—N)/2N, 


; (2.4) 
n;=0, +1, ---+(V—1)/2. 


1=X, ¥, 2. 


There are (n/N)*K vectors and N*x« vectors. The 
allowed SL translations are V* in number: 


L=M,A; M,;=0, +1, ---4(N—1)/2. (2.5) 


Superlattice functions are formed by utilizing a Wannier- 
like transformation to introduce L vectors in place of 
x vectors: 

L). 


@xi(r)= VN! > Wxix(r) exp(—ix: (2.6) 
XK 


The orthonormality of SL functions may be demon- 
strated from the definition just as in the case of Wannier 
functions. 

For free electrons, 


1 sin{a(x—L,)/A ] sin[a(y—L,)/A] sin[w(s—L,)/A ] 
sinfa(x—L,)/A] sin[a(y—L,)/A] sin[w(s—L,)/A] 





L characterizes the envelope function which is centered 
on a SL cell and resembles a Wannier function in shape. 


® Both the crystal lattice and the superlattice are treated as 
being cubic. Since the latter is purely a mathematical construc- 
tion, the shape of the SL unit cell is independent of the physically 
determined shape of the crystal lattice unit cell. The extension 
of this formalism to the case where one or both of these unit cells 
is of a more general shape is apparent. The quantities n, V, and 
(n/N) are integers, and for convenience are taken to be odd 
integers, 
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The size and shape of the SL cell determines the spacing 
of K and L vectors. In the cubic superlattice used here, 
the spacing between successive K; is 2x/A, and between 
successive L; is A. The Heisenberg uncertainty principle 
is obeyed by the SL position 1 and the pseudomomen- 
tum AK: 


(AhK,)(AL;) ~h(w/A)(A/2)>h. (2.8) 


The shape of the envelope function will closely re- 
semble the free-electron envelope displayed in Fig. 2 so 
long as #,x(r) varies slowly with k so that it may be 
well approximated by u,x. 

Special cases of SL functions occur when we choose 
A to match the lattice spacing or the crystal size: 

(a) A=a. In this case, K=0, x=k, N=n, and 

Wannier functions result. 

(b) A=A. In this case, K=k, x=0, V=1, and Bloch 

functions are retained. 

To find matrices in the superlattice representation 
most easily, the unitary transformation from the Bloch 
representation will be developed. If M® is a matrix in 
the Bloch representation and M&%" the corresponding 


SL matrix, then 


M®“= SMBS" (2.9) 


is satisfied, where 
(v’K’L’| S|», K+«) =~? exp(ix-L)éxxd,. (2.10) 


Examples of commonly encountered matrices are 
the crystal momentum and the energy, both diagonal 
in K and v: 


(vKL| p| vVKL’) =AK6,1.+.\V% > hex exp(ix-L), (2.11) 
K 
(vKL| | VKL’) = £,(K)éu1.+NV,4,(K)-> x 
K 


XexpLix:(L—L’)]+---, (2.12) 


where the Bloch energy £,(k) has been expanded in a 
Taylor’s series about k= K and only the first two terms 
retained. Since the sums over « are finite, it is possible 
to rewrite them: 


N33 «x expLix: (L—L’) ] 
K 
=-—iViN > exp[ix-(L—L’)], (2.13) 
K 


which gives the analytic expression (1-component) : 
(w/A) 
—1(—1) (eer be IA — = 
sin[a(Lz—Lz)/A | 
XK (1—6L2b2/)btyty dnt. (2.14) 
It is also possible to write Eq. (2.13) symbolically as 
—iVL6L1, (2.15) 


where 6,1,’ is the Kronecker delta function. 
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In the use of matrices it is tacitly assumed that orders 
of integration may be altered with impunity so that 
the product of two matrices is the matrix of the product 
operator. When the matrices involve r and p this is not 
always true. To illustrate this, the plane wave repre- 
sentation yields an off-diagonal matrix for x, diagonal 
matrices for p, and [.,x]. The matrix element of x 
may be written symbolically as the derivative of a 
delta function, and if left in this form the matrix 
operations yield the correct result : 
[pz ] 
(k{ [p:,x]|k’)= 
The same arguments hold true in the superlattice 
case for matrices which are represented by6,,/dxx’VLOLL’. 


The symbolic form is more revealing for the evaluation 
of matrix products, and will be used here (with appro- 


—ih; 
Y (2.16) 
— thd. 




















x=0 x=A 











x=-A 


Fic. 2. One-dimensional free-electron superlattice function. 


priate discretion). 
(vKL| p| »’K’L’) =A(K—iV 1) 611dxx'5,,'; 
(vKL| E| »’K’L’) =(£,(K) —iVxE,(K)-Vit-:: 


X ori bxK by. 


(2.17) 


(2.18) 


In treating the electron-phonon interaction one uses 
a second-quantized formalism. In order to express this 
interaction in the superlattice representation, a second- 
quantized superlattice will be required. We shall 
indicate second-quantized operators by the tilde, JV. 

Superlattice annihilation and creation operators for 
electrons are given by linear combinations of the corre- 
sponding operators in the Bloch representation : 


bxi=N3S bxy. exp(ix-L). (2.19) 


In second-quantized language, 6,x1*b,«1 is the number 

operator, while the operator 6,x1/*d,xx will be called the 

“moving operator” since its action is to move an elec- 
M , 

tron from the state vKL to the state vKL’. A general 

matrix corresponds to an operator in occupation number 
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space, 


M= a >. (vKL| M | v'K’L’)bx1*by xv. 


*KL »’K’L’ 


(2.20) 


The superlattice annihilation and creation operators 
for phonons are similarly given by linear combinations 
of the corresponding operators for plane wave states, 


aaui=N-3 > AQ+e exp(iE-L), (2.21) 
E 


where q=Q+ is a phonon wave number and ) is the 
polarization index. The inverse transformations are 
easily found. 

In summary, the superlattice representation has basis 
functions which are wave packets made up of Bloch 
functions. Superlattice functions are characterized by 
a coarse-grained wave vector and position vector. They 
are the most localized orthogonal set which will still 
retain a wave-vector dependence. If one wishes ortho- 
gonal states with a dependence upon both position and 
momentum, these are the optimal functions to use. 


3. LOCAL OPERATORS 


Due to the uncertainty principle, position and mo- 
mentum are not independent quantities. Consequently 
a meaningful joint density function of position and 
momentum cannot be written. However, there are 
quantum-mechanical analogs of the phase space density 
which reduce to a joint density function in the classical 
limit. One of the many possible analogs is the so-called 
Wigner phase space density first used by Wigner and 
Szilard* and Dirac,!® considered in detail by Moyal" 
and others, and recently extended by Barut.” The 
Wigner density ®(P,R) is a Fourier transform of the 
correlation, 


@(P,R)= (2eh)* f dr(R—Sr|o| R+40) 
Xexp(ihP-r), 


(3:1) 


where R and P are parameters, not operators, and 
(r|p|r’) is the density matrix in the position representa- 
tion. Delta-function normalization is used. This func- 
tion may also be written in terms of the plane wave 
representation density matrix (k|p|k’) as: 


#(P,R)= (2nh)-* f dk(ir"P+4k| pl) 'P— 1k) 


Xexp(ik-R). (3.2) 
This function has some of the properties of a phase 
space density.‘ In particular, integrating over P gives 


“0 P. A. M. Dirac, Proc. Cambridge Phil. Soc. 26, 376 (1930). 
J. E. Moyal, Proc. Cambridge Phil. Soc. 45, 99 (1949). 
2 A.O Barut, Phys. Rev. 108, 565 (1957). 


AND 
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the occupation of the state R, and vice versa: 


J dP @(P,R)=(R|p|R); 


far &(P,R) = (hP| p|h“P). 


The Wigner function will not necessarily give the correct 
expectation value of a function of both position and 
momentum. Nor is ®(P,R) necessarily positive definite. 

A natural generalization of the Wigner density, in 
the case of crystals, is the “Bloch-Wannier” density 
D,(k,I), which involves crystal momentum and lattice 
position in place of momentum and position parameters. 
The density may be defined in terms of the density 
matrix in the Bloch representation (vk| | »’k’) or in the 
Wannier representation (vl| p|»’I’) : 


D,(k) =n- & (v, 1— 42) p| v, 14-32) exp(ik-) ; 
a 


(3.4) 
D,(k,l) =n ¥ (v, k+4«! p!», K—3x) exp(ix-I). 
K 


Box normalization is used. The Bloch-Wannier density 
is the true band-theory analog of the Wigner density 
since it is related to the two natural orthonormal sets 
in a crystalline potential. Analogously to Eq. (3.3), the 
following equations hold: 


> D,(k,D = (ol p| vl) ; 
k 


(3.5) 
> D,(k,D) = (vk! p| vk). 
1 


Throughout the rest of this section, the crystalline 
potential will be ignored. Its effect may be included by 
the generalization of the Wigner density to the Bloch- 
Wannier density, and the superlattice representation 
KL to the full superlattice representation vKL. 

It is desirable to have well-defined spatially localized 
functions of momentum. In particular a_ transport 
theory must yield current densities j(r,/), energy densi- 
ties e(r,/), and thermal current densities w(r,/), and these 
quantities must be defined with care quantum-mechan- 
ically. It is illuminating to observe the functions which 
result if one performs the usual probability operation 
in P, using the Wigner density: 


(3.6) 
(3.7) 
(3.8) 


g(r,t)= —ef* (r,OYr,), 
j(r,0) = —e(y*vwt+yviy*), 
€(t,t) = gmmLy*r'y +2 (vy): (WW) +yo"Y*], 
w (r,t) =zemly*vo'p + 2(v*y*- v) (vy) + (v*Y*) (0%) 
+ (vp) (oY*) +2 (wy-v*) (WY) tov) 


In these expressions, e is the magnitude of the electronic 


(3.9) 





NEW 


charge and the velocity operator v is given by 


v=m'(—ihV+eA/c), (3.10) 


where m is the electronic mass and A is the vector 
potential. 

These are not the simplest real expressions which 
may be formed by putting together the appropriate v 
operators, but they are the simplest ones which satisfy 
the continuity equations for charge and energy: 


dq/dt+div j=, 
0¢/0t+div w=0. 


(3.11) 
(3.12) 


Notice that all but the first of these functions are not 
the expectation values of quantum-mechanical operators 
but are instead functionals of the wave function. This 
is because the uncertainty principle prevents these 
spatially localized functions of momentum from being 
observables. Charge density presents no difficulty since 
it does not involve momentum. 

One of the most valuable features of the SL repre- 
sentation is that the superlattice occupation prob- 
ability is a phase space density analog. In addition, it 
allows the formation of legitimate quantum-mechanical 
operators which represent localized properties of the 
crystal. Any operator whose matrix in the plane wave 
representation is (k|.M|k’) can be put into correspond- 
ence with a local operator having a superlattice matrix 


(3.13) 


(KL| M (Ly) | K’L’)= (K| M7 | K’)é.én1 


0? 


which has the same functional dependence on the wave 
vectors but which has nonzero elements only for one 
value of L and L’. 

The localization, of course, is only in the Wannier 
sense. The probability density of a SL function (Fig. 3) 
is approximately one-half contained in the central SL 
cell (/r;|<4A) with an additional one-fourth being 
inside the first node (|r;, <A). It would be pleasing to 
work with a square envelope such that all the prob- 
ability density was in the central SL cell, but this 
would destroy the orthogonality of the basis states so 
essential to a manageable theory. 

The superlattice occupation operator, 


(KL| P(Ko,Lo) | K’L’)= bxxdxx,dt1d11,, (3-14) 


A 


Fic. 3. Probability density for a one-dimensional 
free-electron superlattice function. 


QUANTUM-MECHANICAL 


REPRESENTATION 


\ 
~ 
\ 


\ 


(a> 


WIGNER DENSITY SUPERLATTICE OCCUPATION 


Fic. 4, Mixing of plane-wave density matrix elements to form 
the Wigner density and the superlattice occupation. The Wigner 
density mixes elements on a skew diagonal intersecting the main 
diagonal at h (P,P). The superlattice occupation mixes elements 
in a block centered at (K,K). 


has as its expectation value the occupation of the 
superlattice state KoLo. This provides a quantum- 
mechanical analog of the phase space density. The super- 
lattice local operators for current density, energy 
density, and thermal current density have diagonal 
matrices in the SL representation, and so their expecta- 
tion values are given by the normal probability opera- 
tion in K using the SL occupation probability as a 
substitute for the phase space density. 

The local functions traditionally used can be derived 
from the Wigner density. The SL local functions are 
derived from the SL occupation probability. So to 
compare the traditional localized functions with the 
expectation values of the SL local operators a com- 
parison of the Wigner density and the SL occupation 
probability will suffice. Using box normalization, these 
may be expressed in terms of the plane-wave density 
matrix: 


&(P,R)= (nh) > (k\ p| k’)6G(k+k’), A-P) 
kk’ 
Xexpli(k—k’)-R]; 
(KL{ p| KL)=N-3 y A (K+«! p|K+x’) 


KK 


(3.15) 


Xexpli(xn—x«’)-L]. (3.16) 
The meaning of these equations is made clear by 
observing that the Wigner density mixes elements on a 
skew diagonal of the plane wave density matrix while 
the superlattice unitary transformation mixes elements 
in a block near the main diagonal (Fig. 4). The mixing 
of matrix elements is similar in the position representa- 
tion. The application of a plane-wave density matrix 
formalism based on the Wigner density is rendered un- 
manageable by the manner in which off-diagonal 
elements of widely ranging k and k’ are mixed. The SL 
approach, on the other hand, remains manageable. 

The superlattice and Wigner density approaches will 
now be compared point by point. 


(1) The SL occupation is an observable. The SL 
local functions are expectation values of operators with 
well-defined operational meanings. While the Wigner 
density is the expectation value of an operator, the 
derived local functions of momentum are functionals of 
the wave function and are not observables. 

(2) The SL occupation, energy density, and prob- 
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ability density are non-negative definite. The Wigner 
density and derived energy and probability densities 
may take on negative values. 

(3) In both approaches the continuity equations of 
charge and energy are satisfied. 

(4) Both the SL occupation and the Wigner density 
are quantum analogs of the phase space density, which 
they approach as 4-0. If one uses the Schrédinger 
equation as the equation of motion, either quantity 
produces the usual transport-equation field terms. 

(5) The occupation probability in the momentum or 
position representations may be regained by summing 
over one of the variables of the Wigner density. The 
superlattice occupation has this property if the SL 
density matrix is diagonal in this variable of summation ; 
otherwise the off-diagonal elements are mixed in. 

(6) The Wigner density mixes elements on a skew 
diagonal of the momentum density matrix. The SL 
unitary transformation mixes together a small block 
of elements near the element (K,K). This leaves the SL 
representation open to approximation when matrices 
vary slowly with k and k’, and allows a clear distinction 
between classical and quantum effects. 


The most striking attributes of the SL representation 
are mentioned in items (1) and (6). Item (1) gives the 
SL the advantage of being a well-defined formalism. 
Item (6) gives it the advantage of being an easily 
applied formalism giving useful results. 


4. SUMMARY 

The superlattice representation has as basis func- 
tions orthonormal wave packets of Bloch functions 
characterized by a band index, a coarse-grained wave 
number K, and a superlattice position L. The unitary 
transformation from the Bloch representation is easily 
applied, and resembles the Wannier transformation. 
The localization of the basis functions in space resembles 
that of the Wannier functions in shape, but the super- 
lattice functions are centered on a superlattice cell 
rather than a crystal lattice cell. The Bloch and Wannier 
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representations are special cases of the superlattice 
representation. Most of the commonly encountered 
operators have off-diagonal matrix elements in the 
superlattice representation and so the density matrix 
formalism is useful. Diagonal operators whose expecta- 
tion values represent spatially localized functions of 
momentum are easily defined in this representation. 
A second-quantized formalism may be established, 
defining creation and annihilation operators for super- 
lattice electrons. A population of phonons may similarly 
be represented in terms of a second-quantized super- 
lattice representation based on plane waves instead of 
Bloch functions. 

The Wigner density and the customary quantum- 
mechanical spatially localized functions of momentum 
were discussed in relation to the superlattice quantities. 
The superlattice provides two major advantages over 
the use of the other phase space density substitute. 
Firstly, the superlattice occupation probability and 
superlattice local operators are well-defined quantum- 
mechanical observables, while the Wigner density and 
the derived functions are functionals of the wave 
function, and do not represent observable quantities. 
Secondly, the superlattice formalism is more easily 
applied, due to the manner in which Bloch-representa- 
tion density matrix elements are mixed. 

Due to the nature of quantum mechanics, all repre- 
sentations are but models upon which to base physical 
pictures. A representation is considered appropriate 
or “natural” if a description in terms of state occupa- 
tion probabilities suffices. For homogeneous processes 
the plane wave representation is natural because the 
behavior of the diagonal density matrix elements is a 
good approximation to the complete description of the 
quantum-mechanical behavior. The phases may be 
ignored. For inhomogeneous processes the SL repre- 
sentation plays the same role: the behavior of the 
diagonal elements of the SL density matrix provides 
a good approximate description of inhomogeneous 
behavior. 
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A quantum-mechanical transport equation corresponding to the classical Boltzmann equation is developed. 
Classical transport theory in the presence of a temperature gradient involves the phase space density, 
which has no meaning in quantum mechanics due to the complementarity of position and momentum. 
The superlattice representation allows the development of a quantum transport equation from the density 
matrix Schrédinger equation, through the introduction of irreversibility by standard methods. The quantum 
transport equation governing the electron occupation probability in the superlattice representation is 
derived for impurity scattering and for phonon scattering. This equation has the same field terms as the 
classical Boltzmann equation, but involves the discrete coarse-grained wave number and position of the 
superlattice representation. The scattering terms involve transitions between different superlattice states, 
and thus include scattering processes which move electrons from one superlattice cell to another. The 
solution of the classical equation is affected very little by these quantum effects, for all temperature gradients 


which might reasonably be encountered. 


1. INTRODUCTION 


HERMAL transport in metals has been treated 

by many authors and in several recent review 
articles.!~* In all of these treatments, the classical phase 
space density is used. The complementarity of position 
and momentum makes a joint density of these quanti- 
ties a meaningless function in a quantum-mechanical 
treatment. The usual rationale for using the classical 
function is provided by a discussion of the properties 
of wave packets,’ to give estimates on the validity of 
the classical treatment. 

The classical Boltzmann transport equation is derived 
from classical mechanics. The scattering expressions 
involving ‘Golden Rule” transition probabilities are 
then inserted. These transition probabilities are the 
result of an irreversibility approximation imposed on a 
result of quantum-mechanical perturbation theory. 
This patchwork approach is somewhat unsatisfying. It 
is an amalgam of classical and quantum expressions, 
whereas the transport equation should result from the 
addition of nothing to the Schrédinger equation but the 
approximation of irreversibility. The classical transport 
equation involves the classical phase space density 
applied to wave packets by qualitative arguments, 
whereas if this is a valid procedure it should be carried 
out in an explicit quantitative manner. 

In addition, it is known that the results of calculations 
based on the classical transport theory do not agree 


* Based on part of a thesis submitted to the Graduate Faculty 
of Cornell University as partial fulfillment of the requirements 
for the degree of Ph.D. 

t Now at John Jay Hopkins Laboratory for Pure and Applied 
Science, General Atomic Division of General Dynamics Corpo- 
ration, San Diego, California. 
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edited by N. F. Mott (Taylor and Francis, Ltd., London, 1953), 
Vol. 2, p. 28. 
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Berlin, 1956), Vol. 14, p. 198. 
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with experimental evidence at low temperatures.® While 
there are a myriad of possible reasons for the discrep- 
ancies, it is of interest to ascertain whether the use of 
the classical equation might be one. Knowledge of the 
validity conditions applying to the use of the classical 
equation are then of interest. 

The task which is set forth is the following. One 
wishes to derive a quantum transport equation to 
correspond to the classical Boltzmann equation in the 
presence of an electric field and a temperature gradient. 
(Magnetic fields will not be considered in this work.) 
This equation should involve some quantum-mechanical 
analog of the classical phase space density. It should be 
derived in its entirety from the Schrédinger equation, 
with the introduction of irreversibility. It should con- 
tain quantum corrections to the first order in h and 
the temperature gradient. And the conditions under 
which it reduces to the classical equation should be 
determined. 

The principal problems which will be met in such a 
treatment are two. One is irreversibility; the other is 
complementarity. Irreversibility is a problem both in 
classical and in quantum mechanics, since the mechan- 
ical equations of motion in both cases are invariant 
under time reversal, while a transport equation governs 
the time-irreversible progression of a density function. 
Complementarity is a problem unique to quantum 
mechanics, and is due to the Heisenberg uncertainty 
relation between position and momentum. 

The fundamental new physical quantity in this work 
is the new representation used.® The superlattice (SL) 
representation provides a phase space density substitute 
to be used in the density matrix formalism.’ 


5R. Berman and D. K. C. MacDonald, Proc. Roy. Soc. 
(London) A209, 368 (1951); and Proc. Roy. Soc. (London) A211, 
122 (1952). 
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Section 2 derives the field terms of the transport 
equation for the SL occupation probability. Section 3 
discusses irreversibility. A quantum-mechanical trans- 
port equation is derived for rigid random impurity 
scattering and the electron-phonon interaction in Secs. 
4 and 5, respectively. Validity conditions on the 
classical transport equation (Sec. 6) and a summary of 
the principal results (Sec. 7) are then included. 


2. FIELD TERMS OF THE TRANSPORT EQUATION 


In the development of a transport equation to first 
order in the applied electric field and temperature 
gradient, the field terms involve gradients of the zeroth- 
order distribution function. In this quantum-mechanical 
treatment the zeroth-order distribution is taken to be 
the superlattice density matrix representing the Fermi 
distribution with an applied temperature gradient fixing 
the local temperature, 


(vKL} Po} v'K’L’) = fol EL K),7( L) }6,,-dxx i. 


E-Q -1 
we1)=|en(— )+4] 
kT 


This distribution is assumed in an ad hoc manner, just 
as are all zeroth-order distributions in transport theory, 
classical or quantum-mechanical. These distributions 
can be justified only by the physical significance of the 
resulting formalism. 

The complete Hamiltonian for the problem of electron 
transport includes kinetic energy, the electric field &, 
the periodic crystalline potential U(r), and a scattering 
interaction potential. In the case of phonon scattering 
the phonon energy is also included. This section deals 
only with the electron coordinates and only with the 
“unperturbed” Hamiltonian H,, omitting the scattering 
potential V. Usually H, is written 


H,'=p"/2m—F-r'+ U(r’), 


(2.1) 


(2.2) 


(2.3) 


where F is left as a general applied force derivable from 
a scalar potential. 

Consider the effect of a canonical transformation 
with the generating function F(r’,p)=r- (p+ Fi): 


p=p’—Fi, r=r’, Ha=(pt+Fi?/2m+ U(r). (2.4) 
After the transformation H, is cyclic in r, so that 
classically or quantum-mechanically p is a constant of 
motion. In the quantum-mechanical use of this transfor- 
mation, the wave vectors k change in their physical 
definition along with p since the wave function under- 
goes a unitary transformation. 

If the force is the result of an applied electric field, 
the canonical transformation is simply a gauge transfor- 
mation y(/)=c&-rt which sends the scalar potential to 
zero and includes the electric field in a time-dependent 
vector potential 


A(r,t) = —c&t. (2.5) 
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This transformation? has many advantages. If H, is 
the complete Hamiltonian (no scattering), the impor- 
tant result 


v()= v(0)+m"Fi, (2.6) 


may be produced without solving the Schrédinger 
equation, by using the fact that p is a constant of 
motion. 

The unperturbed Hamiltonian given in Eq. (2.4) 
includes kinetic energy, electric field in the time- 
dependent gauge, and periodic crystalline potential. In 
the superlattice representation the matrix for this 
Hamiltonian is simple®: 


(vKL| H,|»’K’L’) =[E,(K—eh"8/) 
—iVxE,-Vit--+: J6rvéxx5,,. (2.7) 


The Schrédinger equation for the density matrix 
becomes, for the complete Hamiltonian H=H,+V, 


dp/dt+ih-[H,,p]= —ih- [Vp]. (2.8) 


The field terms of the transport equation result from 
the left-hand side of this equation. The steady state is 
taken to be one in which the velocity distribution does 
not change in time, and in which the density matrix is 
p=potp’, where p’ is first and higher order in fields. 
The zeroth-order density matrix po is given by Eq. (2.1). 
To first order in fields the only contribution to dp/d¢ in 
the steady state comes through the time-dependence of 
the velocity. Since po is diagonal, the contribution from 
the commutator comes from the off-diagonal part of H,, 


(vKL| {dp/dt+-ih“[H,,0 |} | v' K’L’) 
={-—eh&-Vfolv, K—eh 81, L] 


+v-Vifoly, K—eh81, L)}6,,6xx511. (2.9) 


Thus the usual first-order field terms may be produced 
from a superlattice density matrix formalism. 

The gradients with respect to the two discrete 
parameters K and L arose from a formal treatment of 
them as continuous variables, and so the gradients in 
the resulting field terms should be similarly regarded 
as formal operators to be carried out as though fo(K,L) 
were defined for continuous K and L. 


3. IRREVERSIBILITY 


At some point in the development of transport 
equations from the equations of classical mechanics or 
quantum mechanics one must grapple with the basic 
problem of irreversibility. Whereas transport equations 
are destined to describe irreversible behavior, the 
equations of motion governing the classical phase space 
density and the quantum-mechanical density matrix 
have time-reversal symmetry. To systematize the 
introduction of irreversibility is to clarify what approxi- 


9 J. J. Hopfield (private communication). This gauge has also 
been used in the Japanese literature according to a private 
communication from N. H. Rostoker. 
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mation procedure will appropriately eliminate the 
memory of initial conditions. 

Most treatments of irreversibility in quantum- 
mechanical systems have used the plane wave or Bloch 
representations, in which the momentum and crystal 
momentum, respectively, are diagonal. These repre- 
sentations are admirably suited to the derivation of 
homogeneous transport equations. Before inhomogene- 
ous processes are approached in the superlattice repre- 
sentation the techniques used to obtain irreversible 
behavior in the plane wave representation will be 
reviewed. 

The treatment of an interaction Hamiltonian H’ by 
first-order perturbation theory” yields the well-known 
“Golden Rule” transition probability per unit time, 


W (k,k’) = 2h | (k | H’|k’) |°3CE(k) —E(k’)]. (3.1) 


The approximation which yields this expression discards 
all knowledge of the initial conditions, which would 
otherwise remain in the phases of the wave function. 
Thus irreversibility is achieved by a “random phase” 
assumption which sets all off-diagonal density matrix 
elements equal to zero after each collision. 

Mori!! and Kohn and Luttinger,” in their treatments 
of electrical conductivity, impose an electric field which 
is turned on slowly with time, &e~*'. When the limit of 
the resulting equations is taken as s— 0, irreversible 
equations result. The physical reason for this behavior 
is clarified somewhat by Lax’® who obtains the same 
effect by adding to the density matrix equation a term 
(p—po)/ta which is intended to indicate coupling to the 
rest of the universe. 

A more satisfactory approach to irreversibility is 
formulated by Van Hove." Since the introduction of 
irreversibility constitutes discarding part of the 
Schrédinger equation, Van Hove addresses himself to 
the problem of establishing ‘exactly what is to be 
discarded and what is to be retained. The resulting 
criterion for irreversibility is the following. If 
Ye (Kk) V KY) (kK pk) (Kk | V'| kK’) is of order M larger 
for k=k’ than for k#k’, irreversible behavior results 
in a system of M particles. In the limit of large systems 
this physical characteristic which leads to irreversibility 
may be expressed by the equation 


far V|k’”) (k”” |p|”) (k’”| Vk’) 
= F(k)6(k—k’)+G(k,k’), (3.2) 


where the first term is singular, G(k,k’) is nonsingular. 
If the nonsingular part is discarded the assumption of 
irreversibility is made. Use of the Van Hove criterion 


10. I. Schiff, Quantum Mechanics (McGraw-Hill Book Com 
pany, Inc., New York, 1955), second edition, Chap. 8. 

1H, Mori, Progr. Theoret. Phys. (Kyoto) 9, 473 (1953). 

12W. Kohn and J. M. Luttinger, Phys. Rev. 108, 590 (1957) ; 
J. M. Luttinger and W. Kohn, Phys. Rev. 109, 1892 (1958). 

18M. Lax, Phys. Rev. 109, 1921 (1958). 

41. Van Hove, Physica 21, 517 (1955); and Physica 23, 441 
(1957). 


PHENOMENA IN 


METALS 1539 
avoids the repeated use of the random-phase assump- 
tion. 

The beauty of the Van Hove theorem lies in the 
clarity with which it indicates which part of the 
Schrédinger equation describes systematic irreversible 
behavior. Unfortunately it is of use only in connection 
with operators which are diagonal in the plane wave 
representation since it considers only the diagonal 
density matrix elements. An attempt has been mnade by 
Prigogine and Toda" to extend the Van Hove approach 
to include off-diagonal elements of the density matrix. 
Such an extension cannot be general in applicability 
since there can be no unique division into systematic 
and nonsystematic parts for off-diagonal operators. An 
extreme example is given by the operator whose 
expectation value gives the initial conditions of the 
system: in order to be of use with this operator the 
density matrix equation must be exact. Any extension 
of the Van Hove criterion to off-diagonal behavior 
should specify the class of off-diagonal operators to 
which it is applicable. 

A formally quite different approach to irreversibility 
is used by Bloch and Wangsness.'® Two interacting 
quantum-mechanical systems are represented in a 
complete density matrix p. The distribution matrix o« 
for one of the systems is given by a trace over the 
coordinates of the other system, 


(3.3) 


Irreversibility is gained by assuming the b-system to 
return to its original distribution, with random phase, 
after each scattering event. In Argyres’ treatment of 
galvanomagnetic effects!’ the electron-phonon inter- 
action is treated by this approach to obtain a generalized 
Boltzmann equation. 

The view taken in this paper is that current, energy 
density, and thermal current may be adequately 
represented by superlattice local operators.® Since these 
are diagonal in the superlattice representation, the 
behavior of the diagonal SL density matrix elements is 
all that is required. The analogy between this behavior 
in the SL representation and the behavior of homo- 
geneous transport phenomena in the plane wave 
representation is apparent. 

In the case of impurity scattering, the approach of 
Sec. 4 assumes random phases in the superlattice 
density matrix. Since the behavior of the plane-wave 
vector k is also the behavior of the SL wave vector K, 
the Van Hove theorem predicts the diagonality in K. 
It is shown in Sec. 4 that the largest part of the inter- 
action is diagonal in L. For this part of the interaction 
the Van Hove theorem yields the required random 
phases. For the part of the interaction which is off- 
diagonal in L an ad hoc random phase assumption in 
L must supplement the Van Hove theorem. 


Oa Trop. 


16 T, Prigogine and M. Toda, (unpublished). 
16 R. K. Wangsness and F. Bloch, Phys. Rev. 89, 728 (1953). 
17 PN. Argyres, Phys. Rev. 109, 1115 (1958). 
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The treatment of phonon scattering in Sec. V utilizes 
an analog of the Bloch-Wangsness approach. Instead 
of the phonon distribution being diagonal in q it is 
assumed diagonal in QL, the superlattice parameters. 
Again the largest part of the interaction is diagonal in 
L and no further approximation is needed. For the 
small part of the interaction which is off-diagonal in L, 
however, an assumption of random L phase again must 
be made. 

The requirement of an additional assumption of phase 
behavior for interactions off-diagonal in L is clear in 
its origin: the superlattice representation is an approxi- 
mate description. A distribution diagonal in a SL 
representation with lattice parameter A is no longer 
diagonal in L when it is transformed to a SL represen- 
tation with parameter A’. The physical reason for 
assuming diagonality is as follows: one does not believe 
that there should be correlations in phase over distances 
as large as the superlattice parameter will be in this 
work. 


4. THE QUANTUM TRANSPORT EQUATION IN THE 
PRESENCE OF IMPURITY SCATTERING 


In the presence of random rigid impurity scatterers 
the density matrix need only contain the electron 
coordinates. It is possible to include the crystalline 
potential in the treatment. The random character of 
the impurity centers is well-defined and demonstrated 
explicitly using the superlattice representation. The 
introduction of irreversibility yields a transport equa- 
tion which contains the usual scattering terms plus 
added terms which represent scattering from one 
superlattice cell to another. 

The Hamiltonian may be written 

H=H,+)\V, (4.1) 
where H, is the unperturbed Hamiltonian of Eq. (2.4), 
with F=—e&. The superlattice matrix for H, is given 
in Eq. (2.7). 

The strength of the interaction potential is governed 
by the perturbation parameter \. The potential of % 
elastic impurity scatterers is represented by 


V= > o(r—I,), (4.2) 


a=1 


where v(r) is the scattering potential of one impurity 
at the origin. The scatterers are assumed to be randomly 
distributed at lattice positions |,. 

The superlattice formulation contains an implicit 
ensemble average over the impurity arrangement. 
Suppose there are many randomly distributed impuri- 
ties in each superlattice cell (9>>N*). Then set 1,= L” 
+24 where L” is a superlattice translation vector. 
Next the phase factors appearing in the square of the 
superlattice matrix element are replaced by their 
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average over all SL cells L’’, an automatic ensemble 
average, 
(| 20d expli(k’—k)-2.]|*) 


_ (N2/N®)5 ue + (N/N*) (1 —6dxx’). (4.3) 


The square of the potential matrix in the superlattice 
representation is then 
(vKL| V{»’K’L’) |?=S0.-{L(90/N) 
— (/N*)]] (vK] 0(L”) | »’K) | 6xx-di1 
+ (K/N*)| (vKL|o(L”)|»’K’L’)|?}, (4.4) 


where (vKL{x(L”)|»’K’L’) is the matrix for the 
potential of one impurity at the superlattice position 


L”. 
(vKL!2(L’”) | »’K’L’) 


= f dr bnes*(o(t- Le (0). (4.5) 


Although the short range of the scattering potential 
reduces the magnitude of the matrix element if either 
L or L’ does not equal L”, the extensive wings of the 
superlattice functions lead to finite contributions for 
unequal L, L’, and L’’. The significance of these matrix 
elements off-diagonal in L is that they represent a 
purely quantum effect, scattering from one superlattice 
cell to another. 

The introduction of irreversibility into the iterated 
Schrédinger equation yields the scattering terms 
(Appendix). A synthesis of Eqs. (2.9), (4.4), (A.4), 
(A.7), and (A.8) gives the transport equation, 


—eh" &-VfothVxk,(K)-Vifo 
=N8 Y WOKLy’K’L)[(0’K’L’| p! »’K’L’) 


KL’ 

—(vKL!p|vKL)]. (4.6) 
The transition probabilities W(vKL,v’K’L’) are given 
by 


W (vKL,v’K’L’) 
= Ieh-N2-N- YT 1] (VKL| 0(L”) | »’ KL) |? 


X6LE,(K)—£,-(K’)]. (4.7) 


For simplicity the transport equation is written for /=0. 

The transport Eq. (4.6) governs the irreversible 
progression in time of the superlattice occupation 
probability (vKL!p|vKL) and consequently yields 
expectation values for superlattice local operators. The 
meaning of the transition probabilities and the signifi- 
cance of the quantum effects shall be discussed. 

In the limit where V=1 and the superlattice repre- 
sentation is trivially the Bloch representation® there is 
no dependence upon L since L=0. The usual homo- 
geneous transport equation is then regained : 
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—eh*8- Vu folk) = EW (vk,'k’)L(' |p| o"R’) 
- — (ok| |); 

W (vk,y’k’) = 2h 029] (vk |v] vk) |? 
XL E,(k) — Ey-(k’) 


(4.8) 


(4.9) 


The use of a nontrivial superlattice adds the tempera- 
ture gradient term to the left-hand side and adds 
scattering terms with a position dependence to the 
right-hand side. 

If the scattering terms with L¥L’ are neglected, an 
approximate transport equation is produced: 


—eh- é- Vx fo( v,K,L)+h IVEE,( K) s Vifo(v,K,L) 
=N8>¥> W(vK,v’K’)[(0'K’L’| p| »’K’L’) 


»'K! 

—(vKL!p|vKL)]}. (4.10) 
The transition probabilities are those of Eq. (4.9). 
Equation (4.10) is the classical Boltzmann equation 
except that in place of writing transition probabilities 
between allowed k vectors, only allowed K vectors of 
the superlattice are used. These being fewer in number 
by a factor 1/N*, each of the transition probabilities 
appears with additional weight '*. 

It is apparent that the traditional transport Eq. 
(4.10) cannot be a good approximation to the quantum 
transport Eq. (4.6) unless the following condition is 
satisfied for L¥L’: 


W (vKL,v’K’L’)<<W (vKL,v'K’L). (4.11) 


This condition will be considered again in Secs. 6 and 7. 
If it is not satisfied, a description of transport theory 
must utilize the complete quantum transport equation 
including transition probabilities between superlattice 
states of different L. 


5. THE QUANTUM TRANSPORT EQUATION IN THE 
PRESENCE OF THE ELECTRON- 
PHONON INTERACTION 


The major difference between the previous Hamil- 
tonian and the one used here is due to the inherently 
quantum-mechanical nature of lattice vibrations. It is 
convenient to use a second-quantized Hamiltonian 
containing both phonon and electron energies and the 
interaction, 


A=A,4AA,+ V. (5.1) 


In the interests of diminishing the number of summation 

indices and keeping the interaction tractable, the 

following approximations are made. Bloch electrons are 

discarded in favor of the free electron model. Transverse 

phonons are omitted. Umklapp scattering is ignored. 

The result is a gas of free electrons in “jellium” metal. 
Plane wave electrons have an energy 


H,=(p—e&/)?/2m, (5.2) 
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for which the second-quantized superlattice operator is 


(hK—e&t)? 
A.-|¥ nee. i. 
KL 2m 


hK—e&i 
ar » (— —s -) -Vidsbe.*be (5.3) 
KLL’ 


m 


if higher than first-order terms are omitted in a power 
series expansion and V;é 1 is used in the sense discussed 
in reference 6. 

The operators in wave number space for the electrons 
are the annihilation operator dx, and the creation 
operator dx.*.6 The corresponding phonon operators 
are dgr, and de.*. Throughout this section a double 
superlattice representation is used. The electrons are 
represented in a superlattice whose basis functions are 
linear combinations of plane waves, as are the phonons. 
The complete description of any operator then contains 
the four parameters KLQL’. 

The phonon energy is 

Hy=hw(q). (5.4) 
The corresponding second-quantized superlattice oper- 
ator is 


A, = [> hw(Q)aer*der 
QL 


—1 > hVqw(Q)- Vidi v-aer*der |. 
QLL’ 


(5.5) 


Again the power series expansion has been cut off. 

For one polarization only, the interaction potential is 
i i, ‘ 
V=iC S ———[b*aqbwbx,ur4q 


k«’a [w(q) ]! 
(5.6) 


— by ag* bub, x q |. 
The constant is C= —?y(h/2uV)', where u is the mass 
density, V the volume, and y is the electron-phonon 
interaction constant. 
The transition to the superlattice is carried out by 
using the transformations of reference 6. 


us j |Qtn-»’| 
fac’ 5 NAS {———_—__—. 
KLETQL” xx’ | [w(Q+%—’) ]! 
XexpLix: (L—L”) ] exp[ix’- (L’—L’)] 
| Q+n— x’ | 
X bx r*dqyr, idx: 1bK,.K'4Q- - — : 
[w(Q+%—x’) }! 
Xexpl—ix- (L—L”) ] expl—ix’- (L”—L’)] 


x bx * dear wbx 1 bK,K Qy- (5.7) 


The superlattice wave vector I has components I; 
such that 
(5.8) 


' , i / 
[Kg—K; —I',| <2/A. 
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Various terms in the summation over «; and x,’ in Eq. 
(5.7) will have IT’; take on the values 0, +27/A. 

A linear combination d..qu,v of superlattice de- 
struction operators is defined in terms of a linear 
integral operator T.,qu,1” which operates on dayr,1”, 


dyau.u=T1,qu,Ldqyr,L”. (5.9) 
The operator T is given by 

Trew =N-*D> explix: (L—L”) ] exp[ix’- (L”—L’)] 

xx’ ) 
|Q+x—*’| 

x——_—_—_—... 

[w(Q+x—x’) }} 

The only operation other than multiplication which 

this operator implies is the definition of I in adayr,.”, 

which depends upon « and x’ by Eq. (5.8). In terms 


of dy.qu”,1’, the interaction potential of Eq. (5.7) may 
be written more simply: 


V=icN' > 
KLOQOL’K'L’ 


—byr*diquy vu *bx vox, x9 |. 


(5.10) 


[bxi*di.q ur, LOK’ 1LOK,K’+Q 


(5.11) 


The complete expression for the interaction potential 
thus involves some processes which violate superlattice 
wave number conservation by +22/A in each compo- 
nent, and which have unequal position parameters 
bo, oe 

The Schrédinger equation governing the second- 
quantized density matrix for electrons and phonons is 
taken into the interaction representation with respect 
to H,+H, and iterated. Again an approximate field-free 
unitary transformation is used in the scattering terms. 
The difference between the treatments of impurity 
scattering and phonon scattering lies in the introduction 
of irreversibility. Argyres’ use'’ of the Bloch-Wangs- 
ness'® approach will be followed closely here, except 
that the phonon distribution will be assumed to be 
diagonal in the superlattice representation rather than 
in the plane wave representation. An approximate 
density matrix is formed, a product of distribution 
matrices for electrons, ¢, and phonons, &. The distri- 
bution matrix for phonons is taken to represent a 
temperature gradient in the superlattice: 


(5.12) 


(5.13) 


p=d0», 


op= » x N( Q,L)agu* cat, 
QL 


o= > (KL! o| K’L’)dgi*bx-1. 


KLK'L’ 


(5.14) 


The tedious process of obtaining the scattering terms 
of the transport equation from the iterated Schrédinger 
equation with the potential (5.7) explicitly written out 
will not be given here. The terms are eventually added 
together to give systematic effects in a similar fashion 
to that used by Argyres.'” 
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As in Sec. 4, the well-defined procedure which is 
followed for the introduction of irreversibility is suffi- 
cient for the largest part of the interaction, that which 
is diagonal in L. The Bloch-Wangsness technique must 
be supplemented, for the rest of the interaction, by 
assumptions concerning the electron distribution ma- 
trix. It is assumed diagonal in L. Also terms which 
appear but are off-diagonal by one or two superlattice 
wave number units are ignored. 

The resulting transport equation is 


—eh"&-VfothmK-Vifo 


n= f/* p Be {Wen K— Q L’, KL) 
QL’ 


x[(K—Q L’|o| K—Q L’))—(KL|c| KL) ] 
+W(K+Q L’, KL)[(K+Q Lc} K+ Q L) 
—(KL}o|KL)]}. (5.15) 


The transition probabilities are for phonon emission 
processes, W,., and for phonon absorption processes, 


W ab. 
Wem(K—Q L’, KL) 


= 2rh™?C? > Trqu’,wTLQu’,L* 
L” 


Xdrr-(Neyruy+1)6[ (h2K2/2m) 
—hw(Q+1)—h?(K—Q)?/2m]. 
W.»(K+Q L’, KL) 


= 2rh™)?C? » Trew. Ty QL 
L” 


Xérr(Noy rol ( h?K?, 2m) + hos Q+ r) 
—h?(K+Q)*?/2m]. 
Nat= {exp[hw(Q)/«7(L) ]—1}-". 


The transport Eq. (5.15) yields information con- 
cerning the irreversible behavior of the electron occu- 
pation probability (KL|o| KL). It gives expectation 
values for superlattice local operators which refer only 
to the electron coordinates in the density matrix. In 
the limit where V=1, Eq. (5.15) reduces to the homo- 
geneous transport equation. 

The transport equation may best be discussed by 
considering the operators T and T* which appear in the 
transition probabilities. These involve processes which 
have unequal L, L’, and L”. In addition, some processes 
are included which violate superlattice wave number 
conservation by +22/A. Consider some approximation 
procedures to be applied to the defining equation for T. 
It is possible to force L=L” by assuming «=O in 
(Q+x—x’) in Eq. (5.10). Then 


(5.16) 


(5.17) 
(5.18) 


| | 4q| | 


T a i —— d Oro. (5.19 
L,QL’’,L (ou 50 Jouvs (5.19) 
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By similar procedures, it is possible to force diagonality 
(L=L’) or to force L’=L”. It is also possible to force 


L=L'=L": 


| lal 
TL,Qu”,L= (0 —— | 
[w(q) }*| 


The following notation is used in these equations: 
(QL! gw | QL’) 
1 |Q+é| 


OE, Oia Se, yy 
Mime ee 


These results indicate that the nonconservation of 
wave number is intimately associated with the least 
likely of the possible interactions: those in which the 
initial state of the electron, the final state of the 
electron, and the state of the phonon are all character- 
ized by different superlattice positions. All interactions 
having only two of the three positions equal conserve 
superlattice wave number and are of the same order of 
magnitude. Consequently Eq. (5.20) constitutes the 
only legitimate approximation to Eq. (5.10). This 
yields the purely L-diagonal interaction 

|Q| 


V=icN!} ya soins —[ bx.*a Lox’ LOK .K’ Q 
KoKL[o(Q} 


OL )b.x-duvdr0 (5.20) 


(5.21) 


—bxr*dar*bx 16x xq]. (5.22) 


The condition for the validity of the approximation 
(5.20) is that for L¥L’, 
|(QL| gw?! QL’) |?! (QL gw? | QT) |*. 
This will be considered again in Secs. 6 and 7. When it 
is not satisfied, the complete transport Eq. (5.16) must 
be used. When it is satisfied, putting together Eqs. 
(5.16) and (5.22) yields the classical Boltzmann equa- 
tion: 
—eh&-Vxfothm IK-Vi fo 
= NM VolWen(K—Q, K)[(K—Q L|c| K-Q L) 
— (KL}o| KL) ]+W..(K+Q, K) 
x[(K+Q L|o| K+Q L)—(KL|c|KL)]}. 
The transition probabilities for emission and absorption 
are given by the usual equations: 
Went K—Q, K) 
== Dah NC70?/w(Q) ](Nort+lélheK?/2m 
—hw(Q)—h?(K—Q)?/2m], 


(5.23) 


(5.24) 


W.»(K+Q, K) 
= QrhNC*L02/w(Q) ](Nar dL h?K2/2m 
+ hw(Q) — h?(K+Q)?/2m]. 
The factor .V* which enters Eqs. (5.16) and (5.24) was 
discussed at the end of Sec. 4. 


(5.26) 
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6. VALIDITY CONDITIONS OF THE CLASSICAL 
TRANSPORT EQUATION 


The quantum transport equation derived in this 
work contains quantum corrections to first order in h, 
and also corrections to first order in the temperature 
gradient. The use of classical transport theory for 
thermal transport phenomena is valid if the classical 
Boltzmann equation is a meaningful approximation to 
this quantum transport equation, since any quantum 
corrections not included here will be of higher order. 
In this section validity conditions for the classical 
approximation are gathered and summarized. Upper 
and lower limits are placed on the size of the superlattice 
cell by these conditions. A numerical evaluation may 
be made for degenerate and nondegenerate free electron 
gases, showing that the conditions are easily fulfilled. 

The field terms of the quantum transport equation 
for a free-electron gas are 


0( KL! p| KL) 
( om . ) =—eh 'E-Vifoth IVE: Vi fo. (6.1) 
al f 


The classical Boltzmann equation has field terms 


0 f(k,r) 
— = —eh 1E-Vifoth VE: Vr fo. (6.2) 
ot f 
For an infinite crystal, both k and r are continuous 
variables, while K and L are discrete. For the approxi- 
mation of Eq. (6.1) by Eq. (6.2) to have meaning, the 
energy and the superlattice occupation probability must 
vary slowly between successive values of K and L. 
The validity conditions on the use of the classical field 
terms are the following: 
| Vale! (4/A)KE, 
Vic fo! (4 A \KK fo, 


| Vr fo| (A, 2). 0. 


(6.3) 
(6.4) 


(6.5) 
In each case these are to be evaluated near the Fermi 
surface, kr, in metals, or near the most probable wave 
vector, k,, in semiconductors. 

The scattering terms of the quantum transport 
equation and of the classical equation take the following 
forms: 


(KL! p| KL) : : 
( ) =N8 > W(KLK’L)[(K’L'|p|K’L) 


al K’'L’ 
(KL!p|KL)], (6.6) 


0 f(k,r) : 
( ) => W(k,k’)[ f(k’,r) — (k,n) ]. (6.7) 
c k’ 


dl 
In order to neglect the terms with L’AL in Eq. (6.6), 
one must satisfy the validity condition 


W(KL,K’L’)<W (KL,K’L), (6.8) 
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which is the general expression corresponding to Eqs. 
(4.11) and (5.23) for impurity and phonon scattering, 
respectively. For these special cases, the condition 
may be written as 


Vik! | k’) | (/A)<| (k} 0} k’)| 


NG) cor 


(6.9) 


and 


(6.10) 


Along with Eqs. (6.3) and (6.4) these same conditions 
allow the replacement 


V8 > W(K,K’) f(K’,L) > W(k,k’)f(k’,L), (6.11) 
K’ k’ 


which completes the approximation of Eq. (6.6) by 
Eq. (6.7). 

The effect of the added scattering terms of the 
quantum transport equation on the solution of the 
equation has been studied.'* The solution was found to 
have a correction term increased by only one order of 
magnitude over those of the transport equation. The 
conditions (6.9) and (6.10) which were derived for the 
validity of the classical equation should therefore hold 
also for the validity of the classical solution. 

Thus there are four validity conditions. Three of 
them arise from the field terms. The other refers to the 
scattering terms. Except for Eq. (6.5), all the validity 
conditions give lower limits to A. The most stringent 
lower limit at temperatures above 0.1°K is given by 
Eq. (6.4) evaluated for a degenerate electron gas with 
Fermi velocity ur: 


TA>hurpr/k. (6.12) 


The upper limit on the superlattice parameter is given 
by Eq. (6.5): 
|oT| 
AT. 


Oz | 


(6.13) 


Putting together the two conditions (6.12) and (6.13), 
a limit is found on the size of temperature gradients 
which can satisfy the validity conditions: 
1)0T| 1 «7 
he ———, (6.14) 
T\0s| A hurr 


A numerical estimate of this validity condition gives 
1|dT| 
‘i «(5X 10°7) cm™, 
T| 02 | 


(6.15) 


This is a very weak condition, and is easily satisfied in 
practical cases. 
7. SUMMARY 
Using the Van Hove theorem supplemented by an 
assumption of diagonality in L, it was possible to derive 


48 E. C. McIrvine, thesis, Cornell University, 1959 (unpublished). 
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a quantum transport equation governing the case of 
impurity scattering of electrons. The result showed 
scattering terms which referred to impurity centers in 
each superlattice cell scattering electrons between all 
possible pairs of superlattice cells. The largest terms 
were those for which the electron both before and after 
the scattering is in the same cell as the impurity. 

Following Argyres’ use of the Bloch-Wangsness 
approach, and again supplementing this by an assump- 
tion of random L phase, a quantum transport equation 
for electrons suffering collisions by a phonon population 
was derived. Again the result showed scattering 
processes involving electron initial and final states and 
phonon states all characterized by different superlattice 
positions. Again the largest terms were those for which 
the three position parameters were equal. 

The classical equation was derived by replacing the 
discrete position and momentum parameters of the 
superlattice by continuous variables, and by dropping 
all scattering terms which connect states of different L. 
For the remainder of the interaction, which is diagonal 
in L, the previous added assumption of diagonality in 
L is not needed, and the Van Hove theorem or Bloch- 
Wangsness approach suffices as an introduction of 
irreversibility. Validity conditions on dropping these 
extra scattering terms were developed, as well as 
conditions on the replacement of the discrete K and L 
by continuous wave numbers and positions. Numerical 
estimates were made of the validity conditions. For all 
reasonable and practical temperature gradients, at all 
temperatures at least down to 0.1°K, the classical 
transport equation is a good approximation to the 
quantum transport equation here derived. 

The quantum corrections to the classical transport 
equation were of first order in # and in 07/02. All 
terms of this order which have meaning in the super- 
lattice context have been found. It is quite possible that 
higher order quantum corrections exist, and corrections 
with cross-products of # with 87/02. These will be even 
smaller, however, than the first order corrections which 
have already been ruled negligible for practical con- 
figurations. 

It will be noted that the added terms of the transport 
equation are dependent upon the superlattice parameter 
A. Consequently the resulting values of the observable 
functions which form the expectation values of super- 
lattice local operators will also vary with A. This does 
not indicate that the quantum corrections are fictitious, 
but rather reflects the observable nature of these 
functions. The expectation value of a spatially-localized 
function of momentum can be looked upon as the result 
of a measurement of that function with apparatus 
having characteristic dimensions on the order of A. 
Naturally the uncertainty principle causes the results 
of such a measurement to be different for different 
values of A. 

If the quantum corrections derived in this fashion 
were large, it would be necessary to determine what 
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value of superlattice parameter could legitimately be 
used in order to compare theory with a particular 
experimental arrangement. Fortunately, for a~ wide 
range of superlattice parameter the corrections are very 
small, and the numerical corrections are not needed. 
In this case the superlattice result is of interest due to 
the insight which it provides as to the nature of the 
quantum-mechanical corrections. It also provides a set 
of validity conditions which put the classical result on 
a much sounder footing. 
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APPENDIX 


If the Schrédinger Eq. (2.8) is taken into the inter- 
action representation by the unitary transformation 


t 
J=esp ih f (eat 


Op! /dt= —idAh[V" ,p |, 


(A.1) 


it becomes 


where 


M'=JMJ—, 
and 
Op Op! 
—+ih"(H,,p |=J-—J. (A.4) 
at ot 


When iterated to two orders, Eq. (A.2) becomes 


p!(r)—p!(0)=—idh f dtl V"(t),p"(0) ] 


0 


T id 
=v f at f ditLV"(t'),[V4(t),07(0) ]}. (A.5) 
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‘When Eqs. (A.4) and (A.5) are expressed in the 
superlattice representation they form the foundation 
for the development of a transport equation. An 
approximation of irreversibility in Eq. (A.5) yields 
the scattering expression to be inserted into the right- 
hand side of Eq. (A.4). The left-hand side of the latter 
equation gives the usual transport equation field terms 
as demonstrated in Eq. (2.9). 

The lowest order transport equation is desired, with 
fields to first order and the interaction to the lowest 
nonvanishing order. As written, the fields enter the 
scattering term through the unitary transformation J. 
The time dependence of H, involves the electric field 
while the off-diagonal part of H, involves the tempera- 
ture gradient. Consequently J involves these fields 
through the same dependences. In order to eliminate 
the field-dependence of the scattering term an approxi- 
mate field-free unitary transformation J’ will be used 
throughout Eq. (A.5) and the right-hand side of Eq. 
(A.4): 

(vKL|J’| v'K’L’) = exp th 1, (K)¢ ]6,.dxx dv. 

The introduction of irreversibility was discussed in 
Sec. 3. The essential irreversible part of Eq. (A.5) in 
the superlattice representation will be taken to be that 
for which the density matrix (vKL!p(0)|»’K’L’) is 
diagonal. Using the notation 


wx k= hLE,(K)—E£,-(K’) J, 


(A.6) 


(A.7) 
and the standard techniques of perturbation theory to 
integrate over (t—/’), the double integral over time 
yields a linear dependence on 7 and a coarse-grained 
time derivative (actually a difference quotient) may 
be formed : 


6(vKL| p| KL) 


él 
(vKL| p(7)|vKL)—(v KL! p(0) |» KL) 


r 


=2eh ¥ | (vKL|V | »’K’L’)!? 


wK'L’ 
X(L(v’K’L’| p| »’K’L’) 


— (vKL! p| vKL) J6(w.x,,.x:). (A.8) 
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Electron-Spin Resonance of Nitrogen Donors in Diamond 
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Electron-spin resonance of bound substitutional nitrogen donors in diamond is observed and discussed. 
The g factor is isotropic at 2.0024+-0.0005. For a given donor, one of the C-N bond directions is a hyperfine 
axis with constants A = 40.8 oersteds, B= 29.2 oersteds. There are thus four types of donors, equally abun- 
dant. A model for the donor wave function is proposed which puts the donor electron principally into an 
antibonding orbital located on a nitrogen atom and on one of its nearest-neighbor carbon atoms. A C-N bond 
distortion results which can be regarded as a manifestation of the Jahn-Teller effect. A careful search 
reveals the presence of an additional weak spectrum due to donors on N'*-C* pairs. (The isotope C8 which 
has a nuclear spin of $ has a natural abundance of 1.1%.) The hyperfine constants measured for a C8 atom of 
an N-C pair are A’=60.8 oersteds, B’ = 25.3 oersteds. The s and p contributions to all 4 measured hyperfine 
constants are separated to give the values 

On = (82/3) |y (0) |?x =2.41 atomic units, 

Py=((?—}(x2+-y*) ]/r®)x =0.28 atomic unit, 

Oc= (82/3) | (0) |2c =0.78 atomic unit, 

Po=((22?—4 (22+ y*) ]/r5)o=0.25 atomic unit. 
These are compared with theoretical values obtained by assuming a simple antibonding wave function 
composed of nitrogen and carbon tetrahedral orbitals. An increase of several percent in the N-C separation 
along the hyperfine axis is strongly implied by the comparison. 


I. INTRODUCTION impurities. Recently7* considerable amounts of gases 
have been found in natural diamonds by Kaiser and 
his co-workers. Nitrogen is the predominant gas and 
a concentration as large as 0.1% was reported. The 
same investigators measured the infrared absorption 
coefficient at 7.84 and showed its proportionality to 
the amount of nitrogen present in the stone. Thus a 
fairly complete explanation seems to have been provided 
regarding the question of the real difference between 
type I and type II diamonds.’ Type I diamonds have 
infrared absorption bands in the 7-134 range while 
type II diamonds are transparent in this range. Another 
property which distinguishes type I from type II 
diamonds is the ultraviolet absorption cutoff frequency. 
For type I’s this frequency is approximately 3000 A 
(4.15 ev). For type I’s it is 2250 A (5.5 ev). The value 
5.5 ev is, of course, the band gap energy for diamond. 
Kaiser’s group" has now shown that the shift of the 
ultraviolet absorption curve in type I diamonds is also 
proportional to the nitrogen content. 

A survey of the paramagnetic resonance spectra of 
approximately 100 natural diamonds!!:!? has inde- 
pendently shown us that nitrogen is present as a 
common impurity in diamond. It is certainly the most 
abundant paramagnetic impurity in natural diamond. 
Nitrogen being a group V element, and being next to 


REVIOUS work! ? reporting electron-spin resonance 

in diamond has been concerned exclusively with 
paramagnetic centers produced by irradiation with fast 
neutrons. Fairly specific models such as carbon vacan- 
cies and interstitials, C2 molecules, etc., were proposed 
to account for the spectra observed in diamonds treated 
in an atomic pile for times varying from 10 hours to 
1850 hours. The spectra of these defects are charac- 
terized by electronic spin values S= 1 and S= 2, by line 
widths ranging from 5 oersteds to 70 oersteds, and by 
various irreversible changes brought about by tem- 
perature annealing. 

Recently** room-temperature paramagnetic 
nance spectra have been observed in natural, unirradi- 
ated diamonds. These spectra are characterized by 
isotropic g values differing very little from the free- 
electron g value, by line widths typically about 0.3 
oersted, by characteristic hyperfine structures arising 
from the interaction of an unpaired electron spin with 
the spin of the impurity nucleus to which the electron 
is bound, and by long spin-lattice relaxation times. 

The minor element content in natural diamond had 
been determined by a number of spectrographic 
studies.°.* Si, Ca, Mg, Al, Fe, Ti, Cu, and B were 
generally found to be the most abundant nongaseous 


reso- 


7 Kaiser, Bond, and Tanenbaum, Bull. Am. Phys. Soc. Ser. IT, 


1 Griffiths, Owen, and Ward, in Defects in Crystalline Solids 
(The Physical Society, London, 1955), p. 81. 

2M. C. M. O’Brien and M. H. L. Pryce, in Defects in Crystalline 
Solids (The Physical Society, London, 1955), p. 88. 

8 Smith, Gelles, and Sorokin, Phys. Rev. Letters 2, 39 (1959), 

*Smith, Gelles, and Sorokin, Bull. Am. Phys. Soc. Ser. IT, 
4, 144 (1959). 

5 F. G. Chesley, Am. Mineralogist 27, 20 (1942). 

6 F, A. Raal, Am. Mineralogist 42, 354 (1957); E. N. Bunting 
and A. Van Valkenburg, Am. Mineralogist 43, 102 (1958). 


4, 27 (1959). 

8 W. K. Kaiser and W. L. Bond, Phys. Rev. 115, 857 (1959). 

9 See M. Lax and E. Burstein, Phys. Rev. 97, 39 (1955), where 
references to earlier work are given. 

 W. K. Kaiser (private communication). 

1 Most of the diamonds were kindly made available to us by 
Mr. Bart van Berg of the Rough Diamond Company, New York, 
New York. 

2 A type I, a type II (a), and a type II(b) were provided us by 
Dr. J. F. H. Custers of the Diamond Research Laboratory. 
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Fic. 1. Electron-spin resonance of nitrogen in diamond (temp=295°K). The de magnetic field is parallel to the crystal axis indicated. 
This spectrum is referred to in the text as spectrum A. 


carbon in the periodic table, should be present as a 
substitutional donor in the diamond lattice. One could 
expect a large fraction of the nitrogen donor electrons 
to be bound at room temperature. Furthermore, one 
might expect the structure of the donor wave function 
in diamond to be quite different from that of donors in 
silicon.’ In this paper a specific model is suggested for 
the wave function of the bound nitrogen donor electron 
and experimental evidence is presented with which the 
model can be tested. 


II. PARAMAGNETIC RESONANCE SURVEY 


Using standard X-band resonance techniques, one 
hundred and twenty natural diamonds were individually 
surveyed at room temperature. Almost all of the samples 
were good single crystals ranging in size from 0.25 to 
1.0 carat. Many of the stones were perfect octahedrons; 
generally these were the more lightly colored South 
African specimens. X-ray analysis revealed that a few 
of the more densely colored stones, opaque greenish 
colored diamonds from the Belgian Congo and Sierra- 
Leone regions, were twinned, but others selected of this 
type were perfect crystals. The Belgian Congo and 
Sierra-Leone stones appear to have the greatest con- 
centration of nitrogen impurities, as judged by the 
strength of the microwave resonance absorption. The 
spectrum due to nitrogen is reproduced in Fig. 1. This 
resonance was strongly observed in roughly half of the 


13W. Kohn, in Solid State Physics, edited by F. Seitz and 
D. Turnbull (Academic Press, Inc., New York, 1957), Vol. 5, 
p. 257; H. Brooks, in Advances in Electronics and Electron Physics, 
edited by L. Marton (Academic Press, Inc., New York, 1955), 
Vol. VII, p. 85. 


samples. As indicated by Tig. 1, the signal-to-noise 
ratio for the nitrogen resonance in these samples is 
large. 

In about thirty samples a spectrum which we had 
previously*® attributed to bound aluminum acceptors 
was observed. We recently measured the g value of this 
resonance to be g=2.0027+0.0005. Further work ap- 
pears to be needed to make the identity of this center 
more certain. Two distinct other types of spectra with g 
values ~2.00 were seen. These are not as yet under- 
stood by us. Paramagnetic resonance was detected in 
every sample. 

Angular studies were made for each of the above 
distinct spectra, orientation of samples being accom- 
plished by x-ray techniques and also by referring to the 
known scheme of plane indices for the macroscopic 
crystal faces. A few runs were made at liquid nitrogen 
and liquid helium temperatures which indicated a pro- 
nounced increase in the tendency for the lines to be 
saturated at these temperatures. No quantitative re- 
sults for the relaxation times have been obtained as yet. 
g values were measured by placing a small amount of 
the free radical DPPH (g=2.0036+0.0003) in the 
cavity together with a diamond sample. Values of the 
magnetic field were measured by proton resonance. 


III. INTERPRETATION OF THE 
NITROGEN SPECTRUM 
From Fig. 1 one observes that the g factor of the 
resonance is isotropic and that for Ho||[100] there is a 
triplet of equally spaced, equally intense lines. As the 
hyperfine splitting multiplicity is 27+1, this suggested 
? that the impurity producing the resonance is nitrogen 
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which is practically the only element with nuclear 
spin 1. The appearance of the spectra for the two other 
orientations shown is explained by considering the 
possibilities of more than one type of center. The 
simplest assumption is that a nitrogen atom is substi- 
tutional and that one of the C-N bond directions is a 
hyperfine axis. The resonance spectrum from a given 
nitrogen would then consist of 3 equal-intensity hyper- 
fine lines whose spacing |K| depends on the angle 6 
that the hyperfine axis makes with the direction of the 
external magnetic field Ho. The spacing |K{ should 
vary as K*=A?*cos’*@+ B* sin*é. It does follow this 
variation. The constants are A = 40.8 oersteds, B= 29.2 
oersteds. The nearest-neighbor configuration in diamond 
is tetrahedral, with bonds along [ 111 ] directions. There- 
fore, when Hp is parallel to a [111] crystal direction, 
25% of the donor centers will have their axes parallel 
to Ho. This is the relative orientation for which | K| is 
a maximum. The remaining 75% of the donor centers 
make an angle 6=cos~'!(—4) with Ho. The expected 
3:1 ratio of the satellite intensities is clearly seen in 
Fig. 1. Similar considerations apply to the two other 
orientations. That there are 4 types of impurity centers 
equally abundant is also shown by the fact that for 
arbitrary orientations the outer satellites are resolved 
into as many as four lines. No effects due to an inter- 
action with the nuclear quadrupole moment of nitrogen 
are seen, presumably because of its small value 
(O=+0.02X 10~™ cm”). 

If the nitrogen donor electron occupied a hydrogen- 
like orbit of predominantly s character the hyperfine 
interaction would be isotropic, with A= 8B. As this is 
not the case experimentally one must therefore con- 
clude that there is a large admixture of p or higher 
type orbit to the donor wave function. In the model we 
propose, the electron with unpaired spin occupies prin- 
cipally an antibonding'® orbital between the nitrogen 
impurity atom in a substitutional site and one of its 
carbon nearest neighbors. Schmid'® has shown that 
bonding orbitals of the Hund-Mulliken type give a 
better account of the cohesive energy of diamond than 
those of a simple Heitler-London type. These orbitals 
are of the form 


¥p,a=(1/V2)(@n+¢c), 


where $n, @c are hybridized sp* orbitals of tetrahedral 
type which point towards the appropriate nearest 
neighbor. The + and — signs are for the bonding and 
antibonding orbitals. It should be noted that if one 
assumes that the valence electron states are made up of 
bonding electrons of this type, one is forced to make up 
conduction band states from antibonding orbitals of 
the same type. 

The model assumes that the repulsive force exerted 


4B. Bleaney, Phil. Mag. 42, 441 (1951). 

16 See, for example, C. A. Coulson, Valence (Oxford University 
Press, Oxford, 1952), p. 88. 

16 ., A. Schmid, Phys. Rev. 92, 1373 (1953). 
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on the nitrogen-carbon pair distorts the lattice suffi- 
ciently to localize the electron principally on a simple 
antibonding orbital. The situation may be viewed as a 
manifestation of the Jahn-Teller effect.'7 The orbital 
degeneracy is that associated with the possibility of 
locating the electron on any one of the four equivalent 
antibonding orbitals about the nitrogen atom. The 
frequency of reorientation of the Jahn-Teller distortion 
must be less than one Mc/sec at room temperature in 
order not to conflict with the observed line width. 
Comparison is invited between the model proposed 
here and that which is known to predict rather well 
the properties of donors in silicon and germanium. For 
the latter one considers the excess electron subject to a 
Coulomb potential e/er of the impurity ion. The orbits 
are large (25 A) compared to the distance between 
atoms and the time needed for the electron to pass an 
atom is greater than the time required for the inner 
bound electrons of the atom to readjust and polarize 
the medium, thus reducing the Coulomb force by a 
factor 1/e. In the case of nitrogen in diamond, on the 
other hand, the much more tightly bound donor electron 
orbits at a much greater frequency so that electronic 
polarization is not allowed to develop, the Coulomb 
potential then remaining approximately e/r. Thus the 
“radius” of the nitrogen donor is more or less consistent 
with the free hydrogen Bohr radius; the donor ioniza- 
tion energy is likewise probably closer to the value for 
a hydrogen atom than it is to the 0.4-ev value predicted 
for a potential V=e/er with e=5.7, the value of the 
dielectric constant of diamond. Being thus confined to 
the immediate vicinity of the donor ion where it can 
essentially exist in any one of four available antibonding 
orbitals, the electron falls into a selected one, the corre- 
sponding N-C bond is weakened and distorted, and the 
electron is trapped in the given antibonding orbital. 
Consideration of this model suggested the following 
experimental check. As there is a small natural abun- 
dance (1.1%) of C", and as this isotope has a nuclear 
spin of 3, there should exist a weak additional spectrum 
arising from donor electrons in antibonding orbits on 
N"¥-C'8 pairs. The C" nuclear spin splits the nitrogen 
triplet so that the weak additional spectrum should 
consist of pairs symmetrically disposed about each of 
the main nitrogen absorptions. Each line would be 
roughly 180 times smaller than the corresponding line 
of the main spectrum. A careful search revealed this 
set of C™ lines; portions of this spectrum which will 
hereafter be called B are reproduced authentically in 
Fig. 2 and schematically in Fig. 3. In the traces shown 
in Fig. 2 the recorder zero adjustment was changed 
several times to keep the pen on scale as the low-field 
satellites of the main spectrum were approached. These 
traces show clearly why we are able to see only those 
portions of the spectrum B which lie outside the outer 


17H. A. Jahn and E. Teller, Proc. Roy. Soc. (London) A161, 
220 (1937). 
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Fic. 3. Schematic representation of the microwave absorption spectrum of bound nitrogen donors in diamond. Relevant portions 
of the spectra A, B, C discussed in the text are shown. 


satellites of the main spectrum A. Other runs made by 
reducing the spectrometer gain by 100 at the positions 
of the main satellites enabled relative intensities to be 
measured. The measured intensity of the B spectrum 
relative to that of the main nitrogen spectrum is in 
approximate agreement with the factor 1/180. From 
the data the C® hyperfine constants are at once deduced 
to be |A’|=60.8 oersteds, | B’| =25.3 oersteds. The 
three other observed line positions may then be 
compared with the predicted values obtained from 
2= 4" cos*@+ B” sin*@. None of the 3 comparisons 
for |K’| is off by more than 1 oersted. The hyperfine 
axis for the C spectrum is again the C-N bond direc- 
tion. Besides this well-defined set of lines B, some weak 
structure was observed on the wings of the main 
nitrogen satellites (dashed lines in Fig. 3). Only for 
the case Ho||[100] could these latter lines, hereafter 
called spectrum C, be clearly distinguished. For this 
particular orientation the complete spectrum C con- 
sists of three groups of 2 lines, the latter being sym- 
metrically located on each side of a main nitrogen 
absorption and separated from it by 6.4 oersteds. The 
measured intensity of these lines C relative to the main 
nitrogen lines A is approximately consistent with the 
factor 1/60. The C splitting is a rough measure of the 
donor electron density on the basal carbon nearest- 


neighbor atoms. From these data it appears that the 
donor electron is indeed mainly confined to the nitrogen 
atom and one of the nearest neighbor carbon atoms. 

In the Appendix it is shown that if the wave function 
for the unpaired electron is axially symmetric, the 
hyperfine constants are given by 


A= 2uourgr{O+2P}, 
B= 2uourgr{O— P}, 
where (1) 
O= (8/3) |¥(0)|?, 
P=([2?—}(a?+ y’) ]/r’). 
The quantity P, which vanishes for a pure s orbital, 
would be a positive quantity for an antibonding orbital 
of the type proposed. Solving for O and P, and using 
the experimental values for A, B: 
a= (40.8) yei%, 
B= (29.2)yeh, 
one has 
Ox = 16.3X 10"4 cm~*= 2.41 atomic units, 
Py =1.9X 10" cm*=0.28 atomic unit. 
The value for Oy implies a density of the donor wave 
function at the nitrogen nucleus: |¥(0) |’y=1.95X 1074 
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cm™*, Likewise one obtains from the C™ data the 
experimental values 
Oc=5.24X 104 cm-*=0.78 atomic unit, 
Pco=1.67X 104 cm*=0.248 atomic unit, 
|W (0) | °c =0.63X 104 cm-, 
To compare theory and experiment, the values which 
are predicted from the N-C antibonding orbital assump- 
tion will now be considered. From the Hartree solutions 


for the free atoms'*:"’ and from (1/r*) averages com- 
puted from spectroscopic data,” one obtains 


38.0 atomic units, 


8 
Ox4=—|(0) |?= 
3 


roi 


Oc’ = 24.6 atomic units, 


cae ea 


=1.01 atomic units, 


Pco4=0.48 atomic unit. 


Neglecting overlap integrals entirely, the following 
values are obtained for the case of a simple antibonding 


orbital Y= (1/V2) (¢n—¢c): 


Ox 484.75 atomic units, 
Py 480.38 atomic unit, 
Oc48=3.08 atomic units, 


Pc 480.18 atomic unit. 


It is clear that this simple approximation predicts a 
greater density of the donor wave function at the 
nitrogen and carbon nuclei than is experimentally 
observed. The discrepancy is enhanced if the overlap 
integral S= f¢n¢cdr is taken into account, for the 
above quantities are then increased by a factor n=1/ 
(1—S)&3. The situation improves markedly if one 
assumes an axial displacement of both nitrogen and 
carbon atoms away from one another. This process 
reduces the amount of s character in the antibonding 
wave function. The overlap integral /@n¢cdr is also 
reduced, due both to the above mentioned change in 
hybridization and to the increase in internuclear dis- 
tance. For an orbital @~(s+Ap) directed along the 
axis of a distorted tetrahedral set of bonds, the amount 
of hybridization is determined by the formula?! tang 
=—[2(1+A) ]!, @ being the angle between the axial 
and basal bonds. If the nitrogen or carbon atom is 
displaced far enough, the hybridized orbital which 
contributes to the donor wave function becomes a pure 
P, orbital. In this case the three basal bonds are 


18 Brown, Bartlett, and Dunn, Phys. Rev. 44, 296 (1933). 
19C, C. Torrance, Phys. Rev. 46, 388 (1934). 

20 R, G. Barnes and W. V. Smith, Phys. Rev. 93, 95 (1954). 
*1 See, for example, reference 15, p. 194. 
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TABLE I. Comparison of observed atomic constants with values 
obtained from models discussed in the text. n’ is the value of the 
overlap integral for a 10% increase in the C-N bond distance. 


Model 
predicted 
(no dis- 
tortion) 


4.8n 
0.38n 
3.19 
0.18n 


Model 
predicted 
(10% dis- 
tortion) 


Atomic 


38.0 
1.01 


Obee rved 


oy 41 
0.28 
0.78 24.6 

0.248 0.48 


ry 
1.53’ 
0.21’ 


trigonal, sp?. Clearly, this represents an extreme situ- 
ation, and an intermediate displacement probably 
occurs which reduces On4? and Oc4* sufficiently. 
A guess at what might be a maximum tolerable dis- 
placement for the nitrogen atom could reasonably be 
put at 5% of the normal C-C bond length. This would 
change the hybridization of the orbital used in the anti- 
bonding wave function to approximately 12.5% s 
character, 87.5% p character (A=4/7). In this estimate, 
the connection is made between the displacement of 
the nitrogen atom and the angle ¢ between the axial 
and basal orbitals of the nitrogen atom by requiring 
the axes of the nitrogen basal orbitals to pass through 
the midpoints of the undistorted C-N basal bonds. The 
tetrahedral orbitals on the basal carbon atoms remain 
fixed, and ‘“‘bent’” bonds are thus formed. This con- 
straint is approximately consistent with the require- 
ment that maximum overlap of the basal bonds be 
established for a given nitrogen displacement. The 5% 
distortion reduces Ox“ to 2.37 atomic units while Py 42 
becomes 0.44 atomic unit. The same displacement 
applied to the carbon atom results in values for Oc4# 
Po*® of 1.53, 0.21 atomic units, respectively. Table I 
summarizes the comparisons discussed above. While 
the above estimate of a net increase of 10% in the 
C-N bond length due to the presence of the extra anti- 
bonding electron may be too high, an increase of several 
percent is certainly Localization of the 
donor electron in one antibonding orbital argues for a 
fairly severe atomic distortion. Furthermore, not too 
much energy may actually be lost in the ‘‘bent” bonds 
which are formed between a displaced nitrogen or 
carbon atom and the three basal carbon atoms. A rela- 
tively large increase in the separation of an N-C pair 
may be responsible for the anomalous x-ray spots” 
which characterize type I diamonds. 

Actually, while it is clear from the experimental data 
that the bound donor electron spends by far the greatest 
time per atom on the nitrogen and on one of the nearest- 
neighbor carbon atoms, one sees from Fig. 3 (dashed 
lines) that the wave function density is not entirely 
negligible on all other carbon atoms. The resolvable 
weak lines (C) on either side of the main satellites for 
the case Ho||[ 100] seem to indicate that the wave func- 
tion cunalty on each of the basal carbon atoms is about 


reasonable. 


eae Proc. Roy. Soc. (London) A179, 315 (1942). 
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one-sixth that on the axial carbon atom. Calculation 
shows that this density cannot “be attributed to the 
extension of the principal antibonding orbital at these 
points. The true wave function is, therefore, a linear 
combination of antibonding orbitals, with a predomi- 
nant contribution from one of them. A more detailed 
consideration of the donor wave function is unwarranted 
until better data are obtained for the distribution of the 
wave function on atoms other than the N-C pair. It is 
clear that this could be done by using the double- 
resonance technique of Feher. 


IV. REMARKS ON SPIN CONCENTRATIONS 


In the majority of our samples the line width AH of 
the nitrogen resonance was measured to be about 0.3 
oersted between the points of maximum slope. This 
limiting line width may inhomogeneous 
broadening caused by the presence of C™ atoms on 
second-nearest-neighbor sites. Some variation of AH 


represent 


towards greater values was observed in a few stones, 
but the cause of this is unknown. For spin-spin broaden- 
ing to become important a nitrogen concentration 
2 10"*/cc is required. (For .V electrons per cc which 
are randomly distributed in the lattice, AHN X10-" 
oersteds.*) Our estimates of the spin concentrations of 
our various samples fall more in the range 10'°-10'"/cc. 
Since this is several orders of magnitude less than the 
amount of substitutional nitrogen Kaiser observes in 
diamonds, it is probable that most of the nitrogen may 
be present in a nonparamagnetic form, for example, in 
adjacent substitutional pairs. The isolated donors 
which produce the microwave resonance we see there- 
fore represent only a small fraction of the nitrogen 
present. 

No exchange effects*:** due to overlapping donor 
wave functions such as occur in the case of donors in 
silicon were observed. This seems reasonable because 
the bound donors in diamond are much more compact 
structures. 
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APPENDIX 


The hyperfine interaction between an unpaired elec- 
tron and a nucleus is represented by the operator 


3(r-S)(r-1) (S-1)| 
J int — 2uomrgi 4 a ; 
= rn | 


> 


T 
+ 2uourgr 3 5(r)S- I. 


To express this interaction as a function of spin coordi- 
nates only, an integration over the spatial coordinates is 
performed : 


=f y*( r) Hind r= (Hint) 


| 8a 
= 2uourgr “ y?(0)S- I 
s 


3(S-r)(r-D S-I 
+f ene 1) Li 
r° r 


For an axially symmetric electron wave function (x?/r°) 
=(y"/r>), while terms such as (xy/r°) are zero. Collecting 
terms, one easily finds 


| ‘ee )| 
W = 2yourgr 7 y?(0)+2(a—38) [fess 
3 


[8x 
+) "sibel (1,S2+1,S,) }, 
where , 

a=(1/r), B= (x?/r'). 
That is, 


W=AI,S.+B(I,S:+1,S,), (Al) 


with A and B as given in Eqs. (1). It can be shown" 
that the Hamiltonian (A1) implies an angular variation 
of the hfs given by K?=A?cos*@+ B? sin’@ when the 
g factor is isotropic. 
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Theory of the Nuclear Magnetic Resonance Shift in Paramagnetic Crystals* 
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A theoretical study is made of the shift of the F® nuclear resonance in paramagnetic and antiferromagnetic 
Mnf, which has been observed by Shulman and Jaccarino. The problem is reduced to that of a single Mn—F 
pair. A net hyperfine interaction is shown to arise from overlap effects in the ground-state ionic configuration 
(3d)5(2s)?(2p)* and from overlap and transfer effects to the configurations (3d)®(2s) (2p)® and (3d)®(2s)?(2p)5. 
These three configurations are equivalent to a single configuration involving bonding-type molecular 
orbitals. The results are in reasonable agreement with the experiment, the theoretical isotropic shift being 
slightly too small and the theoretical anisotropic shift (small nondipolar part) being slightly too large. A 
re-appraisal is made of Tinkham’s data on paramagnetic resonance of Mn** impurities in ZnF2, which 
Bleaney has shown to be closely related to the Shulman-Jaccarino data. It is found that there is no need 
to include, as did Tinkham, a large fraction of fluorine 3s and 3p functions into the bond. 


I. INTRODUCTION 


HULMAN and Jaccarino! (SJ) have recently suc- 

ceeded in measuring the nuclear F'® resonance in 

the paramagnetic state of a single crystal of MnF», and 

they find that the resonance frequency is shifted from 

the free-ion F~ value by a surprisingly large internal 

field. In this paper we examine various possible causes 
of the SJ resonance shift. 

The experimental results of SJ can be summarized as 
follows: In the paramagnetic state of MnFs, in an 
applied field Hy parallel to the c axis, or [001 | direction, 
the nuclear F'® resonance can be fitted by the relation 


w=ynv(Hot+ AH), (1) 


with AH, as a function of temperature, directly propor- 
tional to the paramagnetic susceptibility x. When Ho 
is rotated in the (001) plane, the average resonance 
frequency is the same as with Hy along c, and there is 
in addition a small anisotropic term. 

Part of the effective field entering into A// is of 
course the dipolar field (including demagnetizing effects) 
coming from the paramagnetic Mn** ions. This has 
been calculated by Smilowitz,? and the results are 
quoted in Appendix A of SJ. 

The observed value of AH is found to be linear in Ho, 
as required for dipolar fields in a paramagnet. The 
dipolar fields account for nearly all of the anisotropic 
term but are an order of magnitude too small to explain 
the large average shift. After subtracting the calculated 
dipolar field from the measured average AH, SJ are 
left with 

(AH /Hy) =0.0735+0.0003, (2) 


* This work was done in the Sarah Mellon Scaife Radiation 
Laboratory and was supported by the United States Air Force 
through the Air Force Office of Scientific Research of the Air 
Research and Development Command. 

t On leave from Tokyo University of Education, Tokyo, Japan. 

t Present address: U. S. Naval Postgraduate School, Monterey, 
California. 

§ Present address: Institute for Solid State Physics, University 
of Tokyo, Tokyo, Japan. 

1R. G. Shulman and V. Jaccarino, Phys. Rey. 108, 1219 (1957). 

2 B, Smilowitz (private communication). 
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at 77°K. It is this large nondipolar average shift, and 
also the small remainder of the anisotropic shift, which 
we propose to explain. 

We shall show that the overwhelming contribution 
comes from a net hyperfine interaction accompanying 
overlap effects between F~ and Mn** ions in the 
ground-state nearly-ionic configuration of the crystal. 
Very briefly, the mechanism is as follows: Consider an 
Mn** ion with magnetic moment up. The 3d electrons 
will automatically be orthogonal to the down neighbor 
fluorine electrons (that is, electrons with magnetic 
moments down), but not to the up fluorine electrons. 
As a consequence the up electrons, in the crystal, will 
no longer exactly pair off with the down electrons— 
that is, their crystalline orbitals will differ. This results 
in a net internal magnetic field at the F® nucleus. We 
shall speak of an “effective fraction” f of an unpaired 
fluorine electron as being responsible for this field. The 
field will fluctuate at a rate corresponding to the Mn** 
spin-relaxation frequency, which is very rapid compared 
to the nuclear resonance frequency. The nucleus will 
therefore see an average field which will be proportional 
to the excess of Mn moments in the direction of Ho, 
or directly proportional to x. 

As has been discussed by SJ, we may picture this 
effect in terms of antibonding molecular orbitals. But 
we may also take the atomic-orbital point of view and 
think of overlap effects in the ground Mn*+tF~ state 
and transfer effects to the excited MntF state. In 
Sec. VI it is shown that these two points of view, 
molecular orbitals and atomic orbitals, lead to the 
same theoretical predictions. 

Bleaney*® has shown that the measurements of SJ can 
be correlated with the data of Tinkham* on para- 
magnetic resonance of Mn** substitutional impurities 
in ZnI’s. Much of our analysis will be based on Tink- 
ham’s work. 

We shall also use data® obtained by SJ from F" 

3B. Bleaney, Phys. Rev. 104, 1190 (1956) 

4M. Tinkham, Proc. Roy. Soc. (London) A236, 535 and 549 


(1956). 
5 V. Jaccarino and R, G, Shulman, Phys. Rev. 107, 1196 (1957). 
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resonance in antiferromagnetic MnF». From Tinkham’s 
data, Bleaney predicted this resonance frequency at 
0°K to be 179 Mc/sec, and from their paramagnetic 
data, SJ predicted 177 Mc/sec. The actual resonance 
was found near 160 Mc/sec. In Sec. III we present a 
consistent analysis of both paramagnetic and anti- 
ferromagnetic results which removes this discrepancy. 

A theoretical interpretation of Tinkham’s experiment 
has been given by Mukherji and Das.® Their calculation 
is restricted to the isotropic effect. We discuss their 
results in Sec. VIII. 


II. THE SHULMAN-JACCARINO EXPERIMENT 


Following SJ we introduce a nuclear Hamiltonian for 
the kth F® nucleus: 


xt= —yvhl*-Hot+> ; Tt. Av*- $2, (3) 


The first term is the usual Zeeman energy and the 
second term is an effective hyperfine coupling between 
the kth F' nucleus and the jth manganese spin, 
summed over all near neighbors. In addition there will 
be terms from the manganese dipolar field as mentioned 
in Sec. I, and negligible contributions from nuclear 
dipole and quadrupole interactions. It will be the task 
of our theory to account for the effective hyperfine 
tensors A”. 

Since the manganese spin-relaxation time is very 
short compared with the nuclear resonance time, the 
nucleus will see the thermal average value (S’),. This 
is related to the applied field and the paramagnetic 
susceptibility by 


(Sn, = —Hox/N 8. (4) 


As emphasized by SJ, the simplicity of this equation 
depends upon the existence of an isotropic g tensor. 
We may now write (3) in the form 


t= —yvh Dy Holl ttad; Dr L*Ani*], (5) 


where 
= NgBywh, 


and where the subscripts indicate vector and tensor 
components along appropriate orthogonal axes. We 
recall that the superscripts connect the &th F* nucleus 
to its jth near-neighbor manganese spin. There are two 
nonequivalent fluorine sites in MnF.; but since these 
differ only by a 90° rotation about the c axis, we need 
concern ourselves with the hyperfine interaction at 
only a single F'® nucleus. Thus we drop the index k 
and consider a nucleus situated as in Fig. 1. The x and 
z axes (the latter along c) are taken as shown in the 
figure; the y axis is normal to the page; these axes are 
principal axes of the hyperfine tensors. 
For the nucleus shown in Fig. 1, 


= —yvh Dy, Hol [1 t+a(AV+A!"+4,")]. (6) 


6 A. Mukherji and T. P. Das, Phys. Rev. 111, 1479 (1958). 
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By reflection symmetry A,'’=A,!", but we preserve 
the distinction for the sake of clarity. The nucleus at 
the other type of site in MnF, will have an identical 
Hamiltonian with Ho rotated by 90° about the z axis. 
It is important to realize the difference between the 
environment seen by a manganese atom (Tinkham’s 
problem‘) and the environment of an F"* nucleus. If 
we consider the manganese at site II in Fig. 1, then it 
will see type II bonds to two neighbor fluorines along 
the x axis and type I bonds to four neighbor fluorines 
in the yz plane. The notation type I and type II is 
Tinkham’s. We see that his A,!! will be identical to 
our A,!!, but that his A,! will be related to our A,"’ 
and A,!” by a 90° rotation about the z axis. Thus, 


” 
=A, ; 


(7) 


From application of Eq. (6) directly to their experi- 
ment, SJ were able to determine 


24 3+A,1=47.0 
2A +A ,!'=47.8 [46.3 (8) 
2A+A,"=46.3 |47.8). 


All values are with accuracies of +0.5 and are times 
10~* cm~'. The numbers in braces are an alternate 
possible interpretation of the experiment. 

To this set of parameters we shall add one obtained 
by the same authors® from F" resonance in antiferro- 
magnetic MnF»:. This parameter was determined by 
assuming complete sublattice magnetic saturation at 
0°K. According to Kubo,’ however, spin-wave theory 
predicts that this structure should be short of complete 
saturation by 0.075/S, or by 3% for S=3. We make 
this correction first to the dipole field, which we have 
calculated,? assuming saturation, to be 12 500 oe and 
which now becomes 12 120 oe at 0°K. This accounts 
for 48.56 Mc/sec of the measured resonance frequency 
and leaves us with 


111.43 Mc/sec= (2A ,!—A.") (0.978), 


24 J—A "= (15.340.2) X 10-4 cm“, (9) 


as compared with the value 14.6+0.2 given by SJ. 
In order to compare the four experimental values of 


Fic. 1. Fluorine site in MnF». 
The bond lengths are from Stout 
and Reed (see references 10 and 
11). 


Mn I" 


7R. Kubo, Phys. Rev. 87, 568 (1952). A recent theory of J. C. 
Fisher, Bull. Am. Phys. Soc. 4, 53 (1959), predicts only 1.3% 
short of saturation. Fisher’s theory gives 14.9 for the numerical 
value in Eq. (9). The consequent changes in the values in Eq. 
(18) would be within the quoted errors, 
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(8) and (9) with theory we must return to the Hamil- 
tonian in the form of Eq. (3) and relate this to contri- 
butions from fluorine 2s and 2% functions. But first 
we need a model of the fluorine orbitals. 


III. THE INDEPENDENT-BONDING MODEL 


If the covalent bonding tendencies are small, one 
may consider the fluorine as almost pure F~. Then in 
a calculation involving any single F— Mn bond we may 
think of that bond as a very small perturbation of the 
ionic configuration, and we may neglect any correlation 
effects among bonds as being of higher order in the 
perturbation. We simply add the independent effects 
of all the bonds. We shall call this the independent 
bonding model. Although they do not say so explicitly, 
both Tinkham and SJ analyze their data on the basis 
of this model; we shall rework their analysis with 
changes in emphasis and detail in order to clarify the 
argument and in order to correct some small, but 
important, numerical errors. In Sec. IV we consider 
the implications of a model in which the fluorine 2s 
and 2 orbitals hybridize and direct themselves towards 
the manganese neighbors. 

We may choose pz, p,, and p. functions corresponding 
to the axes of Fig. 1. However, since the matrix prop- 
erties of these functions transform as simple vectors, 
we may equivalently analyze the orbitals along and 
perpendicular to any bond we are considering. We 
make an independent analysis for each of the three 
types of bonds of Fig. 1. For each case we will call the 
p,-orbital along the bond £,, the orbital along the y 
axis p,, and the third and orthogonal orbital p,- 

If we think of the hyperfine interaction of Eq. (3) as 
caused by an unbalance of fluorine electrons of up and 
down spins, we may write for the hyperfine expectation 
value: 

1 {TS y2)= f(Ho+>d v- 
where f, and f,, are the effective fractions of 2s and 
2p, electrons producing the net hyperfine interaction. 
The Hamiltonians of (10) are the standard atomic 
hyperfine interactions and have the expectation values 
for F~ ions: 


fovd IC 2pv) , (10) 


(Foe) AsdluSs) (11) 
(opr) =A op TySy)(3 cos’Oy,»— 1), 
where 

(0) !2=1.57 cm7, 


12) 


= (8/3)rgbynh| go. 
(2/5) gByvh(r).,=0.044 cm. 


= 


Here 6,., is the angle between the bond direction of the 
py orbital and the w axis. The numerical values are 
those given by Moriya’ and are based on calculations 
by Hartree and by Barnes and Smith.® 

We note that S,’ of Eq. (10) is the spin of an Mn** 
ion, whereas S, of Eq. (11) is the spin of a single 


8 T. Moriya, Progr. Theoret. Phys. Japan 16, 23 (1956). 
9R. G. Barnes and W. V. Smith, Phys. Rev. 93, 95 (1954). 


PARAMAGNE 


FIC CRYSTALS 

unpaired fluorine electron. Therefore 
¢ i w.!)= SI Sa) ; 
and hence we may write 
Py 1,/+) ,. 

A f=$ fs'Aoe; 


tc. for p, and p,- 
Noting that the three direction cosines in 
in the square to one, we write 


A,i=A,’+A,_7(3 cos’Oy,c—1)+A4'(3 cos’,,2—1), (15) 


(13) 
(14) 


1,,7(3 cos’, ,—1), 
where 


= ° 
Ass = 5) p’Aop; 


(13) sum 


where A,’ is shorthand for (A p¢’—A px’) and A,’ is 
shorthand for (A px?—A px’). 
We now apply Eq. (15) to the bonds of Fig. 


we use Eq. (7). The result is 


1, and 


A,}=A,!+- (3 sin%é6—1)A,'—A,! 


A= {,1+24,), 
A/J=A,!+ (3 cos6—1)A,!—A, 
AM=A4U4+24 U- 

A ,U=/4 I-A U4+24,1! 


AM=4UN—A4U-—A,H, 


These equations contain six parameters on the right 
sides, whereas we have only four measurements in 
Eqs. (8) and (9). Strictly speaking, the independent- 
bonding model requires A px?= A px’, and hence A ,’=0. 
This reduces (16) to four parameters. However, it also 
requires A,!!=A,"", a relation in sharp disagreement 
with Tinkhi im’s measurements (see next section). For 
reasons we shall give later, we do not believe that 
either A,,/ or A,,’/ is large enough to make a significant 
contribution, regardless of whether the two cancel or 
not: and therefore we shall drop A,’ from (16). We 
may look on Tinkham’s discrepancy as either an experi- 
mental error or as an indication that our theory is not 
quite complete and that A, is taking the place of 
uncalculated effects which split A," from A, 

With A, dropped, Eqs. (8), (9), and (16) lead to 
(all values in units of 10~* cm) 


15.7 
15.5 
)-0.2{ 

1-03 


cos*6 = ($¢/2.11)? 
lattice!!! The 


Here we have used the value of 
=0.615, as appropriate for the MnF, 

1 We use the x-ray data of J. W. Stout and A. Reed, J. Am. 
Chem. Soc. 76, 5279 (1954). 

1 Recent data of W. H. Baur, Acta Cryst. 11, 488 (1958), gives 
the bond lengths as 2.13 A for type I, 2.10 A for type I. This 
makes cos*6=0.604, which is very near the value 0.603 used by 
SJ in their analysis. Small changes in cos*6 (the value of which 
we regard as somewhat uncertain because of the conflicting data) 
fortunately do not lead to very significant changes in the results. 
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values in braces correspond to the braces of Eq. (8). 
Since all numbers are estimated to be in possible error 
by +0.3, little significance can be attached to A,' and 
A,''. These should, however, be positive, and hence 
the nonbraced set is to be preferred. Using the values 
in Eqs. (14) and (12) we find 


f.'= (0.4940.02) X10, 

f= (0.5240.02) X10, 
fe — fox? = (0.240.3) X10, 
five! — fel = (0.440.3) X10. 


(18) 


These are the “measured values” of the effective 
fraction of fluorine electrons participating in the hyper- 
fine interaction. In Secs. VIII and [IX we compare these 
with theoretical values calculated from electronic over- 
lap and hopping matrices. 

The values (17) which are not in braces may be 
inserted into (16) to obtain (all in units of 10~* cm™) 


A,!=15.4, 
A }=15.2, 


A,'=15.6, 


A,"=17.0, 
A,=15.8, 


A M=15.8. 


(19) 


IV. THE HYBRIDIZED-ORBITALS MODEL 


Let us assume that covalent tendencies are strong 
enough so that the appropriate fluorine orbitals are a 
hybridized set directed towards the manganese ions. 
We may form such an orthonormal set as follows: 


Vy y= 2-41 (1—tan’s)'@,— (tand)¢.+¢. |, 
Wi1= (tand)¢,+ (1—tan?6) '¢-. 


(20) 


The subscripts on V indicate the manganese of Fig. 1 
towards which the orbital is directed. The p, function 
¢, is nonhybridized. 

We now assume these orbitals to be occupied by 
electrons of both up and down spins (with respect to 
H) with a slight fractional excess of down spin given by 

Orbital Fractional excess 


Wp a 

Vu b 

dy ¢ 
Straightforward evaluation of the expectation value of 
the standard atomic hyperfine Hamiltonian then yields 


5A,'=a[$(1—g)A.+}3(2q—1)A, ]—CA,, 
5A,'= a[4 (1 —g)A.— 3 (1+ 9)A,]+ 2cA p, 
5A 1=a[3(1—g)A.+3(2—q)Ap]—cAp, 
5A, =b[gA.+2(1—g)A,]—CAp, 

5A, =b[ gA.— (1—g)A, ]+2cA ,, 

SA ,"=b[gA,—(1- 


with 


q )A at —cA Pp) 


g=tan*d. 
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A consistent fit (well within experimental errors) of 
Eqs. (8) and (9) can be made with 


a= (1.27+0.03)X10~, b= (1.63+0.03) X10, 


22 
c= (0.30+0.1) X 10-2. - 


No fit can be made for the experimental parameters in 
braces. We should point out that the above fit is 
possible only if we allow 24,'—A,'=—15.3X10— 
cm. However, the experiment cannot distinguish the 
sign of this parameter, since in the antiferromagnetic 
state half of the nuclei see a manganese spin environ- 
ment just inverted with respect to that seen by the 
other half; thus two resonances will always be seen at 
+ the absolute value of the effective internal z field, 
plus applied Ho. 

We may insert the values (22) back into (21) to 
obtain (all in units of 10~* cm~’) 


A,}=7.34, A,=32.83, 
AJ=7.09, A,™=32.01, 


AfJ=7.97, AJt=32.21. 


(23) 


Comparison of Eq. (23) with Eq. (19) reveals a 
marked difference between the two theories. The 
hybridized-orbitals model puts a major portion of the 
2s function into Yy;, and therefore all 4," are much 
larger than A,!. (In order to obtain an equal amount 
of the 2s function in all the directed bonds we must 
have tan*6=4, i.e., trigonal symmetry.) On the other 
hand, as Eq. (19) shows, the independent-bonding 
model predicts all A,' approximately the same magni- 
tude as A,". 

In the next section we discuss Tinkham’s experiment, 
which gives direct evidence that Eq. (19) is very nearly 
correct for Mn-doped ZnF». We are forced to conclude 
that the hybridized-orbitals model gives a very poor 
picture of the wave functions in MnF, or ZnF». This is 
not surprising, since we know from other evidence that 
these crystals are highly ionic. 

It would be interesting to try the hybridized-orbitals 
model on a reasonably covalent crystal, such as perhaps 
MnfF;. Observations of nuclear resonance in this crystal 
have recently been reported, very briefly, by SJ.” 


V. TINKHAM’S EXPERIMENT 


In his paramagnetic resonance experiment on Mn**- 
doped ZnF2, Tinkham obtains (all values in units of 
10-* cm=): 


A,'=16.5+0.7, A,'!=14.64+1.2, 


24 
A1=18.2+0.2, A,1!=12.5+0.2. salad 


The values include the direct dipolar interaction, of 
which more below. As pointed out in Sec. III, Tinkham 
noted that since A,!!#A,"!, it is necessary to retain 
A, in Eq. (16). Since we have then six unknowns and 


2 R. G. Shulman and V. Jaccarino, Phys. Rev. 109, 1084 (1957). 
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only four measurements, we must make some simplifi- 
cations.!"* 

For Tinkham’s crystal, assuming that the presence 
of the slightly oversize Mn substitutional impurity does 
not distort the bond angles, we have!” 


cos’?6= (1.565/2.03)?=0.594. 

From the fourth and fifth of Eqs. (16) we obtain, 
from Tinkham’s data, A ,!!=0.7, and from the first and 
third of (16) we obtain A,'=3.0. If we now assume 
A,'=A," and A,'=A,"! we find (all values in units of 
10-* cm) 

A,!=16.5641.0, 
A ,!!=16.2+11.5. 


A,i=A,''=0.740.4, 
Af{=A,!'=3.041.5, 


(25) 


As noted by Tinkham, his data include direct dipolar 
effects between the Mn spin and the F nucleus, and 
these vary with the same angular dependence as the A, 
term in Eq. (15). Thus the dipolar contributions may 
be thought of as contained in A,. The magnitude" is 
(gBhyn/r*) =2.96X10~- cm for bond I, 2.9210 
cm™! for bond IT. Thus, without direct dipole: 


A,{=A,'=0+15. (25’) 

We see that Tinkham’s results are surprisingly close 
to those of SJ, Eq. (17). This close agreement was first 
noted by Bleaney,* who actually used Tinkham’s data 
to predict, with considerable accuracy, the IF’ resonance 
frequency in antiferromagnetic MnF». 

If we correct (24) for the direct dipolar contribution, 
we obtain (all in units of 10~* cm~) 


A,'=15.9, 


; A J} =15.9, 


12a Note added in proof.—Recent work of Clogston, Gordon, 
Jaccarino, Peter, and Walker (private communication) demon- 
strates that the hyperfine tensors for Tinkham’s case contain 
nondiagonal components. When these are incorporated into the 
analysis, the deduced values of A, become insignificant. 

13 Tinkham used the incorrect value cos*é = 0.633 in his analysis. 
This leads directly to A,!=1.2A,!!, which is an unusually rapid 
dependence on internuclear distance (the bond distances are 2.03 A 
for type I, 2.04 A for type IT). Tinkham explained this dependence 
by including a large fraction of fluorine 3s and 3p functions into 
the bond. On the other hand, SJ analyzed their data assuming 
A,!=A,!! to be reasonable, and our analysis [Eq. (17) ] gives 
A,4=1.05A,! (the bond distances in MnFy, are 2.11 A and 
2.14 A; note the greater difference than in ZnF2). It is hard to 
reconcile these MnF» results with Tinkham’s large 1.2 factor for 
ZnF2. However, as we show above, when our value of cos*é is 
used we obtain A ,!=1.02A,!! from Tinkham’s data—a reasonable 
result. 

Recent data of W. H. Baur (see reference 11) give bond lengths 
in ZnF2 as 2.04 A for type I, 2.01 A for type II, and hence cos’5 
=0.590. This is a trivial change. It is interesting to note, however, 
that Stout and Reed find type I bonds shorter than type II in 
both MnF: and ZnF, whereas Baur finds type I bonds always 
longer. On the other hand, from SJ data we have, in MnF» 
A,!>A,!; and from Tinkham data, in ZnF2, A,!<A,!!. Probably 
neither the x-ray nor the resonance experiments are sufficiently 
accurate to warrant any conclusions, however. 

4 Tinkham uses a direct dipolar contribution one-half as large 
as ours. This, together with his value of cos*5, accounts for the 
difference between his and our final numerical results. 


A, =17.6, 


(26) 
AP =15.4. 
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These should be compared with the SJ results, Eq. (19), 
based on the independent-bonding model. Comparison 
with Eq. (23) indicates the complete inapplicability of 
the hybridized-orbitals model. 


VI. MANGANESE-FLUORINE WAVE FUNCTIONS 


We take the ground state of MnF, as that of a perfect 
ionic crystal, which means that both ions have a 
closed-shell structure. Since, however, the charge clouds 
of both ions will overlap, this definition has a meaning 
only in the first approximation. For each ion we choose 
a set of Hartree-Fock wave functions which are com- 
puted for the ground state of a free ion. In the next 
approximation we take overlap of the wave functions 
into account. 

We denote the Hartree-Fock orbitals of the (2s)? 
and (2p)° electrons in the F~ ion by 


(’), +("). +(*). (°). 


and the Hartree-Fock orbitals of the (3d)° electrons in 
the Mn** ions by 


(W1,W2,W3,WaWs)a. 

Consider the pair of F~, Mn** ions on the x axis of 
Fig. 1. Let y: have the angular factor P2"(cos@) where 
6 is with respect to the x axis, and call this ¥i=ya. The 
overlap integrals are defined by 


S,= forovatr a)dr, 


Se= oer r)ya(r—a)dr, 


where a is the distance between the centers of the two 
ions. 

There will be no overlap between ¢, and the other y,, 
which are of the form Y2*! and Y2**. The overlap S, 
between Y,*! and the combinations Y,*! may be 
neglected since these charge clouds point parallel to 
each other. (It is for this reason that we dropped the 
A, contribution in analyzing SJ data.) 

Since from the experimental facts we expect the 
overlap integrals (27) to be rather small, we may 
restrict our attention to the 5-electron problem involv- 
ing da, $8, 2a, 628, Yaa of a single Mn—F pair. This 
corresponds to the independent-bonding model of Sec. 
IIT. 

The most important configurations are the ground- 
state configuration (A) = (3d)5(2s)?(2p)® and the two 
excited configurations involving electron transfer from 
fluorine to manganese (B)= (3d)°(2s)(2p)® and (C) 
= (3d)®(2s)?(2p)®. We note that only the fluorine elec- 
tron of spin 6 can transfer. The relevant portion of the 
Slater determinants corresponding to these configu- 
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rations are 


4) peae(1) 668 (2) ,b0(3) 6.8(4) Waa(S)| ; 
1 | psa(1) a8 (2) ,2a(3) ,28(4) Waa(S)| ; 
| pae(1) 6 68(2) 203) WaB (4) Yaa(S) |. 
Because of manganese-fluorine overlap, these con- 


figurations are nonorthogonal. Neglecting terms higher 
than first order in the overlap, we have 


(28) 


frends = for (r)Wa(r—a)d7=S,; 


freer = oer ovate- a)dr = Se : ( 29) 


[verweas =(), 


The secular equation is 


54 1—-W Kan—WS, Rac—WS, | 
‘Kea—WS, 


\Rea—-WS. 


Rep W Ipc 


ep Ree W 


To order S we may replace W in the off-diagonal 
elements by its zeroth-order value 3044 and we may 
neglect Hgc and Hep. Then standard perturbation 
theory yields as the first-order wave function 


w= NEPA se tHlc |, (30) 
with 

A,=—T, (31) 
where 


A,=Rgpp—KHaa4; Ac=Kece—KHa4; 


(32) 


T,=KHap—KHaaS,; To=Hac—HaaSz. 


The constant N is for normalization. It is seen that the 
transfer integrals T include nonorthogonality effects. 

The function (30) may be written as the single 
Slater determinant 


w= N’|d.a(1), (6. +AsWa)B(2), o2a(3), 
(d2t+Acpha)B(A4), Waa(5) |, 


and therefore the mixture of three atomic-orbital con- 
figurations is equivalent to a single molecular-orbital 


(33) 


configuration. One could, of course, have started with 
the function (33); the variational principle would then 
have yielded the solution (31) for the \’s. It should be 
noted that (30) and (33) are equivalent only because 
the electron transfer in both excited atomic-orbital 
configurations is to the same orbital ¥28, and therefore 
the term in A,A, in (33) is zero. 


VII. SPIN PROBABILITY DENSITIES 
The nuclear resonance shift is produced by an 
unbalance of probability densities in the spin states a 
and 8. To determine thus unbalance, we must calculate 
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the probability density associated with our first-order 
wave function W”. This is given by 


pD.= f |W", +5) |*dro-+-dr5! 


rj=r 


+ f (w(t, 9))dedre dr +::: 


lrg=r 


+ f )w'(t,-+-,8)[tdr dra (34) 


|r5=r 


This expression is rather difficult to evaluate with the 
orbitals as given by Eq. (33). We may, however, 
simplify the problem tremendously by transformation 
of the orbitals of (33) into an orthogonal set. As 
discussed on p. 238 of Seitz,!® the Slater determinant is 
invariant to the orthogonalization process. This is 
because this process is equivalent to adding a constant 
multiple of the elements of one row of the determinant 
to the corresponding elements in another row, which 
procedure leaves the value of the determinant un- 
changed. 

Let us assume we have transformed to an orthonormal 
set ¢;. Then (33) becomes 


we’ = (5!)-4| 6: (1),¢2(2),¢3(3),¢4(4),65(5) |, 


and the probability density may be written simply as 


(35) 


5 


P.D.=D] ¢.(n)|?. 


i=l 


(36) 


An orthonormal set may be obtained by the method 
of Léwdin'®: 
¢d, na o;(S N) i 

= Si: Toi} 23 OS its ar G59 jr ki ae ], 


where the ¢; are the original nonorthonormal orbitals 


of (33) and 
Si= f 60a, 


The integral includes a sum over spin states, that is, 
S;;=0 if the spins of ¢; and ¢; differ. 

With this procedure, the orbitals become, to second 
order in \ and S, 


$1 _ [os jars 5S or ; (S¢s+S.S oz) la, 
$= (14-22-+20,S,)-1b.—FASetAeSetAde He 
+r.Wa 18, 


(37) 


(38) 


@3 kas [o:— aS oWat ; (S.°b2t+S Sos) Ja, (39) 


b1= (1422+ 20S 0) 2—3 (AsSstrAvSetAsro)bs 
+a], 
$5=[Va—3S bs— BS ort § (SP +Se*)a la. 
18 F. Seitz, The Modern Theory of Solids (McGraw-Hill Book 


Company, Inc., New York, 1940). 
16 PO. Léwdin, J. Chem. Phys. 18, 365 (1950). 
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The probability density of a spins is given by 


(P.D.)a= | $1{?+| $3|?+ | gs]? 
= (1+5,7)62+ (14+S.?)62+ (14+SP+S.")pa 
+ 26.25 Se — Qva (S.d.+S.62), (40) 
and similarly the probability density of 8 spins is 
given by 
(P.D.)g= (1—AP— 2AS.)G2+ (1+A2— 2S) b2 
+ Ar+A) bart QWa(Ashst+Achz) 
—2(AsSstrAaSctAsr\o)Pshz. 
In these equations the expansion is to second order 
in \ and S. It is to be understood that ¢, and ¢z have 
argument (r) whereas yz has argument (r—a). 
We shall be interested in the difference: 
(P.D.)a— (P.D.)g=[(1+3S2+482—Pr2—2)Wa 
~ (Ss+As)bs— (SetAc)oz /. (42) 


This may be thought of as the probability density of 
the augmented manganese function 


va = (1+35/7+35/- svg Ao Wa 

= (Ss +As)bs— (Sa+Ac)oz. 
This function is even correctly normalized through 
second order. This is the antibonding molecular orbital 
suggested by SJ in their Eq. (12), but we now see how 
it naturally arises from the total solution of the problem. 
If only S, is taken account of, (43) reduces to 


Ya = (1-S2)4(Ya- Sis). 


This is the approximation of Mukherji and Das.® 


(41) 


(43) 


(44) 


VIII. CALCULATION OF THE ISOTROPIC SHIFT 


The isotropic shift is given by the fractional proba- 
bility charge evaluated at the fluorine nucleus (contact 
interaction). This is 


[= (P.D.), 
(P.D.) 





=(S.+A,.)?. (45) 
nucleus 


Here we have neglected ~a(—a) as negligible, that is, 
we assume that the charge density of the 3d function 
is very localized around the Mn nucleus. 

We must now compute (45) and compare with the 
“measured value” of Eq. (18). From Sec. VI we have 


As= —T,/As= (HaaSs—HaB)/(Kes—KHaa). (46) 


Terms of order S$ in 5044 may be dropped, and hence 


Raa~2(s!|H|s)\+(d H d) 
+2(sd|G!sd)+(ss|G! ss). (47) 


Here, as usual!’ 


H= (p?/2m)—e? D0 o(Z,/rig), G=e?/riz. 
17 For a more complete explanation of this notation, together 
with a discussion of the order of magnitude of the various integrals, 
see J. Yamashita and J. Kondo, Phys. Rev. 109, 730 (1958). 


PARAMAGNETIC 


CRYSTALS 


To order S we have 
Kan~ (s|H|d)+(ss|G|sd)+ (sd|G| dd) 
+[(s|H|s)+(d|H|d)+(sd|G|sd)]S,, (48) 
and thus 


T,~ (s|H|d)+(ss|G|sd)+ (sd|G| dd) 
—[(sd|G|sd)+(s|H|s)+(ss|G!ss)]S,. (49) 


(a) Estimate of S, 


By using the Hartree-Fock wave functions of 
Mn*+— (3d) and F~— (2s) we have estimated the value 
of S,. The computation is straightforward and the 
result is 


S,~0.05. (50) 


The same result was obtained independently by 
Mukherji and Das.® These authors calculate the overlap 
appropriate to the two Mn—F distances of Tinkham’s 
experiment. They obtain a theoretical ratio A,!/A,"! 
=1.05. (However, although they use a molecular 
orbital picture, they do not consider the A, contribu- 
tion.) Their theoretical ratio should be compared with 
Tinkham’s 1.20 and with our correction to Tinkham, 
1.02 (see reference 13). The distances used by Mukherji 
and Das are those of Stout and Reed; use of the data 
of Baur would lead to a theoretical ratio less than 1. 


(b) Estimate of 2, 


By a rather rough calculation we have estimated the 
value of the first four terms in (49) to be ~—0.01 
atomic unit (a.u.). For the fifth term we have used the 
value of (s|H|s) computed by Hartree, that is, — 1.0765 
a.u., and we have calculated the sixth term by using 
the Hartree-Fock wave functions of the 2s electron. 
The total result is 


T,~—0.015 a.u. + —0.4 ev. (51) 
This number is difficult to estimate with precision since 
the last three terms of (49) tend to cancel the first 
three terms. However, the order of magnitude should 
be correct. 

Next let us estimate A,. The principal contributions 
are 


— (a) the orbital energy of a 2s electron in the F~ ion, 
which is calculated by Hartree as —1.0765 a.u., or 
about —29 ev; 

(b) the orbital energy of a P2°-type electron in the 
(3d)® configuration of Mn*, which we estimate from 
the spectrum data (Kaiser’s table) as about — 10 ev; 

(c) the change in the electrostatic energy due to the 
transfer. This will be equal to the energy created in an 
Mnf; crystal by placing a negative electron at a Mn** 
site and a positive electronic charge at a F~ site, or 
(neglecting charge overlap and polarization effects) 
equal to 


—eVynteV r—(e’ ‘a). (52) 
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Here Vun and Vy are the crystalline electrostatic 
potentials at the Mn and F sites, respectively, and a is 
the smallest Mn—F separation. We may relate this 
expression to the Madelung energy per molecule,'* 


—(a ‘a)e?=4[ Vun(2e) +2V r(—e) ], (53) 


where a is the Madelung constant (=4.816). The 
electrostatic energy-change due to transfer is thus 


(4.816—1) (e?/a) =0.95 a.u. ~ 26 ev. 


The value of A, is (29—10+426)=45 ev in the first 
approximation. The effect of lattice polarization around 
the excess charge will of course reduce this energy. 
We know from the work of Mott and Littleton however 
that this polarization energy is only of the order of 
several electron volts, so it is doubtful if A, can be 
smaller than ~35 ev. Thus we can hardly expect a 
larger value than 


d,~0.4/35=0.01. (54) 


We note that A is positive, which corresponds to a 
reasonable molecular orbital in Eq. (43). 
(c) Estimate of Isotropic Shift 
The isotropic shift is now given by 


f,= (S,+A,)?= (0.06)2=0.36X 10-2, 


(55) 


which is to be compared to the experimental value 
0.50% 10~* of Eq. (18). At the present state of the 
development the reason for the discrepancy is not clear, 
although we can imagine the following possible causes: 
(1) The theoretical value of S, is not accurate, because 
the wave functions of the free ions should be modified 
in the crystal. In particular, the crystalline field may 
deform the F~ ion; and since this field consists of 
electrostatic field, exchange effects, and overlap effects, 
the degree of deformation will depend on the spin 
configuration of the surrounding Mn ions. (2) The 
theoretical value of 7, is not accurate for similar 
reasons. (3) The estimate of A, is in error. (4) There 
are other causes of the shift. 


IX. CALCULATION OF THE ANISOTROPIC SHIFT 


The net anisotropic hyperfine interaction will be 
proportional to 


f1—s cost 8[(P.D.)a—(P.D.)s]dr, (56) 


where @ is the angle between r and the axis of quanti- 
zation. On substitution of Eq. (42) into the above, it is 
seen that the contribution from y,’ is simply the dipolar 
field, at the F nucleus, of a Y2° electron centered about 


18 See M. Born and M. Goeppert-Mayer, Handbuch der Physik 
(Verlag Julius Springer, Berlin, 1933), Vol. 24, part 2, page 707. 
The definition of the Madelung constant given on p. 77 of reference 
15 does not correspond, for the rutile structure, to the value of 
the Madelung constant given in that book. 
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the neighbor Mn (multiplied by a very small correc- 
tion). When this is added to the dipolar fields of the 
other four Mn 3d electrons, one has the dipolar field 
of a spherical cloud at a point essentially outside the 
cloud. This is the same as the dipolar field from an 
equivalent spin at the Mn nucleus, plus higher poles 
which may be neglected. This dipolar field has already 
been included in the dipolar lattice sum. 

The contribution from the ¢,? and the ¢.¢. terms 
will integrate to zero. 

The contribution from the Yad, and Pad. cross terms 
may be estimated as follows. We assume that the 
overlap region is concentrated along the Mn—F axis 
at the fluorine ionic radius r;. Then we obtain, approxi- 
mately 


(1—3 cos’;)r; {- Sn) | vatoudr 


—2(So+d.) fvroar| 


= —2(1—3 cos8:)r7[S(SetAs) +Se(Se+Az) ], (57) 


where 6; is the angle between the axis of quantization 
and the Mn—F axis. 

Finally we have the contribution to (56) from the 
¢, term: 


(Si +) f 1—3 cos’6)r-*| .|%dr 


= (S,+X,)?(1—3 cos’6z)(2/5)(r-)oy. (58) 
The anisotropic shift is given by the sum of (57) and 
(58) divided by 


: fa —3 cos’6)r(P.D.) adr. 
Thus: 


fra= (Setde)?—0.045[.5,(SetAs)+Se(Seto)], (59) 


where the numerical value is obtained from (r~*)s, as 
given by Barnes and Smith’ and r; as approximated 
by the Pauling radius of F-. 

By using the Hartree-Fock wave functions, we have 
computed'® 


S,=0.060. (60) 


By methods similar to those of Sec. VILI we have 
estimated 7,~ —0.4 ev. In the case of the 2,,-electron 
the orbital energy in the F~ ion is only about —4 or 
—5 ev, so that A, becomes ~10—15 ev. Hence 


Ag~0.4/10= 0.04, (61) 

8 A preprint version of our work which was distributed in 
October, 1958, gave S,=0.045. A subsequent calculation by W. 
Marshall yielded a higher value. We wish to thank Dr. Marshall 
for calling our attention to this discrepancy, which led to discovery 
of the omission of an important term in our original calculation. 
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or perhaps less. Finally 


fre~1X10-, (62) 


or perhaps less. The Yad, and Wad, cross terms in (59) 
make a very small contribution. 

This is to be compared to the experimental values of 
Eq. (18). It is seen to be much too large, although the 
uncertainty in the experimental values makes definite 
conclusion unwise. The discrepancy could be cdused by 
(1) a non-negligible value of f,,; (2) underestimate of 
the size of the cross terms. 

We note that, regardless of the signs of S or A, Eqs. 
(55) and (59) predict a paramagnetic shift, i.e., AH 
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positive. However, the y; electron must exist in the 
transition metal ion for the theory to apply. In the 
case of CrF;, where this electron is expected to be 
absent, the nuclear resonance is found to be unshifted.” 
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Anomalous Skin Effect in Bismuth*t 
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High-frequency (23.5-kMc/sec) surface resistance measurements have been made on plane surfaces of 
single-crystal bismuth at 2°K as a function of orientation. It has been ascertained that extreme anomalous 
skin effect conditions prevail, allowing details of the Fermi surface to be deduced from Pippard’s theory. In 
Shoenberg’s model of the electron band, components of the inverse effective-mass tensor divided by the 
Fermi energy are found to be a:/E.=9.10, a2/E,=0.088, a3/E,=4.7, and a4/E,=0.38 (in units of 10°/ev). 
These results are in essential agreement with values obtained from de Haas-van Alphen experiments and 
cyclotron resonance. The number of ellipses is definitely established to be six and the number of electrons 
found to be N=5.5X10!"/cm’. The parameters for the two hole ellipsoids are found to be 8:/E,;=82/E,=1.5 
and §3/E,=0.12. Assuming Shoenberg’s value E,=0.0177 ev, we calculate E,=0.00112 ev from specific 
heat data. It is also found that the reflection of carriers from the surface of the sample is predominantly spec 
ular in contrast to diffuse reflection found in other metals. 


1. INTRODUCTION 


T has been shown! ? that measurements of the high- 
frequency surface resistance of pure single-crystal 
metals can yield valuable information about the Fermi 
surface when so-called extreme anomalous conditions 
are achieved. This requires that the mean free path / 
be much greater than the skin depth 6, and that wr be 
small enough to prevent relaxation effects, where w is 
the frequency and 7 the relaxation time. 
For a plane surface, the surface resistance will have 
the form 


R=R, cos*0+R, sin’6, (1) 


where x and y are principal axes of the surface, and 6 


* Submitted as a thesis in partial fulfillment of the requirements 
for the degree of doctor of philosophy at the University of Chicago. 

t This work was supported in part by a grant from the National 
Science Foundation to the University of Chicago for research on 
the solid-state properties of bismuth, antimony, and arsenic, and 
in part by a grant for low-temperature research under the direction 
of Professor E. A. Long of the Low-Temperature Laboratory of 
the University of Chicago. 

t Now at Bell Telephone Laboratories, Murray 
Jersey. 

1A. B. Pippard, Proc. Roy. Soc. (London) A224, 273 (1954). 

2 , H, Sondheimer, Proc. Roy. Soc. (London) A224, 260 (1954). 
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the angle between the current direction and the «x axis. 
Pippard,' assuming that only those electrons traveling 
nearly parallel to the surface are effective in absorbing 
energy, derives the expression 


V3P wh’ | 
=p 
2 Let S| py| dy 


where py is the radius of curvature of the Fermi surface 
in a plane normal to the y axis at the point where the 
normal to the Fermi surface is parallel to the surface 
of the metal. The quantities p, and y are expressed in 
units of momentum. The integration must include 
summing over all sheets of the Fermi surface when it is 
not simply connected. A similar expression holds for 
R,. The constant 6 is equal to 8/9 for specular reflection 
of electrons from the surface of the metal and equal to 
1 for diffuse reflection. The choice of b for the inter- 
pretation of the present experiment will be discussed 
in Sec. 5. It will be noted that R is independent of the 
relaxation time and depends only on the geometrical 
shape of the Fermi surface. It is further evident that a 
the Fermi surface from the measured 


(2) 


deduction of 
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Fic. 1. Schematic drawing of the apparatus. 


dependence of R on orientation may admit more than 
one solution so that its value lies primarily in fixing 
details once a general shape is known from other 
considerations. 

The band structure of bismuth has been studied 
extensively, and it has been found that the uppermost 
filled band overlaps slightly with the next highest band. 
This causes a small number of electrons to “spill over” 
and leaves an equal number of holes behind. The Fermi 
surface has been studied experimentally by the de 
Haas~van Alphen,’ cyclotron resonance,*~* and galvano- 
magnetic’ effects so that a reasonably well-established 
model exists for the interpretation of anomalous skin 
effect data. The model consists of either three or six 
electron ellipses and one or two hole ellipses, and various 
values for effective masses and Fermi energies exist in 
the literature. The possibility of greater accuracy for 
the electron band, the relative lack of data concerning 
the hole band, and the possibility of ascertaining the 
actual number of ellipses instigated the present 
investigation. 


2. EXPERIMENTAL METHOD 


To obtain a large mean free path and small skin 
depth, the experiment was carried out at 23.5 kMc/sec 
and 2.1°K on samples of bismuth highly purified by 
zone refining. Since the skin depth is approximately 0.5 
micron, it is necessary that the surface be smooth on 
this scale and free of strain in order to approximate the 
bulk material. This was accomplished by cutting }-in. 
slabs of desired orientation with a 280-grit alundum 
wheel and sandblasting them into 3-in. diameter disks. 
One surface was then electropolished by passing a 
current of about 1 ampere through a solution of 35 g 
KI, 1 g I, and 10 ccSHCl in 200 cc of water, and a rinse 
of } HCI and 3 alcohol was used to remove a chemical 


83D. Shoenberg, Proc. Roy. Soc. (London) A170, 341 (1939); 
Trans. Roy. Soc. (London) A245, 1 (1952). J. S. Dhillon and D. 
Shoenberg, Trans. Roy. Soc. (London) A248, 1 (1955). 

4J. E. Aubrey and R. G. Chambers, J. Phys. Chem. Solids 3, 
128 (1957). 

6 J. E. Aubrey (private communication). 

6 Galt, Yager, Merritt, Cetlin, and Brailsford, Phys. Rev. 114, 
1396 (1959). 

7B, Abeles and S, Meiboom, Phys. Rev. 101, 544 (1956). 
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film which accumulates. This was done for about 10 
cycles of 60 seconds each during which approximately 
0.015 in. of material was removed. Microscopic ex- 
amination indicated that the resulting surfaces were 
flat on the scale of 1 micron and x-ray examination 
indicated that they were free of strain. 

The experimental technique was essentially the 
calorimetric one used by Fawcett® in measuring the 
microwave surface resistance of tin and is shown 
schematically in Fig. 1. A rectangular microwave cavity 
C which terminates the end of a stainless steel rec- 
tangular line W is enclosed in a vacuum space with a 
brass plate P serving as contact to the helium bath. The 
sample S is lightly spring loaded against one end of the 
cavity and a dummy sample of brass D at the other. 
Both are thermally isolated from the cavity by means 
of 0.003-in. mica spacers M and connected to the helium 
bath through heat leaks of stainless steel tubing Z. The 
pressure is lowered to 10~® mm of mercury in order to 
prevent heat conduction by gas. Standard choke plate 
flanges F on the ends of the cavity prevent microwave 
power from being coupled into the vacuum space. 
Carbon resistance thermometers T are placed on the 


Z 


Fic. 2. The relation- 
ship between lattice 
vectors a), a2, and a; of 
the rhombohedral unit 
cell and the Cartesian 
coordinate system in 
which the Fermi surface 
is expressed. 








sample, on the dummy, and on the contact to the helium 
bath, and form arms of a dc bridge. The microwave 
frequency is adjusted to the resonant frequency of the 
cavity, causing a temperature rise in the samples due 
to the microwave power absorbed, and the ratios of 
sample to bath and dummy to bath resistances are 
separately measured. With the microwave power off, 
dc heaters H on both sample and dummy are adjusted 
to give the same ratios and the dc powers measured. 
Since sample, heater, and thermometer are in good 
thermal contact compared to the heat leak, this gives 
an absolute measure of the microwave power absorbed. 
The surface resistance R of the sample is given by 
R=KP,/P., where P, and Pq are the powers absorbed 
in the sample and dummy, and K depends upon 
geometry and the dummy surface resistance, both of 
which are kept constant throughout the course of the 
experiment. This constant K is found by measuring an 
alloy for which the surface resistance can be computed 
from its de resistivity p using the classical formula 


R= (2rwp/c*)!. (3) 
8E, Fawcett, Proc. Roy. Soc. (London) A232, 519 (1955). 
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The resistance of the carbon thermometers at 2.1°K 
was 5000 ohms and changed 15% with 10 microwatts 
of power being absorbed by the sample, corresponding 
to a temperature rise of approximately 0.1 degree. This 
change and the dc power could be measured to better 
than 0.1%. The main source of error in the power 
measurement came from drifts in the incident micro- 
wave power so successive readings of the two ratios 
were made and averages taken. Repeated measurements 
of P./Pa show less than 1% scatter. The heat leak 
across the mica spacers was measured and found to be 
negligible compared to the stainless steel tubing. A 
separate carbon resistor not in contact with the sample 
or cavity was placed near the side of the sample to 
determine the leakage of microwave power past the 
choke plate. Errors from this source are estimated to be 
less than 1%. 

3. RESULTS 


Bismuth has a crystal structure which can be 
described by a rhombohedral unit cell containing two 


Z 
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Fic. 3. Angles used to specify the directions of surface 
normals in the X YZ coordinate system. 


atoms. A Cartesian coordinate system XYZ is chosen 
in which the trigonal axis lies along Z and a binary axis 
along X. The relationship of the unit cell vectors in this 
system are given in Fig. 2. Another coordinate system 
xyz is defined for each sample in which z is normal to 
the surface and y perpendicular to the trigonal axis. 
From symmetry considerations, this results in « and y 
being principal axes of the surface as defined by (1). 

To investigate the dependence of surface resistance 
upon relaxation time, samples of two different purities 
were studied. One was a spectroscopic grade purchased 
from Cerro de Pasco from which crystals were grown 
by pulling from a melt. The residual resistivity ratio 
for current flow perpendicular to the trigonal axis was 
104. The other was the same material which had been 
further purified by zone-refining in this laboratory§ and 
samples cut from large grains in the resulting ingot. 
The residual. resistivity ratio for the same current 
direction was 220. 

§ The author is indebted to D. H. Reneker for supplying the 
zone refined material. 
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TABLE I. Experimental values of the surface resistance and 
values calculated from the deduced mass parameters assuming 
specular reflection. 


Measured R Calculated R 
P,/Pa (10-2 ohm) (10-2 ohm) 


Spectroscopic grade 
R,(X) 
R,(X) 
R,(Y) 
R,(Y) 
R(Z) 

Zone-refined 
R,(X) 
R,(X) 
R.(Y) 
R,(Y) 
R,(0=135° 
R,(@=135 
R,(@=75°) 
R,(@=75°) 
R,(6=105°) 
R,(@=105°) 


Sample 





1.333 
1.617 
1.058 
0.741 
0.961 


7.50 
9.10 
5.96 
4.17 
5.41 


7.24 
901 
6.01 
4.47 
5.38 


7.43 
8.56 
5.70 
4.37 
8.51 
6.59 
8.68 
6.46 
6.33 
4.03 


7.24 
9.01 


1.320 
1.521 
1.012 
0.776 
1.512 
1.170 
1.541 
1.147 
1:138 
0.716 


The measured values of P,/P., and the corresponding 
values of surface resistance R=KP,/Pa for each 
orientation measured are shown in Table I. The symbol 
in brackets to the right of R indicates the direction of 
the surface normal in the coordinate system of Fig. 3. 
For the alloys, P,/Pa, their measured dc resistivities, 
and the computed value of K are shown in Table II. 
Both the brass and bronze were found to be homo- 
geneous by metallographic inspection and were chosen 
largely because their surface could be given a smooth 
polish. The similar values of p are merely fortuitous. 
The scatter in K indicates that the absolute calibration 
is accurate to 2%. The largest discrepancy between 
spectroscopic grade and zone-refined samples is 6% 
and the average error for the four where comparison is 
made is 4%. It is felt that these discrepancies result 
largely from surface preparation and misorientation 
errors. The average error in orientation is estimated to 
be 14° and the maximum error estimated to be 3°. 
This can be shown to give at most a few percent error 
in R for all orientations except 0=75° and 6=105°. 
These orientations will be discussed further below. 


4. DEDUCTION OF THE FERMI SURFACE 


Following the model proposed by Shoenberg,’ the 
electron band is assumed to consist of six ellipsoids, one 
of which is given by 


2mE.=aipr+arp,+asp2t 2aspyp:, (4) 


TABLE IT. Absolute calibration data. 


P./Pa 


1.025 
1.039 
1.061 


K =R/(P./Pa) 


0.0573 
0.0564 
0.0553 


K 4, =90.0563 


Sample 
Brass No. 1 
Brass No. 2 
Bronze 


p 
(10-*ohm cm) (ohms) 


0.0587 
0.0587 
0.0586 


3.74 
3.74 
3.71 
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TaBLe IIT. Values of a;/E, and 8;/E, in units of 10°/ev for the cases of specular and diffuse reflection of carriers from the surface. 
Shoenberg’s original data and the data as modified by cyclotron resonance is presented for comparison. The number of electrons N is 


calculated from a;/E, using Eq. (6). 


Source ar/ Es 


0.088 


0.062 
0.045 


Anomalous skin effect (specular) 

Anomalous skin effect (diffus 

de Haas-van Alphen effect* 

de Haas-van Alphen effect modified 
by cyclotron resonance” 

de Haas-van Alphen effect modified 
by cyclotron resonance® 


0.113 


0.094 


* See reference 3. 
b See reference 4. 
© See reference 5. 


where pz, py, and p, are momenta along the XYZ axes 
shown in Fig. 2. The other five are generated by ro- 
tations of 120° about the Z axis and inversion through 
the origin. The six minima coalesce into three only if 
they happen to lie at the centers of zone faces.’ The 
hole band is assumed to consist of two ellipsoids of 
revolution about the Z axis given by 


Bi(p2+ p,2)+Bsp2. 


Abeles and Meiboom,’ in their interpretation of gal- 
vanomagnetic effects in bismuth, propose one such 
ellipsoid, but since it is allowed by inversion symmetry, 
and to obtain somewhat closer agreement with the 
present data, two are assumed here. A drawing of the 
proposed Fermi surface is shown in Fig. 4. 

Using the above model, the integration (2) was 
carried out (see Appendix I) for each orientation leaving 
the a’s and §’s as variables and a numerical iteration 
used to determine the best fit to the data. In doing the 


2mok),= (5) 














Fic. 4. A drawing of the proposed Fermi surface and its re 
lationship to the Brillouin zone. The XYZ coordinate system is 
drawn on one electron ellipse to show the direction of tilt and the 
other five electron ellipses tilt alternately in opposite directions. 
The two hole ellipsoids lie with their principal axis along the Z 
direction 


¥M. Cohen and E. I. Blount, Scientific Paper 6-94760-2-P19, 
Westinghouse Research Laboratories, 1958 (unpublished). 


1077 XN 
(cm~) 


5.5 
9.4 
8.2 


B3/En 


0.118 
0.083 


Bi/En 


1.52 
1.07 





calculation, we assumed that the number of electrons 
N,, and the number of holes V;,, were equal. Expressing 
the number of carriers by 2//* times the volume 
enclosed by the Fermi surface in momentum space, 


8 (2moE.)! 
Ni nillpevee: ecxsermene 


7)? 


3h* [as (ava3—a4’) |} 


and 
Sr (2moli,)? 
N,=2 ’ (7) 
3h? (By7B3)? 


As noted previously, it is a Fermi momentum which is 
measured or essentially ratios of a;/E, and 6,;/E, so 
that VN,=N,=N may be used to eliminate one ratio. 
This condition of charge neutrality shows that the 
momenta are comparable, and hence electrons and holes 
will, on the average, contribute equally to /|p,!dy. 
Because of the high anisotropy, however, at certain 
orientations the resistance is largely determined by 
electron conduction and at others, largely by hole 
conduction. 

The effect of the “tilt” of the ellipses (i.e., a4) is 
largest for the orientations @= 75° and @= 105° and the 
fact that the ratios R,(@=75°)/R,(105°)=1.37 and 
R,,(75°)/R,(105°)=1.60 are greater rather than less 
than one show that a, is positive. This can be seen from 
the equations in Appendix I. However, since the actual 
values of R change very rapidly with orientation in this 
region, and are hence more subject to misorientation 
error, these samples were used largely to fix the sign of 
a, and not weighted greatly in the calculation of 
quantitative values. The resulting values for a;/£, and 
8,/E,, and the computed number of electrons N are 
given in Table III for the cases of both specular and 
diffuse reflection of carriers from the surface. Shoen- 
berg’s values for electrons are presented for comparison 
since the de Haas-van Alphen effect is the one other 
experiment in which an observed quantity is deter- 
mined directly by suitable combinations of a;/E,. No 
de Haas-van Alphen effect for holes has been reported. 
The most accurate quantities which he measures are 
of the form a.i/F2, and, as pointed out by Aubrey 
and Chambers, a1//, may be individually changed as 
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TABLE LV. Absolute values of the mass parameters and their ratios to one another. An electron energy of E,= 17.7107 ev and a 
hole energy of E,=1.2X10™ ev are assumed. Other data are presented for comparison. No tilt to the electron ellipses was assumed in 
the galvanomagnetic data. 


Source 


Anomalous skin effect (specular) 
Anomalous skin effect (diffuse) 
de Haas-van Alphen effect* 
Cyclotron resonance || field» 
Cyclotron resonance || field* 
Cyclotron resonance | field“ 
Galvanomagnetic effects® 

Sound attenuation‘ 


® See reference 3. 
> See reference 4. 
© See reference 5. 
4 See reference 6. 
¢ See reference 7. 
{D. H. Reneker, Phys. Rev. 115, 303 (1959). 


long as their product is kept constant. Such a reinter- 
pretation of Shoenberg’s data based on cyclotron 
resonance data has been carried out by Aubrey and 
Chambers and later by Aubrey. Their results are also 
included in Table III. It should be pointed out here 
that if three rather than six electron ellipses were 
assumed, the values of a;/#, derived from the present 
work would be a factor of two smaller, and be in dis- 
agreement with Shoenberg’s data. This result is in 
accord with the symmetry argument presented 
previously. 

To separate the above ratios, Shoenberg’s value for 
the electron Fermi energy, £.=17.7X10™ ev, is 
assumed. The electronic contribution to the specific 
heat y= (1.6+0.1) X10~° cal/deg?-gram atom as meas- 
ured by Kalinkina and Strekov” has been used to 
obtain a Fermi energy for the holes. This can be 
expressed as 

y=4n'h?(N/E.+N/E,), (8) 


where & is the Boltzmann constant. Inserting the value 
of N from the present work and Shoenberg’s electron 
Fermi energy results in a Fermi energy for holes of 
E,=1.12X10-* ev. The resulting mass parameters are 
shown in Table IV together with values obtained from 
other experiments. It will be noted that ay has been 
made positive in all cases to agree with the coordinate 
system used here. The sign of ay would change if the 
XYZ coordinate system in Fig. 2 were rotated 180° 
about the Z axis with the lattice vectors fixed." 


5. DISCUSSION 


Table III has been computed for the cases of both 
specular and diffuse reflection of carriers from the 
surface of the sample; the difference between them 
being that the computed a,/Z, and 8;/£, are larger in 

0 T, N. Kalinkina and P. G. Strekov, J. Exptl. Theoret. Phys. 
U.S.S.R. 34, 616 (1958) [translation: Soviet Phys. JETP 34(7), 
426 (1958) J. 

1 Jn connection with the sign of a4, we note that the relation 
between a, and m, in Shoenberg’s 1952 paper contains an error 
in sign. 


as : :a@aiasa 

6.7 ‘ 0.13 753:4.4 
4.9 : 0.16 :53:4.4 
4.0 Oe AR 
10 £47350:50 
8.33 :50:5.0 
:78:6.8 

724: 
76:6.5 


the specular case by a constant factor (9/8)*=1.42. 
Because the results following from the assumption of 
specular reflection are in much better agreement with 
the modified de Haas-van Alphen effect data, we 
believe that specular reflection does indeed take place. 
On the other hand, it has been shown experimentally 
that the reflection of electrons from the surface of metals 
previously studied” (Cu, Ag, Au, Pb, Sn, Cd, Al) is 
diffuse. This is reasonable on the basis that the electron 
wavelength for these metals is of the order of the 
distance between atoms, thus roughness on an atomic 
scale would give rise to diffuse reflection. In the semi- 
metal bismuth, however, the wavelength is of the order 
of 100 atomic separations; hence surface roughness on 
an atomic scale is probably unimportant. From this 
point of view, the better agreement with specular 
reflection is not surprising. 

There are restrictions on the magnitudes of the 
relaxation times and Fermi momenta which must be 
met to insure the validity of (2). For a single isotropic 
carrier, the condition is that 


3 (1/6)2>>(1+0*r?)!, (9) 


where / is the mean free path and 6 the skin depth as 
calculated from the classical formula 


° 


6=c/(2rwa)}, (10) 


and where o is the de conductivity. For two carriers, 
the skin depth is determined by the total conductivity 
and the inequality (9) is required to hold for each 
carrier. The condition (9) can be checked if the 
electron and hole mobilities known. However, 
mobility estimates based on existing magnetoresistance 
and Hall-effect data may be unreliable because the 
magnetic fields used did not satisfy the low-field con- 
dition w.rK1, where w, is the cyclotron frequency. 
Nonetheless, as a guide, electron and hole mobilities 
obtained from galvanomagnetic measurements” at 


are 


2 R. G. Chambers, Proc. Roy. Soc. (London) A215, 481 (1952). 
3 A. L. Jain, Phys. Rev. 114, 1518 (1959). 
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4.2°K and the measured dc conductivity of our zone- 
refined samples, «= 2.0X10® (ohm-cm)~', have been 
used to obtain mean free paths for electrons and holes 
of /,=0.18 cm and /,=0.06 cm and relaxation times of 
tT-=3X10-" sec and r,=3X10~* sec. The inequality 
is seen to be satisfied by a factor of 10° for the electrons, 
but for the holes it is not satisfied by a factor of 10? 
because of the large wr,= 370. 

The above result led to an investigation of the effect 
of a large hole relaxation time upon surface resistance. 
Dingle has presented tables which show the variation 
of surface resistance with relaxation time for a single 
isotropic carrier. They exhibit the feature that when 
1/6>>1, as 7 increases, one goes from the anomalous skin 
effect region into a transition region which is dependent 
upon 7, and finally into a so-called extreme anomalous 
relaxation region. This last region, for diffuse reflection, 
is again independent of 7, and, for specular reflection, 
reverts to the r-dependent classical (J/6<«1) result. 
The above estimates indicate that the electrons are in 
the anomalous skin effect region and the holes in the 
extreme relaxation region. Therefore, an exact calcu- 
lation of surface resistance for the case of two carriers 
was initiated, the details of which are presented in 
Appendix II. An isotropic model was assumed by 
transforming the ellipses into spheres of the same 
volume and, in the above limit, it was found that for 
both types of scattering from the surface, the hole 
contribution to the surface resistance was completely 
negligible. Assuming electrons alone, however, leads to 
mass parameters which are radically different from 
those obtained by other methods. Measurements on 
the four orientations R,(X), R,(X), R.(Y), and R,(V) 
were made using both zone-refined and spectroscopic- 
grade samples and, as seen in Table I, were found to 
agree within experimental error. Since the relaxation 
time of the zone-refined bismuth is a factor of two larger 
than that of the spectroscopic grade, as shown by the 
residual resistivity ratios, this independence of surface 
resistance upon 7 allows us to assume that the holes are 
not in some intermediate region. No quantitative calcu- 
lations were made, however, to find an expected vari- 
ation of R with 7, because of the complexity of the 
problem. The results of these considerations leads one 
to believe that the hole relaxation time calculated from 
the mobilities derived from galvanomagnetic experi- 
ments is either incorrect or not directly applicable to 
this problem. It is felt that 7, should instead be a 
factor of ten lower, justifying the use of Eq. (2) in 
interpreting the experimental data. In connection with 
this supposition we note that Galt ef al. find that the 
shape of the absorption curves in their cyclotron 
resonance experiments are best fitted assuming an wr 
for holes of approximately 10. 

The rms error between the observed and calculated 
values of R is found to be 4% so that adding the cali- 


4R. B. Dingle, Physica 19, 311 (1953). 
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bration error of 2% and recalling that a/£ is propor- 
tional to 1/R*, the resulting error in a,/E, and B;,/F,, 
is approximately 209%. Comparison with the de Haas- 
van Alphen effect data as modified by cyclotron reso- 
nance data shows that the electron data (assuming 
specular reflection) is in essential agreement. There 
are no existing data for a direct comparison of B;/ 
but it will be seen from Table IV that the ratio B,/B3 
agrees well with that obtained by Galt et al. The 
discrepancy with mobility ratios might be accounted 
for by an anisotropic relaxation time. There is, however, 
a factor of 10 disagreement in absolute magnitude 
between Galt’s 6; and the 8; obtained from this experi- 
ment in conjunction with the specific heat data. 

A possible explanation of the discrepancy would be 
a division of the holes into a heavy-mass band and a 
light-mass band. If both hole bands have the same 
symmetry and mass ratio, it would be impossible to 
distinguish between them from the anomalous skin 
effect data. However, the specific heat data places a 
restriction on the possible numbers of light and heavy 
holes. As an example, we can assume Galt ef al.’s 
average hole mass of m,=0.16mo for the light band 
and m,= 2m» for the heavy band; then the ratio of the 
number of light holes to the number of heavy holes 
must be no more than 3. In connection with this 
supposition, Jain has suggested that an anomaly in the 
low-temperature resistivity of bismuth-antimony alloys 
may be explained by the existence of a third set of 
carriers. 


APPENDIX I 


The integral in Eq. (2) has been carried out as a 
function of orientation for the relatively simple cases 
of the surface normal being parallel to the XY, XZ, and 
YZ planes shown in Figs. 2 and 3. The directions of the 
surface normals are specified here by the angles shown 
in Fig. 3. Since the total integral includes summing over 
ellipses, the contribution of each ellipse is presented 
separately. A contribution from an ellipse obtained by 
inversion is the same as that of the original ellipse. 

For the electrons, ellipse No. 1 is that given by Eq. 
(4), and ellipses No. 2 and No. 3 are those obtained by 
rotations of +120°. Because of their similarity except 
for signs, ellipses No. 2 and No. 3 will be presented as 
a single expression with the upper sign referring to 
No. 2 and the lower to No. 3. Letting a;=a;/2mok,, 
we find that 


f | py|dy= (2/2) ,/(a1(a2a3— a’) |}, 


(11) 
fie-\ae- (x /2)I2/La1(a2a3— a4") }}. 


The J’s for the various ellipses and orientations are 
given below. 
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ZY Plane 
No. 1: J,=2d;/ (a2 sin*@+ a3 cos’6+ 2a, sind cos#)!, 
T= 2(a2a3— a4?) / (a2 sin?@+-a; cos’é 


+ 2a, sinO cos6)?. 
No. 2 and No. 3: 


T,=4{4aja2 sin?6+ [a3 (a:+3a2)—3a,? | cos’6 
— 4aa, sind cos} (3a;+ a2) sin’é 
+4a; cos?@— 4a, sin® cosb}-}, 
T,=4[[a3(3a1+ a2) — ag |/[(3a1+¢2) sin’d 
+ 4a; cos?@— 4a, sind cos6 }?. 
XZ Plane 
No. 1: J,=2[a2(a; sin’6+ a; cos*o) — a4? cos*¢ |/ 
[ a; sin’@+ a; cos*@ |}, 
1, = 2a,a3/ (a; sin’@+ a3 cos’) !. 
No. 2 and No. 3: 
IT ,=4{4a,a2 sin’¢+ [a3 (341+ a2) —a¢ ] cos’ 
¥4v3a,a, sing cosd}[ (ai +3az) sin’ 
+ 4a; cos’*6F4vV3a, sind cos¢ |}, 
1, =4(a,a3+3a2a3— 3a2)/[(ai+3az) sin’ 
+ 4a; cos*@F4v3a, sing cos¢ ]}. 


XY Plane 
No. 1: [,=2a,a2/ (a; cos*¢+<az sin’¢)?, 
T= 2[a3(sin*¢a2+a, cos*¢) — az? sin’t |/ 
(ay sin’*¢’+a, cos’) !. 
No. 2 and No. 3: 
1 ,=16a,a2/[ (3a;+ a2) sin’¢+ (a;+3a2) cost 
+2v3(a2—a1) sing cos¢ ]}, 
T,=4 fas (3a:+ a2) sin’f+ (a1+3a2) cos*f 
+2v3(a2—4,) sing cost ]—a,?(sing 
+Vv3 cost)?}[ (3ai+a2) sin’*¢+ (a:+3a2) 
X cos’¢ + 2V3 (a2— a) sing cost }-?. 
The hole band, given by Eq. (5), is much simpler 
and its_full orientational dependence can be described 
by the single angle @. Any rotation of a surface normal 
about the Z axis leaves the surface resistance un- 
changed. The integral in (2) for a single ellipse is given 
below. 


f |p. |dx=a2moE;/B3(B1 sin’?@+B; cos’é)?, 


f | py | dy=a2moF183'/ (B81 sin’?0+B3 cos’6)’. 
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APPENDIX II 
We shall use the following symbols: 
j=e refers to electron parameters, 
j=h refers to hole parameters, 
p;= Fermi momentum, 
m;= effective mass, 
/;=mean free path, 
lo= (Le+I,)/2, 
L;= L;/lo, 
A ;= 89 wp 7le/Ch', 
j= L;/(1+7;), 
a;= 173A, 
ok fia lyj, 
B=o"l,?/c?= displacement current term, 
Z=complex surface impedance. 

A calculation of the surface impedance from the 
Boltzmann equation has been carried out for the case 
of two spherical bands of the form E.=p7/2m, and 
E,=pi?/2m,. The derivation closely follows Reuter 
and Sondheimer’s!® calculation for the case of a single 
carrier and need not be repeated here. We first consider 
the case of diffuse scattering from the surface. Utilizing 
Dingle’s“ suggestion for simplifying Eq. (A36) of 
Reuter and Sondheimer, the admittance Y=1/Z is 


given by 
(12) 


Y= cy/Ariwly, 
where 


i 8) 


=f of dt Inf{1+[@.K(s,.)+@,K (s,)—B]/@. (13) 


0 
Partial integration of (13) puts it in the form 
Ss GK (Se) + AnrK (Sp) 
y=(2 of dl : ; 
0 GK(s.)+QnK (si) +?—B 


where 


(14) 


K(s) = (1/s*)[—5s+ (35°+5) tans], 
and 
K(s) = (2/s*)[ —s+ (s?+1) tan's ]. 
The surface impedance for the case of specular re- 
flection, similar to Reuter and Sondheimer’s equations 
(21) and (29), is found to be 
Z= (Arial o/c >it, 
where 


p= (2/n) f di @.K (s.) + @aK (s,) +l—B}. (15) 


The functions K(s) and K(s) are similar and, to a high 
16G, E. H. Reuter and E. H. Sondheimer, Proc. Roy. Soc. 
(London) A195, 336 (1948). 
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degree of accuracy, exhibit the behavior of being 
constant for s<1 and going smoothly into a 1/s de- 
pendence for s>10. The estimates presented in 5 show 
that, in the present notation, the inequality (9) 
becomes for the electrons, |A,||y,-|*>>1, and for the 
holes, | Ax! |y,|*<<1. Since A, and A, are of the same 
order of magnitude, this implies that |y.|>>|y,|. In 
this limit, it can be seen that one may replace K,(s.) 
by its asymptotic value 2/s,, K(s.) by 3m/2s,, and 
K(s,) and K(s,) by their value for s=0 which is 4. 
Equations (14) and (15) now become 


y (2/m) f dt(C’+Dt)/(C+Di+F), (16) 
0 


di(C+Di+F) i (17) 
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where C=7@./7., D=4@,/3, and C’=3C/2. Equations 
(16) and (17) can be easily integrated and the result 
is that one obtains a value for the surface resistance 
which is, to an accuracy of better than 0.1%, the same 
as the anomalous skin effect value for the electrons 
alone. This is not surprising when one considers that 
the electron term is C= 108 and the hole term is D~ 104. 
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Zeeman Splittings of Paramagnetic Atoms in Crystalline Fields 


H. Starz, Research Division, Raytheon Company, Waltham, Massachusetts 


AND 


G. F. Koster, Massachusetts Institute of Technology, Cambridge, Massachusetts 


The paper deals with the energy levels as a function of magnetic field of paramagnetic ions in crystalline 
surroundings. The necessary information is given to write down the Hamiltonian matrix for ions in common 
crystalline fields. The method has been previously developed and it consists of determining the Hamiltonian 
matrix of BH-(L+2S) for the lowest states of an ion by the methods of group theory and time inversion 
considerations. The Hamiltonian matrix contains a certain number of undetermined constants which may 
be evaluated if the wave functions of the ions are known or the constants may be determined from experi- 
ment. In general, the number of available constants is larger than found in the conventional spin Hamil- 
tonian formalism. The present method may be applied in cases where the conventional spin Hamiltonian 
formalism is inadequate to describe experimental results or where greater accuracy is required. 


I. INTRODUCTION 


N a previous paper' the authors developed a method 
of treating Zeeman splittings of paramagnetic 

atoms. The method was based on symmetry arguments 
and thus was very general in its results. In particular, 
it contained more terms and available constants than 
the generally used effective spin Hamiltonian.** It has 
been demonstrated by the wide use of the spin Hamil- 
tonian that the additional terms of the new method in 
general do not represent a sizable contribution to the 
energy. However, as more precise measurements cover- 
ing a large range in frequencies become available, the 
theoretical treatment of reference 1 may be useful in 
fitting the experimental data. 

1G. F. Koster and H. Statz, Phys. Rev. 113, 445 (1959). 

2B. Bleaney and K. W. H. Stevens, Repts. Progr. in Phys. 16, 
108 (1953). 

3K. D. Bowers and J. Owen, Repts. Progr. in Phys. 18, 304 
(1955). 


In order to apply this new method to the various 
crystalline symmetries and the various paramagnetic 
ions certain matrices have to be evaluated. In reference 
1 the method has been demonstrated for a ®S state in a 
crystalline field of cubic symmetry. In this paper we 
shall give the necessary information to write down the 
Hamiltonian matrix for ions in some common‘ point 
groups of crystal lattices. 

For coherence it will be necessary to repeat here 
brietly some of the results of reference 1. 

In the method of reference 1, it is assumed that the 
wave functions of the ion in the crystalline field are 


‘The complete tables may be obtained by writing to the 
authors. They have also been deposited as Document No. 6059 
with the ADI Auxiliary Publications Project, Photoduplication 
Service, Library of Congress, Washington 25, D. C. A copy may 
be secured by citing the Document number and by remitting $6.25 
for photoprints or $2.50 for 35-mm microfilm. Advance payment is 
required. Make checks or money orders payable to: Chief, Photo- 
duplication Service, Library of Congress. 
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exactly known. The influence of the magnetic field is 
then taken into account by adding to the unperturbed 
Hamiltonian Hy) the perturbation 1,=8H-(L+2S), 
where @ is the orbital gyromagnetic ratio, H is the 
magnetic field, L and S are the resultant sum of the 
orbital and spin angular momentum operators of the 
individual electrons in the crystal. Diamagnetic terms 
quadratic in the magnetic field have been neglected. 
These terms are expected to be very nearly the same 
for the lowest states under consideration. For transi- 
tions where only energy differences between levels are 
important these terms may be omitted. The resulting 
Hamiltonian matrix will be diagonalized only for the 
lowest states of the unperturbed Hamiltonian Hp. 

The error due to this approximation can be made 
arbitrarily small by including more and more states 
into the matrix which is to be diagonalized. The re- 
maining error of the ith state can be estimated by 
second order perturbation calculation to be 


Lill As |? (E— i) 1, 


where the sum j is over all states not included in the 
diagonalization, H/;;") is the matrix element of 4, be- 
tween the states 7 and j, and £; and &; are their un- 
perturbed energies. As an example, consider the ®S of 
the d° configuration considered in reference 1. All 
multiplets have been neglected except the levels arising 
from the ®S. One knows? that the neglected multiplets 


are separated from the *S state by an energy of the 
order of 20000 cm and more. If one estimates H;; 
to be of the order of the splitting of the ®§ state due to 
the magnetic field (0.3 cm for measurements in Y- 


band), one obtains a correction of about 107° cm“. 
This is more or less an upper limit to the error since if 
the higher multiplets were not split by spin-orbit 
interaction and the crystalline field the sum of the 
matrix elements over the various states of a multiplet 
would yield zero. The probable error, of course, becomes 
much larger when one has a degenerate orbital state 
lying lowest which will be split by a crystalline field. 
If one neglects the split-off components, E;—£; will 
be typically of the order of 100 cm™'. However, the 
present treatment is always expected to be better than 
the conventional spin Hamiltonian as is evidenced by 
the larger number of available parameters. 


II. SYMMETRY TYPES OF WAVE FUNCTIONS 


For the application of the method it is first required 
to find the transformation properties of the lowest 
states of the paramagnetic ion in the crystalline field. 
We shall illustrate the necessary procedure in the ex- 
ample of Cr*** in Al,O3. The eigenfunctions in the ab- 
sence of a magnetic field must transform as partners in 
irreducible representations of the point group of the 
site of the paramagnetic ion. Consider now Cr*** in 
Al,O3. We want to mention at this point, however, that 


5C. K. Jérgensen, Acta Chem. Scand. 11, 53 (1957). 
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there are actually four different places in the unit cell 
at which the Cr*** ions may be located. The surround- 
ings of these sites are related to each other either by an 
inversion operation or by twofold rotations. As can be 
shown, for a given direction of H, Cr++* ions sitting in 
sites which are related by an inversion have the same 
energy values. For sites which are related by twofold 
rotations the energy values of one may be obtained from 
that of the other by a rotation of the magnetic field 
around one of these axes. It will thus suffice to consider 
the Hamiltonian matrix for one particular site. The free 
Cr*** site® is C;. If we consider only the lowest state, i.e., 
the state with the angular momentum of 3, then we have 
to find out what kind of linear combinations we can make 
from the four substates such that these combinations 
form bases for irreducible representations for C3. From 
group theory 


1 
Ca=— > X(R)Xq*(R). (1) 
g R 


Here ¢q is the number of times which the ath irreducible 
representation is contained in a representation of the 
group. X,(R) is the character of the ath irreducible 
representation, x(R) is in our case the representation of 
the group generated by the four substates of angular 
momentum 3. g is of the order of the group. The char- 
acter x(R) under any rotation by an angle ¢ around any 
axis is 


x(R);=2 cos(3o)+2 cos(3¢). (2) 


If the rotation is followed by an inversion, the character 
is still given by Eq. (2) since the states arise from a d" 
configuration. Equation (2) is easily generalized for any 
angular momentum. If we had an ion with spin 3} we 
would have to add to Eq. (2) the quantity 2 cos($¢) etc. 
There is an ambiguity in sign in Eq. (2) for if we in- 
crease @ by 27 the sign of x(R) changes. As was out- 
lined previously,’ this results from the fact that we are 
dealing with an odd number of electrons and we have to 
make use of the double group. One choice of angle corre- 
sponds to the ¢ rotation; the other to the @ rotation. 
For an odd number of electrons the character of the 
barred operation is always the negative of that of the 
unbarred one. In checking in Table I the various irre- 
ducible representations of the double group C; with the 
aid of Eqs. (1) and (2), one finds that Ty and I’; is con- 
tained once, and that I's is contained twice. 

The same type of procedure has to be carried out for 
any problem. For example, for an even number of 
electrons with spin 1, the Eq. (2) would go into 


x (R)i=2 coso-+1. (2a) 
lor an even number of electrons there is no ambiguity 


in sign for a given rotation and @ and @ have the same 


6L. Pauling and S. B. Hendricks, J. Am. Chem. Soc. 47, 781 
(1925). 

7G. F. Koster, Solid Slate Physics, edited by IF. Seitz and D. 
furnbull (Academic Press, Inc., New York, 1957), Vol. 5, p. 173. 
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Tas_e I. Character tables of some common irreducible representations together with bases for 
representations and behavior under time inversion operation. 


(C2) (C2’) 


(C2”) 
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Time . 


Group D2: 


(ov) 


Bases for D2 
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I, 
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Double{ I; 


1 y 
—1 Sy or y 
—1 S, or 2 
1 S, Ore 


0 o(4, 4), 


Single 


1 
1 
—1 
—1 
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x or Sy, 
S, or xy 
S;ory 


¢(3, 4), o(3, —4) 
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TABLE II. Irreducible representations contained in functions transforming like angular momenta §, $, 3, 3, 3, 2, 1 (units of #). 


Angular 
momentum 


Groups D2, Cov Groups, Ds Cw, , Dra 


40; V6+20; 

3rs I'e+2r; 

2rs e+; 

’ rs 
Mtl3+ly+2Prs 
M+lstlstls 
P+T's 


rs 
l,+2P2+2P3+27F, 
2Pi+let+Ps+ls 
PetTst+l's 


DS Gone role rojas 


character. If we would thus apply Eq. (1) to an ion 
with spin 1 in the cubic group O, we find that the three- 
dimensional representation I’, (or in another notation 
I's) is just contained once. This means that we can 
make three linear combinations of the three substates 
such that they form a basis for I'y(T's5). 

In Table I, we have listed the various character 
tables for some of the point groups. Since several 
groups are isomorphic we could use the same char- 
acter tables for them; however, the labeling of the 
operations is different. We have also indicated in many 
instances bases for the irreducible representations 
which have been used in the evaluation of certain 
matrices. The main purpose of giving the various irre- 
ducible representations is to define the notation and 
identify the bases that will be used in this paper. When 
bases have been given, the order of the functions in the 
basis is as shown. When a basis is listed as the direct 
product of a given representation and a one-dimensional 
representation we mean that the matrices in the first 
are to be multiplied by the characters of the one- 
dimensional representation. In general, many sets of 
equivalent bases may be found. The bases at this point 
are only supposed to represent the transformation 
properties under spatial operations. Further below, it 
will become necessary also to consider the transforma- 
tion properties under time inversion. The bases given 
may, however, also be considered to represent the 
behavior under time inversion. 

In order to avoid lengthy tables we have omitted the 
barred operations. For the single group, the barred 
operations have the same character as the unbarred 
ones; for the double group, the barred and unbarred 
operations have characters which differ in sign. In 
general, the barred and unbarred operations belong to 
certain — 


different classes. There are, however, 


tions.’ We e have indicated by parentheses 
(T'4 | H, | I's) 
(5! | I';) 
(I's © | H,|Ts) 
(I's (11) | | H,|T's) 


around 


| I's] H1| 14) 

bee te 
(Pe |i] 4) 

trea) H,\v 4) 


We had to distinguish between the two representations 
I's by upper indices (I) and (II). 
In the present method we decompose H; into parts 


_Group C3 


Pet30st+20's 
214+2Ps+21's 
My+Ts+27Ps 
My+Ps 

3, +2P2+27s 
M+202+2Ps 
Mitts 


(4) Ay Ig"! 
('s| Hi | Te) 

(6? | Ai |P6) 
(M6?) | Hy | 


Groups D3, Cav Groups O, Ta 


3Mt+rst+ls 
2P4+Tst+T's 
Mtlst+ls 
rT, 
P,+2Fr2+27Pr; 


r,+2P3 
P.+Ts 


Groups De, Cov, Dar 


P+20s+Ts UetT+1's 
+P r+Ts 
: HPs I's 

rs 
rethePe P P+P.+Ts 


Mit+rst+ls P3s+T's 
Ps+T's I's 


+I, 


given operation when the barred and unbarred opera- 
tions belong to the same class. 

As indicated above for the examples, we are inter- 
tsted to see how many linearly independent wave func- 
eions we can form from the substates of an angular 
momentum J that form bases for irreducible representa- 
tions of the group under consideration. This process 
using Eq. (1) has been carried out for the groups of 
Table I for angular momenta ranging from } to } 
(units of #). The results are tabulated in Table II. 

In Table II we have not distinguished between I 
type and I~ type representations which occur in groups 
containing the inversion operation as a symmetry 
element. It and indicate whether the irreducible 
representation is even or odd under inversion. The I 
representation can only occur in the case of an odd 
number of electrons with an electronic orbital wave 
function that is odd under inversion (p- and /-type 
orbitals). 

In practice, one never has to consider whether a 
representation is of the I+ or I~ type since in the direct 
product to be considered further below we have either 
both I'* or both I~ type states. Thus in the decomposi- 
tion of the direct product into irreducible representations 
only I+ type states occur. 

By the above-outlined procedure it is implied physi- 
cally that as we turn on the crystalline field and distort 
the wave functions the symmetry types of the lowest 
eigenstates are those listed in Table IT. 


III. HAMILTONIAN MATRIX 


The complete Hamiltonian in the presence of the 
magnetic field is 7o+H;,. Since we start with the cor- 
rect eigenfunctions in zero magnetic field Ho is always 
diagonal. The various eigenfunctions belonging to one 
irreducible representation have, same 
eigenvalues with respect to H». Consider for the above- 
mentioned case of Cr***, the matrix elements of H/;: 


of course, the 


yay ioe *g(11)\ 
5|H; Tel) 
vig |H;| Tg!) 
srt afd) 


| e(l)) 


which transform irreducibly under operations of the 


group. Those parts of H; with the derived transforma- 


tion properties are always very simple. In the cubic 
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TABLE IIL. Decomposition of direct products of irreducible represent ations into irreducible representations. The irreducible repre 
* 
pew a —— _ Ah are also given. (By de, we mean operators L,+25S,, etc. In this table we abbreviate, for example, I's* X's 
=fj,+T', st 2714 2 5 by 8*X8=1, 2, 3, 42, 5 
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group O, (L.+25S,), (Ly+2S,), (£.+2S,) form a basis 
for the irreducible representation I',(T'1;). In other cases 
(L.+25S,) belongs to a one-dimensional irreducible rep- 
resentation of the group and #[1,+2S,+i(L,+25S,) | 
form either the basis for a two-dimensional representa- 
tion or two one-dimensional representations which are 
complex conjugates to each other. In the case where 
F[L.+2S,+(L,+25S,)] forms a basis for a two- 
dimensional representation also L,+2S, and L,+2S, 
would form such a basis. However, in the matrices 
which have to be evaluated below it makes a difference 
which choice is taken. We have thus indicated in Table 
III explicitly in which way HM, should be decomposed 
for each group in order to be consistent with our 
matrices. 

The Hamiltonian matrix is to be evaluated for each 
of the irreducible parts of H, separately. In the case 
of Cr*** in a field of symmetry C; we have only one- 
dimensional representations. It follows trivially from 
the orthogonality relations between group characters of 


E\t+qH, 
E,+aeH, 

complex 

conjugate 


In the matrix we have added in the diagonal the matrix 
elements due to H» which we have called -,, Eo, Es, 
and E£,y. The matrix must be Hermitian and thus £; 
through £, and a; through as must be real. All other a, 
will in general be complex. The inclusion of time in- 
version symmetry below will materially reduce the 
number of independent constants. The case of Crt*? 
in C3 is a particularly simple example since only one- 
dimensional representations appear. 

Consider now as an additional example the ion of 
angular momentum one in a cubic field of symmetry 
type O. In general, of course, the lattice in the vicinity 
of the ion will distort due to the Jahn Teller effect and 
assume a lower symmetry. According to Table II an 
angular momentum of 1 remains degenerate in a cubic 
field and belongs to the three-dimensional representa- 
tion 'y= (15). As above, we must investigate whether 
I'y*XI, contains the complex conjugate of the repre- 
sentation according to which the operator transforms. 
From Table III we notice that 1,+2S,, L,+25,, and 
L,+2S, forms a basis of a representation of I'y. From 
Table II we see that ['y*=I', is contained once in the 
direct product 'y*XT', and thus there are nonvanishing 
matrix elements. As is shown,’ the 3X3 matrix of H, 
is just a constant times a known matrix. The result is 
not surprising if one considers that one can generate 
from one of the three functions the other two by sym- 
metry operations of the cubic group just as one can 
generate from one partner in the basis of the operator, 


8G. F. Koster, Phys. Rev. 109, 227 (1958). 
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as(H,+iH,) 
E3+ a3H, 


a;(H,+iH,) a5 
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irreducible representations that a matrix element is 
only nonvanishing if the direct product of T,*XT, 
(where a stands for the symmetry type of the wave 
function to the left of the operator and 6 for the func- 
tion on the right of the operator) contains the complex 
conjugate of the representation according to which A, 
transforms. This result holds for irreducible repre- 
sentations of all dimensions. If I',*xI, contains the 
complex conjugate representation of the part of AH, 
under consideration then there are nonvanishing matrix 
elements between the functions transforming like 
r,* and Ty. For C3, (1,+2S,) transforms like T); 
(L,+2S,—iL,—i2S,) transforms like T.; —(L2+2S; 
+iL,+i2S,) transforms like I'3. (The minus sign in 
front of the last parenthesis has been added in order to 
have a uniform notation throughout.) 

In Table II we have also given the decomposition 
the direct products I'.*XTI, for the point groups of 
Table I. Application of these results yields, for our 
example, the following matrix: 


a’(—H,+iH,)| 
ay(H,+1H,) 
ay, 


EytaH, 


for example, 1.+25,, the other two by symmetry 
operations. 

In the case that the irreducible representation of the 
operator is contained twice in a direct product of irre- 
ducible representations (which may occur in the groups 
O, O,, T, and T),), the corresponding matrix is the sum 
of two matrices with unknown coefficients. 

As was shown,® these matrices are proportional to 
certain rows of matrices U’ which reduce the direct 
product I',*(R)XT,(R) into the complex conjugate 
irreducible representation (or representations) accord- 
ing to which the operator transforms, i.e., 


CT .*(R) XT (RIUT=P(R). (5) 
I'(R) is the reduced form in which the complex conju- 
gates representation (or representations) of the operator 
are considered to appear in the upper left-hand corner. 
If the representation is contained once, then the first 
ny (ny is the dimension of the irreducible representation 
of operator) rows of U’ have to be found, if the repre- 
sentations are contained twice then the first 27, rows 
have to be determined. 

In Table IV the relevant matrices have been evalu- 
ated for the point groups of Table I. In ‘Table IV the 
individual rows of U’ have been broken up into several 
smaller rows to yield matrices of the form in which they 
have to be inserted into the Hamiltonian matrix. If the 
representation of the operator is contained just once, 
then the number of matrices is equal to the dimension 
of the irreducible representation of the operator. Each 
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TaBLE IV. Submatrices in the Hamiltonian matrix as obtained from reduction of direct products of the type 'a* XT» 


into irreducible representations of operator. (J, stands for L;+25Sz, etc.) 


Groups D2, Cow 


Block Operator Matrix 


(2 3) 
( > -1) 
wo RD 


Groups D4, Cae, Dea 
Operator 


Jy 
J; 


Jz 


Block 
1 
0) 
0 

—1 
0 

—1 
0) 
1) 
1) 
0) 
0) 

—1) 
1) 
0) 


° ) 
i “0 ) 
1 0) 


se" as G@' @ @'s 


Ky Sy ay ay hy hy Oy 


6*X6 


0*X6 


0) 
1 0) 
“o) 


Matrix 


6*X7 
7*X7 


7*xX7 


( 
( 
6*x7 Ca 
J, ( 
( 
( 


Groups D3, Cav 


Block Operator 


Ji } . 1) 


J 4 4) 


(J,+iJ ,) 1) 
J,—-tJy (1 0) 
(J,+iJy) 1) 
J,—t, —_ 0) 


. 0 
—(J,+iJ,) 5) 
1 


(Jz+iJy) 
J,—iJy 
(J,+iJ,) 


J,-tJ, 


Matrix 





Block 


8*X<8 
9*x9 
1*X5 
i°XS 
2*X5 
2*X5 
3*X6 
3*X6 
4*X6 
4*X6 


5*X6 


5*X6 


7*X7 


7*X7 


7*xX9 


7*x9 


8*X8 


8*xX<8 


8*x9 


8*xX9 


Block 
1*X4 
1*X4 
1*X4 
2*x<5 
2*x<5 


xD 


3*xX4 


3*xX4 


3*xX4 


xX 


Operator 


Groups De, Cov, Dar 


Matrix 





—(J.+iJ,) 
J,—iJy 
—(J,+iJ,) 
J2—tJy 
—(Jz+iJ,) 
J,—iJy 
—(Jz+tJ,) 
J.—tJy 


—(Jz+#Jy) 
J,—tJ, 
— (Jz+iJ,) 
J2—tSy 
—(Jz+iJ,) 
Jr—iJy 
—(Jz.+iJy) 
Jz—tJy 
— (J,+iJ,) 


J,—iJ, 


Operator 


( 
( 
( 
( 
( 


0 -1) 
1 
( 


1 
0 
0 -1) 
1 


fate 


a 
ap 


(0 1) 


(1 


0) 


(0 1) 


0) 


O° 2 


(1 


0) 


(0 1) 


oe 
-oO 


oi. 


eS Ee eR ek 
oo eo oc oo 


es 
coo = 


Group O, Ta 


(1 
(O 
(0 
(1 
(0 
(0 

1 


0 


-oOo OOo Oo 


0) 


0 
0 


) 
) 
) 
) 
) 
) 
0 


0 


0 
0 


~o) 


Matrix 
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Group O, Ta 
Matrix 


0 -3 0 
0 VB 0) 


0 23) 


0 0 


Operator 





Jy 


rr, 


J; 


coco coo 


ooo Oro KF OOS 


( 
( 
( 
( 
( 
( 
( 
( 
( 
( 


= = as 20202 © =~ din si 
Coro rw COO OCOOCOoO OFo KOS 


6*X6 
6*X6 


6*X6 


matrix can be thought to be associated with one partner 
in the basis of the irreducible representation. In the 
present case L,+2S,, Ly+2S,, and L,+2S, are the 
three partners which span the irreducible representation 
of the operator. Thus the first row gives rise to the 
matrix of the operator 1,+2S,, the second row to that 
of L,+25S,, etc. If the operator were contained twice 
in the direct product then a linear combination of the 
first and fourth row would result in the matrix of 
L,+2S, and linear combination of the second and fifth 
row in the matrix of L,+2S,, etc. We have indicated 
in Table IV the operator to which each matrix belongs. 
The three matrices in the example are thus to be 
multiplied by aH, aH,, and a//,, respectively, and we 
obtain 
Eo aH, 
—<aH, Eo 
| aH, —aH, 


—oH, 
aH, 


| ; (6) 
Eo | 
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TABLE IV.—Continued. 


Group O, Ta 


Block Operator Matrix 


3i 0 (3i)4 0 
0 (32)4 0 3i 


G —2v3 0 0 
0 0 —2v3 0 


2) 

7*X7 (? ge 

1) 

cr Le. 
= m 


0 (31)4 ri 
(31) O —i 


6*X8 
6*x<8 


7*X7 


7*X7 G 
7*X8 
“x8 (0 


a 0 0 0 2 
"x8 ‘ 2 0 0 0) 


0 3/2 0 —v3/2 
sf2- «@ v3 /2 0 
0 v3/2 0 3/2 
L—_v3/2 0 3/2 0 


r 0 —3i/2 0 — (31)#/2 
3i/2 0 — (31)4/2 0 
0 (3i)3/2 0 — 31/2 
(32)#/2 0 31/2 0 


v3 0 0 0 
0 v3 0 0 
0 0 —v3 0 
0 O 0 —v3 


r 0 : 0 
3/2 
0 
5/2 —V3/2 


r 6 68 (37)4/2 0 
— (31)4/2  O — 3i/2 0 
0 3i/2 0 3i/2 
| — §7/2 0 — (3i)3/2 0 


1 0 0 
10 —3 0 
1 0 3 
0 090 


8*x8 (Set I) 


(Set I) 


(Set IT) 


, (Set IT) 





We have again added the diagonal elements due to Ho. 
Since the matrix has to be Hermitian, a must be pure 
imaginary. 

Again, let us call attention to the fact that if we have 
two sets of matrices, i.e., when the representation of 
the operator is contained twice, then we need two un- 
known constants, one each for the two sets. 

With the information that has been presented it 
should thus be possible to write down any Hamiltonian 
matrix for an ion in any crystalline environment of 
Table I. The matrices of Table IV may be considered 
to take into account the geometry of the wave functions 
of the operator and of the lattice. The constants which 
are left are of nongeometrical nature. 


IV. TIME INVERSION SYMMETRY 


Besides the geometrical symmetries we also know 
that the Hamiltonian, in the absence of an external 
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magnetic field, is symmetrical under the operation of 
time inversion. This means that degenerate eigen- 
functions of Hy have to transform among themselves 
under time inversion.’ Time inversion symmetry will 
introduce additional restrictions between the constants 
in the above-discussed Hamiltonian matrices. A time 
inversion operator K is introduced.’ A may be con- 
sidered to be the product of the operator Ko which 
transforms the wave function into its complex conjugate 
and a real unitary operator which changes all the spin 
angular momenta into their negatives. For determining 
the additional conditions on the constants we need 
three equations from Wigner relating to the operator K°: 
(7) 


K*g=(—1)"@; K*=(—1)", 


K (ag+by) =a*Kot+b*Ky, 
(Ko,Ky)*. 


(8) 


(py (9) 

In Eq. (7), 2 is the number of electrons, @ and y are 
wave functions, and a and 6 are constants. From the 
above-considered group-theoretical arguments we know 
that eigenfunctions belonging to the same irreducible 
representation of the group of Ho (in the absence of a 
magnetic field) are degenerate. However, due to time 
inversion symmetry additional degeneracies may arise. 
Wigner distinguishes between the following cases de- 
pending upon the nature of the irreducible representa- 
tion D of the wave functions. 


1. Even Number of Electrons 


(a) D is equivalent to a real representation: no addi- 
tional degeneracies. The functions may be chosen such 


that they transform into themselves under time 
inversion. 

(b) D is complex but D and D* are not equivalent : 
there are additional degeneracies. D and D* have the 
same eigenvalue—D and D* have to appear in pairs. 

(c) D is complex and D and D* are equivalent but 
cannot be made real: there are additional degeneracies. 
D and D* belong to the same eigenvalue—D and D* 


have to appear in pairs. 


2. Odd Number of Electrons 


(a) Dis equivalent to a real representation: there are 
additional degeneracies. The representation of D always 
has to appear in pairs; the eigenfunctions belonging to 
a pair of representations D have the same eigenvalues. 

(b) D is complex but D and D* are not equivalent : 
there are additional degeneracies. D and D* have the 
same eigenvalue—D and D* appear in pairs. 

(c) D is complex and D and D* are equivalent but 
cannot be made real: there are no additional de- 


generacies. 


*E. P. Wigner, Nachr. Akad. Wiss. Gottingen, Math.-physik. 
K1., 546 (1932). 
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Before applying these results let us also state that 
if a certain set of functions transforms like a representa- 
tion D, the time inverted functions transform according 
to the complex conjugate representation D*. In Table I, 
we have indicated to which category each representation 
belongs as far as time inversion symmetry is concerned. 

Consider now the examples. Cr*** in Al,Og is at a 
lattice site with symmetry C;. Furthermore Cr*** has 
an odd number of electrons. From Table I, it follows 
that T'y*=I; and we thus conclude from 2(b) above 
that in matrix (4) £,= E>. I's is real and thus according 
to 2(a) has to appear in pairs. Since there is only one 
pair of representations I's we can conclude £3= £4. In 
the case of 2(a) where D is real, ¢ and K@¢ are corre- 
sponding partners in two distinct bases for D. ¢ and K¢ 
are not identical to each other but are orthogonal on 
each other. Further consider the relations between the 
matrix elements. In (3) it follows, for example, from 
Eq. (9) that 


=a « Kf, T'g())* 
— (T's; Hf, I's IT ‘wh 


Ps) A, /T," 


(AT ,| KM, K™ | KT )*= (10) 


In Eq. (10) we have made use of the fact that KH, AW 

—H), i.e., the operator H, in a coordinate system 
in which the time axis is inverted is equal to the nega- 
tive of the original operator. (L and S change their 
sign under time inversion.) Also, from what has been 
said above, ATy=I; and AT.) =T',"". Note, however, 
that AT;=—Iy and AT,?”?=—T,? kK?=—1. 
From Eq. (10) we thus conclude that in matrix (4), 
d¢=ay*. Similarly, we find that 


(KT ,| KA, K—"| KT ,¢%”)* 
= (I's! A, rg)*. 


since 


(T'; HT, I's IT) 
(11) 


Thus we find a;= — as". 


For the diagonal elements of 77; we find, for example, 
(KT ,| KH,K—|KT,)* 
=—(T; Hy, | = = (Ts! Hy I's). 


Ty My iTs 
(12) 
Thus, @;=—ey» and a; and ay are real. Similarly, it is 
found that a3= —a,. Also, a; and ay are real. 

Since for our calculations the zero of energy is im- 
material we may set E;=£,=0. We thus have 3 real 
and 4 complex constants. left. We may ask ourselves 
how many of the complex constants are by necessity 
complex and cannot be made real? We have still certain 
phase factors in the wave functions at our disposal. 
If we modify the wave function designated by I'y by 
multiplying it with exp(+ i), then we have to multiply 
I’; by exp(—i@) since the two are related by the time 
inversion operator K. The original matrix element 
(4 H,\I;) thus will be multiplied by exp(— 27). We 
can now always choose ¢ such that as becomes real. 
Similarly we can modify I'¢ by a factor exp(ty). As 
above, we can choose ¥ such that the modified matrix 


element 
(T'¢' I) |, | P64), 
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and thus dy) becomes real. We have, however now dis- 
posed of all available phase factors and the remaining 
two complex constants will in general remain complex. 
Thus we have now a total of 5 real and 2 complex con- 
stants and since the complex constants have a real and 
an imaginary part we must consider the total number of 
adjustable constants to be 9. This result has to be com- 
pared to the conventional spin Hamiltonian 


H=DS?+ 2. 8H,S.+2:6(H2S:+H,S,), (13) 


which has three adjustable constants D, gi, and g,. 

In matrix (6), we do not obtain any additional re- 
sults by the application of Eqs. (7) to (9). Consider, for 
example, the (1,3) element which stands in the upper 
right-hand corner. From Table I, we see that S,, S,, 
and S, may be chosen as a basis for I'y. The group O 
does not contain the inversion and thus x, y, and z 
would generate the identical representation. We may 
thus schematically write 


(x|H,|2)=(Kx| KH,K-| Kz)*=—(x! Hy\2)*. (14) 


In case of an even number of electrons, real functions 
transforming like x and z go into themselves under 
time inversion. From Eq. (14) follows again a= —a 
and thus a must be pure imaginary as already concluded 


* 
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further above. Functions transforming like S, and 8. 
would both go into their negatives under time inversion 
and the conclusions from Eq. (14) would, of course, 
have remained the same. The eigenvalues of matrix (6) 
are E=Ey and E=EytiaH (ia is real) which is, of 
course, identical with the predictions of the spin 
Hamiltonian H(S-H)=g8H-S for a zero of energy such 
that Eo=0 and g8= ia. 

The behavior under time inversion of the various 
functions may always be chosen to be that of the bases 
given in Table I. The functions ¢(j,m) given in Table I 
transform like an angular momentum / with a 2 com- 
ponent m. Under time inversion they may be chosen to 
transform like” 


Ko(j,m) = (—1)**™6(j, —m). (15) 


V. CONCLUSION 


We have derived the Hamiltonian matrices for ions 
in various crystalline surroundings. Group theory allows 
more general matrices with more undetermined con- 
stants than the spin Hamiltonian. The spin Hamiltonian 
thus contains physical assumptions in addition to 
symmetry arguments. 


10 A. R. Edmonds, Angular Momentum in Quantum Mechanics 
(Princeton University Press, Princeton, New Jersey, 1957), p. 30. 
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The variation of the hole ionization energy of imperfections in cadmium sulfide as a function of the 
impurity concentration has been measured using photoconductivity in a series of CdS:Ga:Cu powders. 
The Cu concentration varies from 4X10" to 2X 10” cm~, and each sample was prepared with a Cu-concen- 
tration to Ga-concentration ratio of 1.05. The hole ionization energy of the sensitizing centers, as determined 
from the thermal quenching of photoconductivity, decreases from about 1.0 ev for low Cu concentrations 
to about 0.3 ev for 2X 10” Cu cm™. The results are analogous to other recent findings of small hole ionization 


energies in CdS and CdSe crystals. 


INTRODUCTION 


DECREASE in impurity ionization energy for 

high impurity concentrations in semiconductors 
has been well known since the experiments of Pearson 
and Bardeen! with boron impurity in silicon. Similar 
experiments have not been carried out to date, however, 
for impurity ionization energies in insulators; of 
particular interest are those impurities with very large 
ionization energies, such as the acceptors in sulfides. 
The present paper uses the phenomenon of thermal 
quenching of photoconductivity in CdS: Ga: Cu powders 
to follow the hole ionization energy of sensitizing 
centers as a function of impurity proportion. It is found 
that for high impurity concentrations in the range of 
10'5 to 10” cm-*, a large decrease in hole ionization 
energy does occur. 

Previous investigations? have indicated that the 
photosensitivity of CdS photoconductors is associated 
with the capture of photoexcited holes by centers with 
a small cross section for subsequent capture of photo- 
excited electrons. These particular centers may be 
called sensitizing centers; they are characterized by a 
single or double negative charge relative to the crystal, 
and may be associated with either crystal defects such 
as cation vacancies or with impurities such as copper. 
Such centers give rise to high photosensitivity only 
when they are affecting the recombination kinetics of 
photoconductivity; at temperatures or light levels at 
which these centers act only as hole-trapping centers, 
the photosensitivity decreases to a low value determined 
by other high-probability recombination centers. It can 
be shown’ that the beginning of a decrease in sensitivity 
with decreasing light intensity at constant temperature, 
or with increasing constant light 
intensity, occurs when the hole demarcation level 


temperature at 


associated with these centers is located at these levels, 
i.e., When the probability for thermal escape of a hole 
out of these centers into the valence band is equal to 
the probability of recombination of a free electron with 


a hole captured at the centers. Identification of the 


1G. L. Pearson and J. Bardeen, Phys. Rev. 75, 865 (1949). 
2 R. H. Bube, J. Phys. Chem. Solids 1, 234 (1957). 


location of the hole demarcation level for the onset of 
desensitization, therefore, provides a determination of 
the hole ionization energy for the sensitizing centers. 
Since electron currents are being measured, it proves 
most convenient to calculate the location of the steady- 
state electron Fermi level for the onset of desensitization 
from 

Ejn=kT InLN./n]. (1) 


Here E,, is the energy difference between the bottom 
of the conduction band and the steady-state Fermi 
level, T is the temperature for the onset of desensiti- 
zation, .V, is the effective density of states in the 
conduction band, and » is the free electron density at 
the onset of desensitization. Once the value of Ey, is 
known, the location of the hole demarcation level can 
be calculated from it?: 


Eayp=EjnthkT In[Sp/Sn]. (2) 


Here Eq, is the energy difference between the hole 
demarcation level and the top of the valence band, S, 
is the capture cross section of a sensitizing center for a 
hole when occupied by an electron, and 5S, is the capture 
cross section of a sensitizing center for an electron when 
a hole has been previously captured. Past research? 
has indicated S,/S,= 108 as the value to be taken for 
sensitizing centers in CdS. This value has been assumed 
to be constant throughout the series of measurements 
discussed in this paper; a decrease in S,/S, with 
increasing impurity concentration would only accentu- 
ate the decrease in hole ionization energy with impurity 
concentration, whereas an increase in S,/S,, of sufficient 
magnitude to make an appreciable difference is unlikely. 

Usual measurements of photoconductivity in single 
crystals of CdS have been made with fairly low impurity 
concentrations, i.e., below about 10'8 cm-*, and the 
variations of hole ionization energy with impurity 
concentration described in this paper have not been 
observed. There have been a number of incidental 
observations on CdS photoconducting powders, how- 
ever, which have indicated a change in the hole ion- 
ization energy with impurity concentration. Such 
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observations** have been concerned primarily with the 
variation of photocurrent as a function of light intensity 
at various temperatures for different impurity concen- 
trations. Photocurrents varying as a power of light 
intensity less than or equal to unity were found for 
low impurity concentrations at both room temperature 
and liquid nitrogen temperature. When the copper 
concentration was in the range of about 10!° cm-* or 
higher, however, it was found that the photocurrent 
varied as a power of light intensity greater than unity 
at room temperature while still varying as a power of 
light intensity less than or equal to unity at liquid 
nitrogen temperature. Thus, these observations indi- 
cated that desensitization could occur at room temper- 
ature in the materials with high Cu concentrations, 
whereas for low Cu concentrations desensitization did 
not occur at room temperature. There have, in addition, 
been several recent observations of low hole ionization 
energies in certain specially treated single crystals of 
CdS and CdSe.* In these cases the treatment involved 
seems likely to lead to situations where at least local 
concentrations of impurities much higher than normal 
are encountered. The investigation described in this 
paper is a detailed study of this phenomenon. 


EXPERIMENTAL 


A series of CdS:Ga:Cu photoconducting powders 
were prepared with a [Cu ]/[Ga] ratio of 1.05, where 
the square brackets indicate concentration in moles 
per mole of CdS. This ratio was shown experimentally 
to give optimum light-to-dark current ratio for a given 
[Cu], in agreement with the work of van Santen.’ 


TABLE I, Photoconductivity characteristics of 
of CdS:Ga:Cu powders. 


Aix yr 
(for a light 
intensity of 

13 ft-c) 


Ai« Ln 
(for an applied 
voltage of 
300 volts) 


[Cu] 
moles per moles per 
mole of mole of iq x Ya 
CdS CdS 


2X10" 1.9X 10-5 
4X10°5 3.810 

10 9.5X10-° 
210 1.910" 
4X10 3.8X 10" 
2x10-3 1.9K 10-3 
4X10 = 3.8X 10-3 

10 9.510" 
A* 1.5 
B 2xX10™ 


[Ga] 


Sample 
No. 


b n 
High V Low V High L Low L 


1.6 


NORRS 


Con ewre 
So 


NK dO WN Nd 


r 
( 


0.8 


— 
— 


_ * Sample without added Cu or Ga, fired like other members of the series. 
Spectrographic analysis by H. Whittaker indicated about 30 ppm Cu 
present in this starting material. 


3S. M. Thomsen and R. H. Bube, Rev. Sci. Instr. 26, 664 (1955). 

4W. Veith, Z. angew. Physik 7, 1 (1955). 

®* Tolstoi, Kolomiets, Golikova, and Tsenter, J. Exptl. Theoret. 
Phys. (U.S.S.R.) 30, 575 (1956) [translation: Soviet Phys. 
JETP 3, 465 (1956). . 

6 R. H. Bube, J. Chem. Phys. 30, 266 (1959). 

7W. van Santen, paper in the Proceedings of the Brussels 
Conference, 1958 (unpublished). 
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TABLE IT. Cu concentrations after firing and cyanide washing.* 


% Cu present 
after cyanide 
wash 


% Cu present 
after firing 
0.090 
0.173 


Sample No. 


% Cu added 


7 0.088 
0.176 


8 
8W 
9 0.440 
OW 

B 0.880 
BW 


0.165 
0.410 

0.398 
0.787 

0.203 


® Analysis for Cu concentrations by R. J. Paff and M. C. Gardels. 


Keeping this ratio fixed, the [Cu] was varied from 
2X10-* to 10-*; this corresponds to a variation from 
4X 10" to 210° Cu atoms cm™, 

Standard solutions of Ga and Cu were prepared by 
dissolving the pure metals in nitric acid. Appropriate 
volumes of the standard solutions were pipetted into an 
aqueous slurry of 5 grams of CdS (RCA 33C-291A) ; 
this activated slurry was then dried for 15 hours at 
110°C, ground and intimately mixed in an agate 
mortar, and a portion of the mixed sample fired for 1 
hour in an atmosphere of H2S at 850°C and rapidly 
cooled. Test cells of the fired material were prepared 
without further grinding or sieving, by mixing the 
powder with 1% ethyl cellulose in amyl alcohol as a 
binder and applying to a 0.5X5.0 mm gap between 
two conducting portions of a glass slide. The various 
concentrations used are summarized in Table I, 
together with certain photoconductivity characteristics. 

Three samples with the highest Cu concentrations 
were washed by boiling for one-half hour in a solution 
of 5% NaOH-10% NaCN to remove unincorporated 
copper. The powder was washed with water by decan- 
tation after the CN treatment until washings were 
neutral to pH paper, and then dried for 15 hr at 105°C. 
The results of the washings are given in Table II, 
indicating that, except for the special sample with 
only Cu and no Ga, essentially all the added Cu was 
incorporated in the final sample. 

Measurements of the photoconductivity properties 
of the cells were made in an atmosphere of He gas, 
between the temperatures of liquid nitrogen and 100°C. 
Spectral response curves were obtained with a Bausch 
& Lomb grating monochromator; other curves were 
obtained by excitation from an incandescent source, 
using neutral wire-mesh filters to vary the light in- 
tensity. 

RESULTS 
General 


A summary of the over-all effects of increasing 
impurity concentration on dark conductivity and 
photosensitivity is given in Fig. 1. Beyond a critical 
impurity concentration at room temperature, the 
photosensitivity decreases rapidly ; the decrease sets in 
at a lower impurity concentration the lower the light 
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Fic. 1. Variation of dark current, and photocurrents at several 
light intensities and temperatures, as a function of Cu concen 
tration 


intensity used. At liquid nitrogen temperature, however, 
there is little variation in photosensitivity over the 
whole impurity concentration used. Here it must be 
noted that the CN wash did increase the low-tempera- 
ture photosensitivity of the high concentration samples, 


B. DREEBEN 
TABLE IIT. Types of variation with similar functional dependence. 
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Cu] 


even though analysis shows that the CN wash removed 
only 3% or 4% of the total added Cu; we shall discuss 
this point again further on. Because of the nature of 
the phenomena involved, the experimental curves of 
Figs. 1, 3, and 4 are very similar to other curves 
commonly measured at fixed impurity concentration.’ 
These interesting correspondences are summarized in 
Table ITT. 
Spectral Response 


Spectral response curves of representative samples 
are given in Fig. 2 as measured at room temperature. 
Sample A, with no added impurities, and Sample B, 
with only a high Cu concentration, both show photo- 
sensitivity maximum at the band edge and no strong 
long-wavelength response characteristic of compensated 
acceptor centers. All samples containing both Cu and 
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HOLE IONIZATION ENERGY 
Ga, however, show long-wavelength response out to 
9000 A (1.4 ev); when this energy is subtracted from 
the band gap of CdS (2.4 ev), we obtain the usual level 
location 1.0 ev above the top of the valence band for 
sensitizing centers in CdS.* In samples with high Cu 
concentration, the photosensitivity maximum shifts to 
the infrared; CN-washing does not appreciably alter 
the spectral response. 


Photocurrent vs Light Intensity 


The variation of photocurrent with light intensity at 
room temperature for the series is shown in Fig. 3. 
Breakpoints obtained for Samples 4 and 5 can be used 
to calculated Ey, according to Eqs. (1) and (2). It can 
be noted that the effect of CN-washing on Sample 8, 
for example, is not only to increase the photosensitivity, 
but also to change the behavior of the sample as a 
whole as if the actual concentration of Cu were nearer 
to that of Sample 7 (while retaining the same [Cu ]/ 
[Ga] ratio). This apparent reduction in Cu as the 
result of CN-washing is largest in Sample 9, apparent 
in Sample 8, and negligible for samples with lower 
impurity concentrations. 


Photocurrent vs Temperature 


The variation of photocurrent with temperature at 
fixed light intensity for the series is shown in Fig. 4. 
From estimates of the breakpoint for the onset of 
desensitization in these curves, values of Eg, for 
Samples 7, 8, 9, 8H’, and 9 can be calculated. 


DISCUSSION 


The calculation of Ey, i.e., the hole ionization energy 
for the imperfection centers involved, from the data of 
Figs. 3 and 4, is summarized in Table IV. Several 
assumptions are involved, none of which greatly affect 
the ultimate results. An electron mobility and an 
effective cell thickness must be assumed in order to 
calculate the location of £,, from the temperature and 
photocurrent ; it was assumed that the electron mobility 
was 100 cm?/volt sec, and that the effective thickness 
lay in the range of 1 to 100 microns. Values are calcu- 
lated for the two extremes of this thickness range, and 


TABLE IV. Calculated locations of hole demarcation level 
for onset of desensitization. 
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12 0.87 
66 0.82 
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240 0.63 
12 0.48 
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1.8 0.30 
100 0.48 
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Fic. 3. Photocurrent as a function of light intensity at room 
temperature for the members of the CdS:Ga:Cu series defined 
in Table I. 


the values for a thickness of 1 micron are plotted as a 
function of Cu concentration in Fig. 5. The choice of 
effective thickness makes only a small difference in 
the values of Eu,, especially at high values of Cu 
concentration. 

In Fig. 5 the values calculated for the CN-washed 
samples are plotted at the same Cu concentrations as 


TEMPERATURE 

Fic. 4. Photocurrent as a function of temperature at fixed 

light intensity for the members of the CdS:Ga:Cu series defined 
in Table I. 
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Fic. 5. Hole ionization energy calculated from the data of 
Figs. 3 and 4 as a function of the Cu concentration, assuming a 
cell thickness of 1 micron as far as optical absorption is concerned. 


the unwashed samples. The following line of reasoning, 
however, seems valid: (1) the light in the high concen- 
tration samples particularly is being absorbed in a 
portion of the crystallites very near the surface; (2) 
although CN-washing should not affect incorporated 
Cu, yet it might well have an effect on the surface 
layers; (3) if the 3% or 4% of the total Cu which is 
removed by CN-washing were to come mainly from 
these surface layers, the actual Cu concentrations in 
the regions of light absorption would be sufficiently 
decreased by the CN-washing so that the points for 
the washed samples in Fig. 5 should actually fall on 
the same curve as that fitting the points for the un- 
washed sampies. 

There are two ways for the hole ionization energy of 
the imperfection centers to decrease: (1) the imper- 
fection level can effectively move down closer to the 
top of the valence band, or (2) the top of the valence 
band can move up closer to the imperfection level. The 
fact that the long-wavelength cutoff of the spectral 
response does not change with increasing impurity 
concentration, as shown in Fig. 2, indicates that the 
level lying 1.4 ev below the bottom of the conduction 
band remains present in all the samples. 

In considering the second possibility, i.e., that in 
some manner the top of the valence band moves up 
toward the imperfection level, it is proper to call 
attention to the work of Stern and Talley’ and Stern 


, F. Stern and R. M. Talley, Phys. Rev. 100, 1638 (1955). 
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and Dixon.’ These authors have shown on a simple 
hydrogenic impurity model that for high concentrations 
of compensating impurities, an effective decrease in 
the width of the band gap can be expected. Since the 
acceptors in sulfides are by no means hydrogenic 
impurities, any quantitative comparison would be 
unjustified. Such a mechanism would require also that 
the sensitizing imperfection centers be of a different 
nature than the Cu centers which would raise the top 
of the valence band by impurity band formation. 

The sample with high concentration of Cu and no 
Ga does not show high conductivity such as might be 
expected if the hole ionization energy for Cu centers 
were reduced from 1.0 ev to 0.3 ev as the result of the 
high concentration of Cu. The decrease of the hole 
ionization energy seems to occur only when compen- 
sated Cu centers are present. 

It has been well established that the energy level 
structure of the Cu center in sulfides is not just a 
simple level lying 1.0 ev above the top of the valence 
band, but that there are also lower levels lying in a 
range about 0.3 ev above the top of the valence band. 
These lower levels have been detected through optical 
quenching of photoconductivity,” as well as by exci- 
tation of infrared luminescence emission and obser- 
vation of the emission itself." Interaction between 
centers at high concentration might well cause these 
lower levels to effectively raise the top of the valence 
band and produce the observed decrease in hole 
ionization energy. 

If the valence band is considered by some mechanism 
to be moving upward to decrease the band gap and 
hence the hole ionization energy, in samples with high 
impurity concentrations, it would be expected that the 
band gap would be decreased by the same energy as the 
hole ionization energy, or by about 0.7 ev (assuming 
negligible shift of conduction band edge). This would 
mean that the band gap would be decreased to 1.7 ev, 
corresponding to a wavelength of 7300 A; the spectral 
response curves of Fig. 2 show that the photosensitivity 
maximum, usually associated with the absorption edge, 
has indeed shifted to about 7100 A. 


*F. Stern and J. R. Dixon, J. Appl. Phys. 30, 268 (1959). 

0 R. H. Bube, Phys. Rev. 99, 1105 (1955). 

1 P. F, Browne, J. Electronics 2, 154 (1956); see also R. H. 
Bube, Photoconductivity of Solids [John Wiley & Sons, Inc., 
New York (to be published) ], Chap. 11. 
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Structure in the Energy Distribution of Photoelectrons from K;Sb and Cs;Sb 
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The energy distributions of photoelectrons from K3Sb and Cs;Sb show structure that is similar in form to 
structure in the spectral dependence of the optical absorption. One may rationalize this empirical result by 
saying that both the photoelectric and optical effects arise from structure in the state density of the valence 
band. Assuming that the optical absorption involves transitions to the conduction band, a lower limit for 
the electron affinity of the crystal is 0.6 ev for K;Sb and 0.4 ev for Cs,Sb. 


INTRODUCTION 


HE semiconductor Cs;Sb is a well-known photo- 
emitter. Following the initial work of Gorlich in 
1936, it has been the subject of extensive research.! As a 
result, its salient properties are rather well known. They 
are outlined in a recent paper by Spicer.? Spicer also 
investigated (among other materials) the closely re- 
lated compound K;Sb and found pronounced structure 
in the spectral dependence of the optical absorption. 
The present report describes further work on these 
substances. The spectral dependence of the optical 
absorption is compared in detail with photoelectron 
energy distributions at ~300°K and ~90°K. The 
optical and photoelectric results bear a surprising re- 
semblance to each other. We speculate that this may 
be a result of structure in the density of states of the 
valence band. On this basis, it is possible to deduce a 
lower limit for the electron affinity of the crystal. In 
addition, for Cs;Sb, one particular peak which appears 
at hy~2.25 ev (in both the optical absorption and in 
the photoelectric yield) behaves in a very different way 
from the others. This result supports a previous con- 
clusion by Borzyak* that this feature is due to exciton 
formation. 


EXPERIMENTAL DETAILS 


Previous papers describe the kind of phototube used 
in this work.®® It was a very simple type, in which a 
small emitter disk was mounted inside a cylindrical 
collector closed at both ends. Energy distributions were 
measured by the usual retarding potential techniques. 
Despite the departure from the customary concentric- 
sphere design, these tubes are surprisingly trustworthy 
for energy distribution experiments. In tubes designed 
for more precise work, the effects described in this 
paper should be even more pronounced. 


1 See G. Wallis, Ann. Physik 17, 401 (1956), and cited references. 

2 W.E. Spicer, Phys. Rev. 112, 114 (1958), and cited references. 
See also P. G. Borzyak, Trudy Inst. Fiz. Akad. Nauk Ukr. 
S.S.R. 2, 16 (1952). 

3 P. G. Borzyak, Trudy Inst. Fiz. Akad. Nauk Ukr. S.S.R. 4, 
28 (1953). See also, H. Miyazawa, J. Phys. Soc. (Japan) 8, 169 
(1953). 

4Yu. K. Shalabutov and N. S. Maslenikova, Zhur Tekh. Fiz. 
26, 1166 (1956) [translation: Soviet Phys. Tech. Phys. 1, 1137 
(1957) ]. 

5 Apker, Taft, and Dickey, J. Opt. Soc. Am. 43, 78 (1953); 
Taft, Philipp, and Apker, Phys. Rev. 110, 876 (1958). 

6 E. Taft and L. Apker, J. Opt. Soc. Am. 43, 81 (1953). 


Optical absorption data were obtained on films (or 
wedges) of various thicknesses formed on the inside of 
tubes made of fused silica. In many cases, optical data 
were also obtained on deposits formed on the Nonex 
glass phototube envelopes. Very rough estimates of the 
thickness of K;Sb deposits were made by evaporating 
weighed amounts of Sb in a known geometry. Thus, 
the magnitude of the absorption constant for K;Sb 
could be assessed in a crude way. For Cs;Sb, only 
relative magnitudes of the absorption constant were 
determined. It was then assumed that the absorption 
constant at Ay=2 ev was 10° cm™ in agreement 
with previous determinations.':7:* Magnitudes at other 
photon energies were thus evaluated. 

The compounds were formed as outlined in Spicer’s 
paper? and in the references cited there. Both optical 
and photoelectric data were obtained near 90°K by 
cooling the entire tube in a copper enclosure mounted in 
a Dewar vessel with suitable windows. 


RESULTS AND DISCUSSION 


Figure 1 shows the optical absorption of K;Sb at 
~300°K. There is a pronounced peak near 2.4 ev, 
a minimum near 3.0 ev, and another maximum near 
3.4 ev. These features are substantially the same as 
those in Spicer’s data (although they may occur at 
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Fic. 1. Optical absorption of K;Sb. Absolute magnitudes were 
estimated as described in the text. 
7 Morgulis, Borzyak, and Dyatlovitskaya, Izvest. Akad. Nauk 
12, 126 (1948). 
8 J. A. Burton, Phys. Rev. 72, 531(A) (1947). For data see 
V. K. Zworykin and E. G. Ramberg, Photoelectricity and Its 
A pplication (John Wiley & Sons, Inc., New York, 1949). 
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Fic. 2. Spectral distri 
bution of the photoelec- 
tric yield for two differ 
ent samples of K;Sb. 


ELECTRONS/ INCIDENT PHOTON 


hy (ev) 


slightly different values of hv). Very weak inflections 
near 1.3 ev and 1.85 ev appear to us tentatively to be 
reproducible. They agree with the data points in Spicer’s 
Fig. 1.2 Further, the first strong peak appears to be 
double, with the weaker component near 2.2 ev. Other 
structure occurs in the region beyond 3.5 ev. Among 
these various features in the absorption, we shall be 
most interested ir the strong double peak near 2.4 ev 
Figure 2 shows the spectral distribution of the photo- 
electric yield from two different K;Sb surfaces. Re- 
sponse in the threshold region is not reproducible among 
different samples, apparently because of the influence of 
excess K.* In the plateau region, however, the yield is 
much more reproducible. It becomes maximum at 
about the same points as does the optical absorption. 
Figure 3 shows photoelectron energy distributions for 
several values of hv. When hv= 6.38 ev, a major fraction 
of the electrons have low energies. An effect of this 
kind has previously been ascribed (in the case of Cs;Sb) 
to scattering of excited electrons by electrons in the 
valence band, a process which apparently is important 


4/4V (1/Is) 


| 3 
PHOTOELECTRON ENERGY E (ev) 

Fic. 3. Normalized energy distributions for photoelectrons from 
K;Sb. These were obtained by differentiating current-voltage 
characteristics.*.® Estimated errors in ordinates are below 10% 
for abscissa above 1 ev. The features shown have been reproduced 
in more than ten samples. 
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at this value of /v.°** In spite of this, three weak but 
reproducible peaks appear in the energy distribution 
near energies E= 1.8, 2.5, and 3.4 ev, respectively. This 
part of the distribution, especially the two peaks of 
higher energy, is composed of electrons that have not 
been involved in the strong scattering process mentioned 
above. 

It is conceivable that these peaks arise from structure 
in the state density of the valence band of K,Sb. If this 
is correct, the energy distribution for hy=4.42 ev should 
show the same kind of effect, but with the peaks shifted 
by an amount equal to the difference in the photon 
energies. The data appear to follow this pattern. For 
hv=4.42 ev, the peak of highest energy, with its charac- 
teristic double form, appears near 1.4 ev—approxi- 
mately the expected position. 

We may also ask if this structure can be correlated 
with structure in the optical absorption. Figure 4 is 
arranged to throw light on this question. Consider first 
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a photon of energy hy which is involved in the following 
optical absorption process: namely, the excitation of an 
electron from a given initial state in the valence band 
to a final state (in the conduction band) which lies at 
an energy below the vacuum level by an amount a. 
Also, consider the kinetic energy E of an external 
photoelectron ejected from the same initial state by a 


*In Si, an electron must have an energy near 2.25 ev in the 
conduction band before it can make an inelastic collision with an 
electron in the valence band and produce an electron-hole pair 
LA. G. Chynoweth and K. G. McKay, Phys. Rev. 108, 29 (1957) ]. 
Thus, the threshold energy for pair production by electrons is 
about twice the forbidden band gap (the energy at which the 
process is just possible energetically). One concludes that im- 
ere restrictions are imposed by momentum conservation 
[P. A. Wolff, Phys. Rev. 95, 1415 (1954) ]. In reference 5, data 
for Cs;Sb showed that strong scattering, attributed there to pair 
production, apparently set in only a little above the energy at 
which it became energetically possible. This is surprising, es- 
pecially in view of the results for Si. It may be pertinent, however, 
that Cs;Sb has a disordered lattice [K. H. Jack and M. M. 
Wachtel, Proc. Roy. Soc. (London) A239, 46 (1957)]. This dis 
order, along with other defects in these films, may break down the 
stringent momentum restrictions generally expected for a more 
perfect crystal. 
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photon of energy 4.42 ev, the value used in Fig. 4. The 
photon energy hy involved in the optical process will 
lie directly above the photoelectron energy E in Fig. 4 if 
the abscissa scales are arranged so that E+/v=4.42—a. 

Now, in this diagram, a peak in the state-density of 
the valence band will produce a related and similar 
structure in the optical absorption only if the optical 
transitions take place to a single final state (or a narrow 
group of states) of the type just mentioned (and if the 
transition probability does not vary rapidly with the 
energy of the initial state). 

A related and similar structure will also appear in the 
energy distribution of the photoelectrons if both the 
escape probability for excited electrons and the photo- 
electric transition probability do not vary rapidly with 
energy. Clearly, it would be surprising if these condi- 
tions were satisfied in any accurate way. However, if a 
is adjusted to 0.6 ev in Fig. 4, the structures which 
appear, one above the other, are very similar. Further, 
near 90°K, the components of the double peak appar- 
ently sharpen and become better resolved, the effect 
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Fic. 5. Optical absorption of CssSb. Absolute values were fixed 
by setting the absorption constant equal to 105 cm” at /v=2 ev; 
see text. 


being somewhat more pronounced in the optical results 
than in the photoelectric. The value of a given above is 
determined from the data for 90°K. Thus, we can make 
a consistent picture of these results by saying that there 
are rather sharp peaks in the state-density of the 
occupied energy band, and that similar structure in the 
optical absorption is due to transitions to an effective 
level (or narrow group of levels) lying at an energy 
a~0.6 ev below the vacuum level. For reasons discussed 
at greater length below, we consider it improbable that 
this effective level lie below the bottom of the conduc- 
tion band, since it would then be associated with defects. 
Accordingly, in this rationalization, a~0.6 ev is a 
lower limit to the electron affinity of K3Sb. This does 
not disagree with the range of values 1.1 to 1.8 ev 
obtained by Spicer from an approximate comparison of 
photoelectric, photoconductive, and optical absorption 
thresholds.’ 

The absolute values given for the optical absorption 
constant of K;Sb in Fig. 1 are estimated only in order 
of magnitude. Nevertheless, they are so high that one 
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associates them with the behavior of the bulk material 
rather than with defects present in small concentration. 
This is consistent with the reproducibility of the peak 
structure, as pointed out by Spicer in reference 2. Thus, 
we consider that these phenomena arise from properties 
of the K;Sb lattice as produced by these techniques. 

We turn now to the results for CssSb. The optical 
absorption near 90°K is shown in Fig. 5. Maxima or 
short plateau structures are visible at hy~1.6, 2.0, 2.25, 
and 2.45 ev. (The peak near 1.6 ev shows up in the data 
of Wallis.!) A broad peak appears near 3.2 ev, with very 
weak structure beyond. Figure 6 shows the spectral 
distribution of the photoelectric yield from Cs3Sb at 
90°K. (The peak at 1.2 ev on the curve for 90°K 
changes in height according to the previous irradiation 
of the emitter at higher /v; it is apparently associated 
with electrons in metastable traps; under irradiation at 
1.2 ev it disappears as shown in the alternative curve 
with a threshold near 1.6 ev.) 

Near 2.25 ev, there is a small sharp peak in the yield 
at 90°K. It is associated with a peak at the same 
energy in the optical absorption, as shown in Fig. 5. 
This feature has been discussed in previous work by 
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lic. 7. Normalized energy distributions for photoelectrons from 


Cs;Sb at 300°K (broken lines) and at ~90°K (solid lines). For 
other comments see caption Fig. 3. 
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Borzyak,* who ascribed it to exciton formation. His 
conclusion is supported by the results given below. 

Figure 7 shows energy distributions for photoelec- 
trons from Cs;Sb at several different photon energies. 
At hv=6.38 ev, there is a predominance of slow elec- 
trons. As in the case of K;Sb, we think this is due to 
electron-electron scattering processes which become 
important when hy exceeds a value somewhere near 
4 ev.’ In spite of this, there appear in all the energy 
distributions two peaks separated by roughly 0.4 ev or 
a little more. At Av= 3.96 ev they are located at electron 
energies of about 1.1 and 1.5 ev. At hy=6.38 ev, they 
appear at correspondingly shifted positions near 3.5 
and 4 ev. The magnitudes of these peaks have not been 
determined or understood with any precision in this 
work. Qualitatively, however, we have never found 
them absent. They appear, for example, in older data,°® 
although no attention was focused on them at that time. 

We now follow the argument used for the more pro- 
nounced structure in K;Sb. We associate the two peaks 
of highest energy in the energy distributions for Cs;Sb 
with the features in the optical absorption at about 2 
and 2.4 ev, respectively. It is conceivable that a third 
peak, which shows up at an energy near 2.7 ev on the 
curve for hv=6.38 ev in Fig. 7, is related to the broad 
peak near 3.2 ev in the optical absorption of Cs;Sb in 
Fig. 5. Correlating these features in this way, we arrive 
at a value near 0.4 ev (with an uncertainty of order 
0.1 ev) for a, the lower limit to the electron affinity of 
Cs;Sb. This is consistent with Spicer’s value of 0.45 ev 
for the electron affinity, based on an approximation 
that assumed a more simple band structure. 

In the energy distributions, there is no indication of 
similar structure corresponding to the sharp absorption 
peak found by Borzyak.’ This is in accord with 
Borzyak’s interpretation of the peak as due to formation 
of excitons, which may also stimulate photoemission 
from defects. This kind of process generally gives rise 
to photoelectrons that are very slow as compared with 
those ejected directly by photons. A slow group of this 
kind has been reported for Cs;Sb.‘ 

An interesting feature of this peak is that it lies be- 
tween the peaks at 2 and 2.4 ev. These latter features 
we have assigned to structure in the valence band. Thus, 
the peak at 2.25 ev is above the threshold for band-to- 
band absorption. For the resulting excitons there is no 
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energy restriction on dissociation, a process which may 
then compete with exciton stimulation of defects." 


CONCLUDING REMARKS 


The principal result given here is empirical. There are 
clearly detectable peaks in the energy distributions of 
photoelectrons from K;Sb and Cs;Sb. They seem to 
correlate with peaks in the optical absorption. A lower 
limit for the electron affinity of the crystal may be 
deduced if the effects are assumed to arise directly 
from structure in the state-density of the valence band. 
Values so obtained are consistent with previous esti- 
mates. In the case of Cs;Sb, an absorption peak pre- 
viously found by Borzyak behaves in a distinctly 
different way. This is consistent with Borzyak’s inter- 
pretation of this feature as an exciton peak. 

There are many questions left unanswered in the 
present paper. The detailed origin of the peaks and 
their temperature dependence are interesting points for 
future investigation. The very close similarity in form 
between optical absorption and photoelectron energy 
distribution spectra is surprising, and it is not under- 
stood in detail. We have seen similar phenomena in 
other crystals, including Cs;Bi, Rb;Bi, RbsSb, and 
alkali tellurides such as Cs2Te.* Structure, apparently 
in the valence band, includes peaks like those discussed 
here. In these other compounds, also, the peaks are 
often less than 1 ev in width. Since the electronic energy 
structures of these materials are not known in any 
detail, however, further interpretation is not easy. We 
conclude merely that the valence bands of these 
materials are complex and that photoelectric methods 
are of interest for detecting structure in the density of 
states. An interesting possibility is that photoelectron 
energy distributions may give interpretable results on 
other compounds for which the band structures are 
better understood. Among these other crystals, we 
consider the alkali halides interesting. 
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Ferromagnetic resonance phenomena in rare earth iron garnets (except Gd iron garnet) are dominated 
at room temperature and above by the rapid spin relaxation of the rare earth ions. The rare earth ion 
relaxation controls the g values and the line widths, as well as the temperature dependence of these quantities. 
It is shown that in the appropriate limit the g value of the microwave spin resonance line satisfies the relation 
g=ga(Ma+Mp)/Ma, where ga and M, refer to the ferric lattice and M@4+Msz is the net saturation mag- 
netization of the crystal. This relation obtains essentially when the B lattice relaxation frequency is high 
in comparison with the AB exchange frequency and relaxation frequency of the A lattice. The theory 
accounts quite well, with no disposable parameters, for the sequence and temperature variation of the 
g values reported for Dy, Ho, Er, Yb, and Sm iron garnets. When at low temperatures the B relaxation 
frequency becomes sufficiently low, the g value should approach the usual result for two coupled undamped 
lattices, in the absence of anisotropy effects on the B lattice. The theory predicts further that at ordinary 
temperatures the exchange frequency resonance will occur at the usual position and its width will be pro- 


portional to the damping constant of the B lattice. 


INTRODUCTION 


E present in this paper and in a paper with P.-G. 

de Gennes and A. M. Portis to appear later a 
theory of some general aspects of ferromagnetic reso- 
nance in the rare earth garnets, in impurity-doped 
yttrium iron garnet, and more generally in any ferro- 
magnetic or ferrimagnetic crystal with the property 
that two types of magnetic ions are present, one type 
relaxing rapidly and one type relaxing slowly. The 
theory gives natural and simple explanations of the 
central experimental facts relating to ferromagnetic 
resonance in rare earth garnets 5Fe.03-3M2O3, where 
M is an appropriate trivalent cation. In the accepted 
abbreviation, this garnet would be referred to as MIG. 
The static magnetic properties of these materials have 
been discussed exhaustively by Pauthenet,' who has 
determined and verified the essential features of the 
magnetic lattice structure. Our present observations 
do not apply directly to GdIG, because the Gd ions are 
in an 8S state and are not highly damped in the present 
sense. Our discussion does not apply to ions in J=0 
states. Wherever we write relaxation, in what follows, 
the quantity relaxed is the electronic magnetic moment. 

The following conclusions may be drawn from the 
available microwave resonance work on rare earth 
garnets carried out in several laboratories, notably 
Harvard: 

1. Over a wide temperature range, from below room 
temperature up to within 10° of the Curie temperature 
(~560°K), the line width decreases monotonically 
(roughly as 7~*) with increasing temperature. This 
behavior is anomalous because relaxation frequencies 
usually increase rapidly with increasing temperature. 
There is some suggestion that the widths may decrease 
again at very low temperatures: in YIG (presumably 
slightly contaminated by rare earth ions) the line width 
goes through a maximum at about 50°K. 


* Supported in part by the National Science Foundation. 
1 R, Pauthenet, Ann. phys. 3, 424 (1958). 


2. The widths at room temperature are of the order 
of hundreds or thousands of oersteds, except in highly 
pure yttrium iron garnet which does not contain 
significant amounts of rare earth elements, where the 
width is less than 1 oersted in a highly polished 
specimen. Yttrium is diamagnetic in this crystal. 

3. The g-values range from 2.02 to values less than 
1, and in Ho, Er, and Dy garnets have been observed 
to be markedly temperature dependent, generally 
increasing with increasing temperature above the 
compensation temperature. 

4. The g-values over a wide temperature range are 
found to satisfy quite well the following relation 
involving the saturation moments M,: 


g(M)=g(Y)M,(M)/M.,(Y), 


where Y refers to YIG and M refers to MIG. This will 
not apply at low temperatures. 

The key to the understanding of the resonance 
properties of the garnets is the fact that above 100°K 
or so the relaxation frequency of all paramagnetic rare 
earth ions’ in the garnets may be expected to be high 
in comparison with any other frequency concerned in 
the problem, including the microwave driving fre- 
quency, the relaxation frequency of the ferric lattice, 
and the exchange frequency associated with the ex- 
change interaction of the M*+** lattice with the Fet** 
lattice. The relaxation frequency need not necessarily 
be high in comparison with the exchange frequencies 
among the ferric ions. 

The argument begins with the remark that the 
relaxation frequencies of rare earth ions (excepting, 
always, Gd*+**+) in resonance experiments on dilute 
paramagnetic salts at 10 kMc/sec and 20°K are 
commonly observed to be of the order of 10" sec~. 
For Raman relaxation processes the theoretical 77 
temperature dependence would then give 10% sec™ at 


2 Except Gdt*+t, which has an 8S ground state and is known to 


relax slowly. 
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60°K, at the same frequency; however, there is not 
much experimental evidence that the temperature 
dependence is this marked. The relaxation processes 
in the garnets, moreover, are carried out in an exchange 
field of the order of 10° oersteds acting from the ferric 
lattice on a rare earth ion. At 20°K the direct relaxation 
frequency is proportional to H?, theoretically, so that 
even at 20°K the relaxation frequency in the garnets 
may be higher than 10” sec~!. Evidence on this argu- 
ment is lacking. The H? dependence actually holds only 
if RT > hwex, while for Raman processes the relaxation 
rate is not necessarily field-dependent. Here «,x is the 
exchange frequency of the rare earth ion in the exchange 
field (~ 10° oersteds) of the ferric ion lattice. 

It is convenient to discuss first (Part I) the analysis 
of the g-values observed in the resonance experiments, 
and we discuss in a later paper (Part II) the line width 
observations. A list of the principal symbols employed 


is given below. 
LIST OF PRINCIPAL SYMBOLS 


H,=AM,4=molecular field of lattice A at ions of 
lattice B. 

\Mz= molecular 
lattice A. 
H.=stati 
along the 2 axis. 

M ,7=magnetization of lattice A; in the garnets 
of composition 5Fe,0;-3M.O; there are two 
together 


field of lattice B at ions of 


external magnetic field, directed 


sublattices of Fe*** ions coupled 
strongly to form the A lattice. 

M 47+1M 4"; Mpt=MpttiMy’. 
M,+My=resultant magnetization: Note that 
M, and Mz, in the garnets will have opposite 
signs. 

relaxation constant of lattice 4. We assume in 
Part I that the direct relaxation of lattice 4 may 
be neglected. 

relaxation constant of lattice B. 

apM pH»: Qe apM (H+ H ans 


laxation frequencies. 


associated  re- 


gae/2me. 
gre/ 2mc. 
Ya—~YB: 
molecular field constant describing the exchange 
interaction of lattices A and B. The associated 
exchange energy density is —A\M4-Mz. In the 
garnets A is negative. 
1/r=frequency associated with spin correlation 
time 7 of ions on lattice B. 

=resonance frequency (complex). 
yaH. 

. yl. 


THEORY OF g-VALUES 
We let M { be 


resultant Fe* lattice, treating the two ferromagneti« 
et ** sublattices as coupled together infinitely tightly. 


the magnetization vector of the 


We neglect entirely in the g-value analysis the re- 
laxation of M4 except as it may be damped by contact 
with Mg, the magnetization vector of the lattice 
occupied by the rare earth or the yttrium ions. The 
interlattice exchange energy density is —\M.a-Ma, 
where a negative \ gives antiparallel coupling. The 
equation of motion for the 1 lattice is 


dM, dt=ys{M,XH+\AM.X Mz}. (1) 


For the 'B lattice 
dM, dt=yn{MzXH+AMsXMa} 
—apMpX {Mex (H+AM,4)}, (2) 


where the relaxation term is essentially in the Landau- 
Lifshitz form. The magnetization Mg relaxes toward 
the direction of the effective field H+AM, acting on B. 
Whatever phenomenological form is used to describe 
the relaxation, Wangsness* has pointed out that the 
form should describe relaxation toward the in- 
stantaneous field, as otherwise the second law of 
thermodynamics might be violated. We want the 
eigenfrequencies of (1) and (2) in the usual linear 
approximation. Our problem is a special case of a very 
general problem considered by Wangsness,’® but it is 
well worth solving the special case for the unusual 
insight afforded, an insight concealed in the general 
solution. 

Forming M+=M-+-iM", the equations of motion of 
the free system reduce, for time dependence e‘*‘, to 


wM 4+ = —waM 4+ = 1H eM 4+ +ysH aM ; (3) 


wM pt = —weM pt—ysHaMgtt+ysH eM 4 
—1a,M 4++ 1aoM pt, (4) 


W here H , = AM 4 ° Hz= AM zg; a) =apM pH ps; ao= apM 
X (H+H 4). The secular equation is 


wtoratyale = Aid 4 


—ypHpt+ia, wtweatysH 4—Ia2 


Low-Frequency Resonance 


The usual microwave ferromagnetic resonance solu- 
tion satisfies wx<H4, Hg. If we drop terms of order «”, 
waw, etc., the secular equation for the lower eigen- 
frequency of (5) reduces to 


wy pAM ity AM p—- iapvM 4M») 
+w AM (yp\—iapv\M zg) =0, (6) 


and the factor A be cancelled out; here 
AM=M.,44+Msz. 

If the damping of lattice B is sufficiently high so that 
aphM 4M g>ynd\M 4+7yadrM pu, then the eigenfrequency 


of (6) may be expanded as 


Ww (4 \AM M 1) 17 1 - NY 1 ap\M 1 |. 


may 


. K. Wangsness, Phys. Rev. 111, 813 (1958). 





FERROMAGNETIC 


As aap? ©, 


Ww >y4(AM M DH, (8) 


which is reduced from the usual ferromagnetic resonance 
frequency yaH of lattice A by the factor AM/M,4 
=(M4+Mz)/M 4. The result (8) has a simple physical 
interpretation. Our general theory‘ of g values applied 
to two coupled uudamped lattices gives, as is well 
known, 


M,t+Myz 
meg m (9) 
( Ma ya) + ( Mz, ‘y R) 


Yeff 


But the new result (8) for heavy damping of lattice 5 
is that 
Mat+Mp 


(Ma/ya ) 


so that the damped lattice contributes fully to the 
magnetization but not to the angular momentum 
density. This result is what one would expect, and is in 
good agreement with the experiments reviewed in the 
following section. The B system, being highly damped, 
does not respond to an applied torque as a normal 
gyroscopic system. We may think of the angular 
momentum of the B system as largely imaginary. We 
note that no angular momentum compensation point 
should be observed in spin resonance in this limit; this 
prediction probably accounts for the observations on 
ErIG. However, angular momentum compensation 
points may be observed if they occur at sufficiently low 
temperatures so that the B lattice relaxation is not 
fast in the present sense. Equation (39) of the paper 
by Wangsness reduces to (10) in the appropriate limit. 
We note that for our g value considerations the con- 
dition of heavy damping of lattice B is that the re- 
laxation frequency agAM4Mz be large in comparison 
with the frequency w and with the exchange frequency 
associated with magnetic fields of value \M4 or \M zx. 
At very low temperatures the relaxation frequency may 
become low enough so that (9), rather than (10), will 
be applicable. 

This belief in the largeness of ag\MaMpn may 
account for the width of the resonance; the widths are 
discussed in detail in Part II. The width of the reso- 
nance described by (7) is 


(10) 


Yeftf* 


AH VA YAAM 2 
= y 1AM p71, 


~ (11) 
HH apMa ap\MaMp 


so the width will be xarrower as the relaxation frequency 
of lattice B increases; this accounts for the observation 
that AH decreases with increasing temperature over a 
well-defined temperature range. 

Examination of the general expression for yerr given 
by Wangsness shows that our limit (10) for Yer is very 
accurately satisfied for ratios a4 /ag of the order of 10™% 
as envisaged by us at sufficiently elevated temperature. 


4C, Kittel, Phys. Rev. 76, 743 (1949). 
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Fic, 1. Experimental g-factors as a function of temperature in 
deg K for polycrystalline garnets at 9200 Mc/sec, according to 
Rodrigue, Pippin, Wolf, and Hogan, reference 6. Single-crystal 
measurements (not shown) on ErIG are in general agreement with 
the measurements on polycrystals [R. V. Jones (private 
communication) ] 


J. 


Exchange Frequency Resonance 
In the limit H=0 the high-frequency root of (5) is 
o= —X(y iMpt+yeM a )+ iaprAM pH a, (12) 


where a, is still neglected. This equation describes the 
exchange frequency resonance at the approximate 
frequency 


—wex=A(yaMptyeM 4), 


as given originally by Kaplan and Kittel. The damping 
of lattice B produces a line width 


Aw=apr\M 4M pe, (13) 


where we neglect a4. The damping of the exchange 
frequency resonance is directly proportional to ag. The 
result (17) suggests that low temperatures may be most 
favorable for the detection of the exchange frequency 
resonance in ferrimagnetic rare earth compounds. 


EXPERIMENTAL SITUATION ON g-VALUES 
We now confront the result (8), 


vetrt=74(AM/M 4)=7a(Mat+Mp)/Ma, 


with the experimental facts known to us at present. In 
Fig. 1 we have reproduced the experimental results of 
Rodrigue, Pippin, Wolf, and Hogan,® showing g as a 
function of temperature for polycrystalline garnets at 
9200 Mc/sec. The g factors have not been corrected for 
porosity; the correction should be small. The actual 
line shapes are somewhat asymmetric, but anisotropy 
energy measurements by Jones, Rodrigue, and Wolf? 


5 J. Kaplan and C. Kittel, J. Chem. Phys. 21, 760 (1953) 

® Rodrigue, Pippin, Wolf, and Hogan, [RE Trans. on Micro 
wave Theory Tech. MTT-6, 83 (1958). 

7 Jones, Rodrigue, and Wolf, J. Appl. Phys. 29, 434 (1958). 
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Fic. 2. Theoretical 
g factors as a func- 
tion of temperature, 
as calculated from 
Eq. (10); also shown 
are experimental re- 
sults for ErIG after 
Rodrigue et al., refer- 
ence 6, and Paulevé 
et al., reference 8. 











show that the crystalline anisotropy energy is much too 
small to contribute significantly to the line widths and 
shapes in the temperature range of Fig. 1. Further, the 
single-crystal g-values at room temperature obtained by 
these workers agree with the polycrystalline values 
within 2%. 

To calculate yer¢ using the relation (10) we take 
ya=2.02, as this is representative of YIG. We take 
AM/M 4 as the ratio of the observed saturation moment 
for the appropriate rare earth garnet to the observed 
saturation moment for YIG, using the data of 
Pauthenet. These data may be significantly in error 
because of porosity, nonstoichiometry, and foreign 
magnetic inclusions. Such errors, if present, would have 
somewhat unpleasant effects on the ratio. We have 
calculated moments per unit volume from Pauthenet’s 
tables as if the lattice constants were equal to that of 
YIG; errors of +3% are introduced in this way. 
Further, the temperature dependence of the saturation 
moment for the ferric lattice in a rare earth garnet will 
not be precisely the same as in YIG, and errors in our 
input data from this cause are surely important near 
500°K and may account for the bending down at high 
temperature of the calculated g-factor curves for YbIG 


KITTEL 


and ErlG. The curves calculated on this basis are shown 
in Fig. 2. On Fig. 2 we also give the experimental curves 
for ErlG from Harvard® and from Grenoble.® 

The general features of the experimental results are 
reproduced fairly well in the calculated curves. One 
further fact may be added: In the experiments of 
Rodrigue et al.° the lower wing of the resonance line 
of DyIG was observable at their highest obtainable 
fields (6100 oersteds) above 510°K which puts 1.2 as 
an upper limit on g for DyIG in this temperature range. 
This is compatible with the calculated curve and thus 
the downward sequence of g-values (Sm, Yb, Er, Ho, 
Dy) is accounted for by the theory. 

Note added in proof.—With a relaxation term for 
the B lattice of the Gilbert form, —azMgX (dMz/d1), 
it is not possible to account for the observed g-values 
of the garnets. With this relaxation term one finds 
£ett > 0 as ap— ©. Indeed, the Gilbert form applied 
to a single uncoupled spin system gives the resonance 
condition w= —yH/(1+iaM), so that both g— 0 and 
1/70 as a— ~~. Thus such a relaxation term is 
intrinsically unable to describe a rapidly relaxing spin 
system. 
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The attenuation of sound due to the interaction of electrons with the lattice has been measured in pure 
polycrystalline aluminum at low temperatures. Measurements from 26 to 130 kc/sec, obtained by recording 
the decays of the harmonics of a longitudinally resonating bar, were compared with the electrical conduc- 
tivity of an aluminum wire of identical purity, over a temperature range of 3° to 70°K. The attenuation was 
proportional to the square of the frequency. A detailed agreement between the shape of the attenuation and 
conductivity curves was obtained, but the experimental attenuation was 50% greater than could be explained 


by the present theory. 


1. INTRODUCTION 


HE attenuation of sound produced by the inter- 
action of electrons with the vibrating lattice, in 
pure metals at low temperatures, was first reported by 
Bémmel' for lead, and since has been investigated in 
various metals by a number of authors at frequencies 
above 9 megacycles/second. Shortly after the publi- 
cation of Bémmel’s letter, Mason? and Morse’ presented 
equivalent theories for the effect, in which the attenu- 
ation is proportional to the square of the frequency and 
to the electrical conductivity of the metal. Except for a 
recent report‘ including electrical conductivity measure- 
ments at 4.2°K, the conductivities used for the com- 
parison of theory and experiment were not measured 
in the samples used for the measurement of the acoustic 
attenuation but rather were taken from the literature.” 
The experiment presented in this paper was per- 
formed to obtain a detailed comparison of the electron- 
lattice interaction and the electrical conductivity of 
aluminum by measuring both the attenuation and the 
conductivity over a wide temperature range. Although 
it is of the third group, the almost spherical Fermi 
surface and the near isotropy of this metal suggest that 
it is more amenable for comparison to theory than most 
of those previously studied. 

The acoustic attenuation in the temperature range 
3° to 70°K was measured by the reverberation decay 
method for the first five extensional modes of a long 
aluminum rod, yielding a frequency range of 26 to 130 
kc/sec. According to the theory, in this range the 
attenuation varies as the square of the frequency, 
leading to low values at these low frequencies. The 
success of the method hinged on (1) reducing losses 
associated with dislocation phenomena by carefully 
annealing the samples and (2) minimizing extraneous 
losses to the surroundings. An advantage of working in 
this frequency range is that easily handled poly- 
crystalline specimens may be used without correction 
for grain boundary scattering. 

* Supported in part by the Office of Naval Research. 

1H. E. Bémmel, Phys. Rev. 96, 200 (1954). 

2 W. P. Mason, Phys. Rev. 97, 555 (1955). 

3 R. W. Morse, Phys. Rev. 97, 1716 (1955). 

4W. P. Mason, Physical Acoustics and the Properties of Solids 


(D. Van Nostrand Company, Princeton, New Jersey, 1958), 
p. 327. 


The electrical conductivity of the metal was measured 
over the same temperature range in a wire made from 
the same stock as the acoustic specimen and subjected 
to the same heat treatment. 


2. THEORY 


According to Mason,’ when the mean free path of an 
electron becomes large (at low temperatures in the pure 
metal), momentum acquired by it from the vibrating 
lattice is not returned to the lattice at once, and hence 
a viscous loss occurs. For longitudinal waves at low 
frequencies, the attenuation per unit distance is 


2r? f? 
a=——(§n tx), (1) 


pvt 


where f is the frequency, p the density, and v, the 
velocity of longitudinal waves. Measurements? in tin 
and lead by Bimmel and Mason indicate that the bulk 
viscosity x is zero. The shear viscosity is proportional 
to the electron mean free path / and thus to the electrical 
conductivity o according to the relations 


hx? 
—) (39?) Io, (2) 


2mre 


where N is the number of electrons per unit volume, 
m, e, and @ are, respectively, the mass, the charge, and 
the average velocity of the electrons, and h is Planck’s 
constant. Thus the attenuation becomes 


2 Ff PR 
a,=— — —(3r°N)!o. (3) 
15 pv,’ e? 


The results obtained from the theories of Morse,’ 
Pippard,> and Holstein® are identical in the low- 
frequency approximation to Eq. (3). 

For Young’s modulus waves, with which we are 
dealing in this experiment, Filson’ has derived the 


5 A. B. Pippard, Phil. Mag. 46, 1104 (1955). 

6 T, Holstein, Westinghouse Research Memo, 60-94698-3-M17 
(unpublished). 

7D. H. Filson, Phys. Rev. 115, 1516 (1959). 
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expression 
2 2 h2 
ay=— —— —(3r*N)1(1+9)*0 (4) 


15 pry*® e 


for the attenuation, where vy is the phase velocity of 
Young’s modulus waves and » is Poisson’s ratio. 


3. EXPERIMENTAL METHOD 
(a) Acoustic Measurements 


The acoustic sample was machined from a forged 
bar of Alcoa 99.995% pure aluminum into a 4-inch long 
cylinder, 0.4 inch in diameter, with a 0.005-inch deep 
V-groove around its center. It was etched and annealed 
in vacuum to 500°C and slowly cooled. Re-etching 
revealed that the specimen recrystallized in 0.5 mm 
grains of random orientation. 

The specimen was suspended vertically by a 0.003- 
inch piano wire in the V-groove, in a modification of the 
suspension used by Wegel and Walther.’ Details of the 
suspension are indicated in Fig. 1. This suspension 
offers little constraint to either longitudinal or radial 
motion, and very little vibrational energy is removed 
from the sample when it is vibrating in either an odd 
or an even mode. In a test of the suspension, Q’s of 
3 500 000 were measured for the second (50 kc/sec) 
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Fic. 1. Schematic diagram of the sample chamber, showing 
details of the sample suspension, the electrostatic drive and 
pickup, and the vacuum seal. 


5 R. L. Wegel and H. Walther, Physics 6, 141 (1935). 
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and third (75 kc/sec) harmonics of a recrystallized 24S 
alloy aluminum rod, at 4.2°K in vacuum, using the 
electrostatic drive and pickup system of Figs. 1 and 2. 
The equality of the Q’s of these even and odd harmonics 
suggests that the hysteresis attenuation of the specimen 
was being measured, and that the suspension was not 
affecting the measurement. The ultimate Q capable of 
being measured with this support has not, as yet, been 
determined. 

The sample was excited into longitudinal resonance 
by applying a potential alternating at half the resonant 
frequency of the specimen between the lower electrode 
and the grounded specimen. The drive system consisted 
of a very stable beat-frequency oscillator and a voltage 
amplifier capable of supplying 300 volts rms at low 
impedance. 

The resonance of the sample was detected by an 
electrostatic pickup at the upper electrode. In order 
not to swamp the electrode-to-sample capacitance 
(<3 ywyf) with cable capacity, the cathode follower 
that provided the high input impedance and the 200 
volts dc bias for the detector was mounted in the sample 
chamber. As is indicated in Fig. 2, the 1AC5 sub- 
miniature filamentary pentode was biased by a free-grid 
arrangement in which no grid leak is used and almost 
infinite impedance is presented to the signal.? The tube 
and the Continental Carbon 1-watt metal film resistor 
and the tubular capacitor of the screen grid circuit have 
withstood the shock of sudden cooling with liquid 
nitrogen or liquid helium at least 50 times without 
damage. The cathode follower dissipated a total of 0.1 
watt into the sample chamber and thus presented little 
difficulty in an experiment of this kind.” 

The signal was brought out of the cryogenic apparatus 
through Microdot microminiature hermetically sealed 
coaxial connectors, and so as not to load down the 
1AC5 cathode follower with cable capacity, was fed 
into another cathode follower before being amplified 
by either a variable band-width system or a tuned rf 
amplifier. The frequency doubler indicated in Fig. 2 is 
a tuning aid for the narrow-band system. The amplified 
signal was presented on a voltmeter, an oscilloscope, 
and a high-speed level recorder, and the frequency was 
indicated by a frequency counter. With the drive 
shorted, the exponentially decaying resonance of the 
specimen was recorded as a straight line over a 25-db 
range by the Bruel and Kjaer logarithmic level recorder. 

The sample was enclosed in a vacuum chamber 
suitable for use in liquid helium. Figure 1 indicates 

9 J. K. Hilliard and J. J. Nobel, Proceedings of the Institute of 
Radio Engineers Prof Group on Audio AU-2 November, 
December, 1954. 

© An alternate electrostatic detection technique avoiding this 
heat leak would employ a cathode follower remote from the 
cryogenic equipment, connected to the pickup electrode by a 
doubly shielded cable, the inner shield of which is attached only 
to the cathode of the cathode follower. This method is described 
by P. S. Veneklasen, J. Acoust. Soc. Am. 20, 807 (1948), in 


connection with a condenser microphone at a distance from its 
preamplifier. 
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Fic, 2. Electrostatic drive and pickup circuit for reverberation 
decay measurements. 


how the vacuum seal between the flange and the brass 
container was obtained. A polished 2S aluminum 
gasket was squeezed between the flat stainless steel 
flange and the rounded stainless steel lip of the con- 
tainer by tightening the 16 steel screws of the screw-on 
cap down against the top of the flange. This arrange- 
ment requires a diameter considerably smaller than 
that of the usual flange with outside bolt holes. The 
sample chamber was evacuated and refilled with helium 
gas at a pressure, less than 1 mm Hg, sufficient to effect 
good heat transfer between the specimen and its sur- 
roundings, and yet not damp it acoustically. 

The sample chamber, at the end of a 4-foot long, 
l-inch diameter thin-walled stainless steel tube, was 
immersed in a Pyrex double Dewar system. A range of 
77° to 44°K in liquid and solid nitrogen and 4.2° to 
1.05°K in liquid helium could be attained. In the 
intervening region the temperature in the sample 
chamber was controlled by having cold helium gas flow 
past it from a pool of liquid helium and activated 
charcoal below the sample chamber. The temperature 
was regulated by a 50-liter/sec mechanical pump with 
a system of valves, and an electronic thermoregulator. 

Temperature in the sample chamber was measured 
with a 100-ohm, }-watt Allen Bradley composition 
carbon resistor and a copper-constantan thermocouple. 
The resistor was measured with a precision Wheatstone 
bridge in the usual 3-wire connection. The bridge leads 
were copper and the battery lead constantan," so that 
a copper-constantan thermocouple was formed without 
the use of additional wires. The thermocouple could be 
connected to a White potentiometer by throwing a 
DPDT switch. The resistance thermometer was cali- 
brated at two points in liquid helium and at the triple 
point of nitrogen, according to the formula of Clement 
and Quinnell.’ 


1 Eventually the constantan wire will be replaced by a gold- 
cobalt one. 

2 J. R. Clement and E. H. Quinnell, Rev. Sci. Instr. 23, 213 
(1952). 
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(b) Electrical Conductivity Measurement 


The electrical conductivity sample, 40 inches of 
0.078-inch diameter wire, was prepared from a billet 
cut from the forged bar next to the material used for 
the acoustic sample and extruded through a carefully 
cleaned hardened steel die. The wire was cleaned, 
etched, coiled, and annealed in vacuum together with 
the acoustic sample. Re-etching indicated that the wire 
recrystallized into randomly oriented 0.2-mm grains. 

The sample was connected in a four-terminal circuit 
and measured by the potentiometer method. The 
current leads, copper strips 0.25 inch wide and 0.01 
inch thick, entered the helium Dewar through a liquid 
nitrogen bath and were further cooled by cold helium 
gas flowing through heat exchangers soldered to them. 
For a 25-amp current the helium loss was about 250 
cm’ per hour. 

The potential was measured in 1-microvolt steps by 
a White potentiometer. At 4.2°K the potential drop 
across the sample was almost 200 microvolts at 10 amp, 
yielding an accuracy of 3%. 

Measurements were made over a range of 70° to 
3°K, using the same Dewar system and temperature 
control and measurement methods described for the 
acoustic experiment. 


4. RESULTS 


Figure 3 shows the acoustic attenuation data for the 
third harmonic of the rod as a function of temperature, 
superimposed on the measured electrical conductivity 
curve. The proportionality of the attenuation and the 
conductivity is quite evident. The discrepancy which 
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Fic. 3. Acoustic attenuation, due to the electron-lattice inter 
ation, of the third harmonic (79.1 kc/sec) of a 99.995% aluminum 
rod, superimposed on its electrical conductivity curve. 
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TABLE IJ. Ratios of experimental to theoretical attenuation. 


Harmonic 1 2 3 4 5 


26.5 52.9 791 1049 130.1 
148 151 150 1.56 1.55 


Frequency, kc/sec 
ex pt/ar heor 





becomes apparent at temperatures exceeding 30°K is 
attenuation caused by dislocations. Because the sample 
was well annealed, this discrepancy is only 5% at 30°K. 
It must be emphasized that no background losses have 
been subtracted. 

Similar agreement between the shapes of the attenu- 
ation and the conductivity curves also was obtained 
for the second, fourth, and fifth harmonics. The 
fundamental mode was measured only at 4.2°K. 

The results for the five frequencies used are compiled 
in Table I in terms of the ratios of the measured 
attenuation to that calculated from Eq. (4) using 3 
electrons per atom. The Young’s modulus wave 
velocities used were computed from the low-tempera- 
ture dimensions and resonant frequencies of the sample. 
The ratios for the different modes were obtained from 
the best fit of the attenuation data to the conductivity 
curve except for that of the fundamental which was 
computed from the 4.2°K data. 

A weighted average of the ratios indicates that the 
measured attenuation is 1.52 times greater than pre- 
dicted by the theory. A comparison of the ratios for 
the different harmonics shows that the measurements 
follow an f* law within the accuracy of the experiment. 


5. SUMMARY AND CONCLUSIONS 


The experiment has demonstrated that in aluminum 
the electron-lattice interaction attenuation is indeed 
proportional to the electrical conductivity and that it 
increases with the square of the frequency. However, 
using the maximum number of three free electrons per 
atom available for aluminum, the calculated attenu- 
ation is still only two thirds of the measured attenu- 
ation.” That this is a real discrepancy and not an 


EDWARD 


LAX 


apparent one due to a difference in conductivity of the 
two samples is unlikely because of the extreme care with 
which they were prepared. Nor can grain orientation 
differences be a source of error because (1) aluminum 
is almost isotropic and has a nearly spherical Fermi 
surface, and (2) the samples had a random crystalline 
arrangement. 

It is possible that the disagreement may be traced 
back to the assumption of Eq. (2) that the electron 
mean free path is the same for the acoustic process as 
for the electrical conductivity. Steinberg!® has derived 
an acoustic mean free path for transverse waves one 
third of the electrical one, but has not evaluated the 
mean free path for other types of waves. 

For real metals in which the electrons are not free 
but have band structure, Pippard® points out that if 
the electrons occupy more than one zone (as occurs for 
aluminum) the mean free paths for the attenuation 
and conduction processes could be quite different. This 
would be equivalent to a bulk viscosity (which we have 
assumed is zero here) and would not appear in trans- 
verse wave measurements. A measurement comparing 
the shear wave attenuation with the longitudinal or 
Young’s modulus wave attenuation, or with the 
electrical conductivity, may clarify the problem of the 
anomalous electron-lattice attenuation of aluminum. 
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13 Measurements in aluminum by Filson are in quantitative 
agreement with the results presented here (see reference 7). 

144V. Heine, Proc. Roy. Soc. (London) A240, 340 (1957). 

16 M.S. Steinberg, Phys. Rev. 109, 1486 (1958) ; 111, 425 (1958) . 
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Measurements have been made of the Curie points and magnetic moments of three series of compounds 
(Laves phases) MIre, MOsz, and MRuz, where M is a rare-earth element. The Curie point is highest (85°K) 
for the compounds containing Gd and falls away when the rare earths have larger or smaller atomic numbers. 
This dependence of Curie point on the rare-earth element is in accord with the idea that the principal 
exchange interaction is between the spins of the 4f shell of the rare-earth ions and the conduction electrons. 
The magnetic moments are closely related to, but differ somewhat from, the values of Jg for the trivalent 


ions of M. 


INTRODUCTION 


HE importance of the exchange interaction be- 
tween conduction electrons and unfilled ion cores, 
as proposed by Zener, has been applied to the iron 
group (s-d interaction) and rare earth group (s-/ inter- 
action) by Kasuya.! He concludes that in the rare 
earths this kind of interaction, in which the 4/ spins are 
predominant, is the most important. The results to be 
described in this paper give good experimental evidence 
in support of this conclusion. 

Matthias, Suhl, and Corenzwit? have shown that 
the compounds MRuz are ferromagnetic when M= Pr, 
Nd, and Er. We have measured the Curie points and 
magnetic moments of most of the rare-earth compounds 
of this kind with Ru, Os, and Ir. 


MATERIALS 


The metallic rare-earth components were obtained 
from Research Chemicals and said to be 99.0 to 99.5 
pure. Most of the compounds were prepared by melting 
in a small arc furnace in an argon atmosphere, with a 
loss of weight during melting of 1 to 3%. The com- 
pounds containing Eu, Sm, and Yb were prepared by 
sintering the pressed powders at 800° to 1050°C. 

In order to investigate the sensitivity of Curie point 
and magnetic moment to composition, two compounds 
of composition NdIr;s and NdIr2.2 were prepared; 
results showed that the magnetic data were not sensitive 
to variation of stoichiometry. In preparing the Tm-Ir 
compound there was a loss in weight by evaporation of 
about 15% of Tm; the molar moment was then calcu- 
lated on the basis of TmIrz,35. The effect of the variation 
in composition on the Curie point was not investigated 
for the latter material; x-rays showed only Laves- 
phase structure. X-ray examination of the Yb prepara- 
tion showed only partial conversion to Laves-phase 
YbIre, consequently the data for this compound are 
uncertain. 

The structures of these compounds have been studied 
by Compton and Matthias’; some have the cubic 


1T. Kasuya, Progr. Theoret. Phys. (Kyoto) 16, 45 (1956). 

2 Matthias, Suhl, and Corenzwit, Phys. Rev. Letters 1, 92 
/, B. Compton and B. T. Matthias, Acta Cryst. 12, 651 

(1959). 


Laves-phase structure C15 and others the hexagonal 
Laves-phase structure C14. In addition to those already 
reported, all of the MIr2 compounds are cubic, and all 
of the MRuy and MOs, are hexagonal when M is 
heavier than Nd. X-ray powder diffraction photographs 
of the Ir compounds show no evidence of other phases; 
those of Os and Ru indicate a trace of hexagonal metallic 
Os and Ru. 


MEASUREMENTS 


Magnetization was measured with the pendulum 
magnetometer previously described,‘ in fields up to 
12 500 oe at temperatures from 1.3°K to 30° or more 
above the Curie point. In extrapolating the magnetiza- 
tion to 0O°K and H= © use was made of the 1/H rela- 
tion; the amount of the extrapolation was usually less 
than 10%. 

The Curie points, 6, were determined by the classical 
method of Weiss and Forrer. When the magnetization 
was less than about half of the magnetization at 0°K, 
the results were extrapolated at constant magnetization, 
oi, to H=0 and the corresponding temperature, 7}, 
noted. The straight line o,? vs 7; was extrapolated to 
o,=0, at which point 7,;=6. Accuracy was usually 
about 1°. As nearly as could be determined the points 
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Fic. 1. Typical data for determining Curie points and Bohr 
magneton numbers. Dotted lines show determination of 6 by 
extrapolation of o;? vs T, 0; being o for H=0, 
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Fic. 2. Experimental values of @ for three series of compounds 
(points) and theoretical values (lines). See text. 


of inflections of the o, vs T curves for H=4000 were 
at T=6. 
RESULTS AND DISCUSSION 

Data for two compounds, from which values of the 
Curie point @ and the Bohr magneton number ng are 
deduced, are shown in Fig. 1, as typical of the results. 
Plots of 6 and mg for the three series of compounds are 
given in Figs. 2 and 3. 

It seems reasonable to explain the observed ferro- 
magnetism on the basis of an indirect exchange inter- 
action of the f shell spins via the conduction electrons. 
Such an interaction takes the form >-4;;S,;-S,, where 
Aj; is the Ruderman-Kittel exchange potential between 
spins at positions 7 and 7. A,; is an oscillatory function 
of distance with diminishing amplitude. It is calculated 
by eliminating the electronic spins from the full Hamil- 
tonian to second order. In the present case, there is 
good reason to believe that the f-shell is only slightly 
aware of the crystal field, so that the angular momentum 
is not quenched. Except in the case of gadolinium 
(L=0) we must therefore take into account the spin- 
orbit coupling in the calculation of the range function, 
and pay heed to the fact that J is a good quantum 
number. The result has been submitted for publication 
previously.® 

It is found that the long-range part of the potential 
is best corrected by replacing S$ by its projection on the 
lowest J state. The short-range part is unaffected. In 
fitting the results to a simple Weiss field model we 
must therefore decide whether the long-range or short- 
range part dominates the effective Weiss field. This is a 
difficult question, but if we decide in favor of the long 
range part, and accordingly correct for the spin-orbit 
coupling, we find that the Curie point is proportional to 

J (J+1)(g—1)?, (1) 
where g is the gyromagnetic splitting factor appropriate 
to the ion in question.® Had we taken “spin only” into 
account, it would have been proportional to S(S+1). 
Figure 2 shows the experimental Curie points, the 
(normalized) points from formula (1), and those for a 


®R. Brout and H. Suhl, Phys. Rev. Letters 2, 387 (1959). 
6 Also derived by P.-G. de Gennes, Compt. rend. 247 (1958). 
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Fic. 3. Bohr magneton numbers of three series of compounds, 
compared with theoretical values for M** for total moment and 
for spin only. 


“spin only” dependence. The case of Eu, here as in 
other experiments of similar nature, is anomalous and 
will be dealt with later. 

The Curie points of the Pr compounds are con- 
sistently above those for Nd, though the theory as 
presently worked out gives no reason for this. 

At low temperatures the Ce compounds show small 
spontaneous moments of 0.003 to 0.01 Bohr magneton, 
determined by extrapolating the linear portion of the 
o vs H to H=0. The Curie points of these small moments 
have so far not been determined, but that of Celre is 
known to lie considerably above that of PrIre. 

It is noteworthy that the compound CeRuz is super- 
conducting below 5°K and weakly ferromagnetic above 
this temperature. 

The values of the molecular moments (Fig. 3) are 
generally less than the Jg expected for the free trivalent 
ions. Partial quenching of the orbital moments is 
indicated by the fact that, at least in the upper half 
of the rare-earth series, mg lies between 2S and Jg. 
The moments of the rare-earth garnets, M;FesO., 
have been observed by Pauthenet?’ to vary in the same 
way; this indicates that the quenching of the orbital 
moments is almost the same in the materials we have 
examined as in the garnets. Since the variation of the 
Curie points with the rare-earth elements is quite 
different in the intermetallic compounds and in the 
garnets, we conclude that the mechanism of exchange 
interaction is quite different in the two cases, namely 
interaction between ionic spins and conduction electrons 
in the former case and superexchange in the latter. 
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Note added in proof.—The data for Eulr, are open to 
reinterpretation ; possibly this material is paramagnetic 
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7R. Pauthenet, Compt. rend. 243, 1737 (1956). 
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Solid solutions of chromium, manganese, iron, or cobalt in titanium raise its superconducting transition 
temperature by almost an order of magnitude. At very small concentrations this is much more than could 


be expected from a variation of electron concentration. 


T has been assumed for a long time that “‘ferro- 

magnetic impurities” lower the transition tempera- 
ture in superconductors. Impurities or small amounts of 
the magnetic transition elements from chromium to 
nickel dissolved in superconductors would on the basis 
of this assumption be expected to lower the super- 
conducting transition temperatures. Until recently no 
systematic data have been available. Recent results! 
for solid solutions of rare earth metals in lanthanum 
showed that rare earth atoms with magnetic moments 
did depress the superconducting transition temperature 
very strongly. 

One would have expected this depression to be pro- 
portional to the effective moment if the magnetic field 
due to this moment would have been responsible for 
lowering the superconducting transition temperature in 
the same way as an external magnetic field would have 
done. But it turned out that the depression of the super- 
conducting transition temperature was essentially pro- 
portional to the projection of the electronic spin on the 
lowest J-state and not related to the total magnetic 
moment. This led to the conclusion that it was an 
exchange interaction between the conduction electrons 
and the rare earth spins which caused the depression. 
This idea then suggested the possibility that the transi- 
tion elements, especially those with a large electronic 
spin, would act in a way similar to the rare earths in 
superconductors. 
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Fic. 1. Superconducting transition temperatures of 


chromium solid solutions in titanium. 


1 Matthias, Suhl, and Corenzwit, Phys. Rev. Letters 1, 92 
(1958). 


We have now carried out experiments which show 
that this conclusion is not borne out. We had shown 
previously that alloys of zirconium with iron, cobalt, 
or nickel had higher superconducting transition tem- 
peratures than pure zirconium.” We have now observed 
the superconducting behavior of the solid solutions of 
chromium, manganese, iron, and cobalt in titanium, and 
have again shown that the transiton temperatures in 
every case is raised, and not lowered, by dissolving these 
magnetic impurities. Solid solutions were prepared by 
melting the elements together in an argon arc furnace. 
Each sample was remelted at least six times. While 
liquid the melts were rolled around to insure homo- 
geneity. In order to avoid precipitation, the samples 
were not annealed. It was found that cold working 
changed the superconducting transition temperature, 
particularly of those solid solutions with compositions 
close to the a-8 transition region. For this reason con- 
siderable care was used in preparing the samples for 
measurement. The width of the superconducting transi- 
tions ranged between 0.1° and 0.25°, which indicated 
that no new or additional strains except those of 
quenching were introduced.. In these solid solutions the 
body-centered cubic 8 phase of titanium was stabilized 
at room temperature if the concentration of the transi- 
tion element exceeded values that ranged from 5 atomic 
percent to 8 atomic percent. 

Superconductivity was observed by mutual induc- 
tance measurements. The most significant result is that 
in every case the superconducting transition tempera- 
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Fic. 2. Superconducting transition temperatures of manganese 


or rhenium solid solutions in titanium. The 6 values of 3.5 and 7 
at% Mn are isolated and unreproducible points. 


2B. T. Matthias and E. Corenzwit, Phys. Rev. 100, 626 (1955). 
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Fic. 3. Superconducting transition temperatures of iron or 
ruthenium solid solutions in titanium. 


ture of titanium is greatly increased by dissolving small 
amounts of the transition elements in it. In the case of 
nickel additions, at least 9 atomic percent Ni was 
necessary to raise the superconducting transition point 
above 1°K, and such melts are still hexagonal. 

The superconducting transition temperatures of the 
various solid solutions in titanium are shown in Figs. 1 
to 4. For comparison, the superconducting transition 
temperatures of solid solutions in titanium of transition 
elements from the 4d and 5d shells of the periodic 
system are also shown. Although these other solute 
elements have the same number of valence electrons, 
they do not have the magnetic moments of the corre- 
sponding elements in the 3d row. 

At large percentages, the results are the same for 
atoms with and without magnetic moments. The maxi- 
mum transition temperatures range between 3.5° and 
4.5°K; at the compositions where the maxima occur 
the average number of valence electrons per atom lies 
between 4.4 and 4.7. In this range of concentration, it 
seems to be quite irrelevant whether these valence 
electrons are contributed by atoms like manganese, or 
by atoms of ferromagnetic elements like iron or cobalt, 
or by atoms of superconducting elements like rhenium. 

The rare earth elements had the opposite effect.' One 
or two percent of a rare earth in solution lowered the 
superconducting transition temperature of lanthanum 
from close to 6°K to below 1°K; slightly larger con- 
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Fic. 4. Superconducting transition temperatures of cobalt or 
rhodium solid solutions in titanium. 


centrations caused ferromagnetism to occur.’ As men- 
tioned above, these effects of the rare earth ions are 
almost certainly due to the interaction of the 4f spins 
with the conduction electrons.’ Since no such effects 
seem to occur with the spins of the 3d electrons, it 
seems probable that either (1) the s—d interaction is 
much weaker, or (2) it is always such as to give anti- 
ferromagnetic coupling, or (3) some other interaction, 
always antiferromagnetic, overrides the s—d exchange. 

Unlike the rare-earth case, small percentages (<3 
atomic percent) of the transition elements with a 
magnetic moment raise the superconducting transition 
temperature of titanium when dissolved in it, much 
more than would correspond to the same number of 
valence electrons but with negligible magnetic moment. 
This is illustrated by comparison with the elements of 
the second and third row of the periodic system that 
contribute the same number of valence electrons but 
no magnetic moment to speak of. From the experi- 
mental data it appears that no dilute solution ferro- 
magnetism occurs in any case of the transition elements 
below 30 atomic percent. 


ACKNOWLEDGMENTS 


We would like to thank P. W. Anderson, A. M. 
Clogston, J. K. Galt, C. Herring, and R. Brout for 
their interesting discussions. 

We would like to thank J. K. Galt for pointing out and 


impressing on us the significance of this ferromagnetism in dilute 
solution. 





PHYSICAL REVIEW VOLUME 


12S), 


NUMBER 6 SEPTEMBER 15, 1959 


Positron Annihilation in Helium* 
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Positron mean lives were measured in helium gas over a wide temperature range at densities from 4.6 
to 534 times the STP density and in liquid helium at 4.2 and 5.1°K. The orthopositronium mean life in 
the liquid at 5.1°K was found to be (9.4+0.6) X 10-8 sec which differs little from the lifetime in liquid at 
4.2°K. Except at very low temperatures, the pick-off rate in gas agreed moderately well with theory. The 
low pick-off rate associated with orthopositronium in liquid was observed also in the gas at temperatures 
below 9°K for densities greater than 250 times the STP density. 


INTRODUCTION 


HE discovery of orthopositronium in liquid 
helium!” raised the question of why its mean life 
(10-7 sec) is so much longer than would be expected 
on the basis of the liquid density. Ferrell has pointed 
out® that it is energetically favorable for a positronium 
atom to be in a region where the local helium density 
is less than average. He postulated that once in this 
region, the positronium creates a little bubble in the 
liquid. The pick-off rate, and thus the observed mean 
life, then should be characteristic of the density of the 
saturated vapor in the bubble. A sensitive test of the 
bubble theory is provided by a measurement of ortho- 
positronium mean life in liquid helium near the critical 
point where the calculated mean life is about 3 the 
mean life in the liquid at 4.2°K. 

A twofold experimental program naturally suggested 
itself. First, the orthopositronium mean life was meas- 
ured as a function of gas density to see if the pick-off 
rate corresponded to Ferrell’s calculation. Second, the 
mean life was measured in liquid helium near the 
critical point to check the bubble theory. 


EXPERIMENTS 


Positrons from a Na*Cl source sandwiched between 
thin aluminum foils passed into helium confined in an 
annihilation chamber. The size of the chamber was 
chosen to facilitate the lifetime measurement at the 
pressure and temperature being investigated. In all, 
five different chambers were employed. They ranged in 
size from 6 inches diameter and 7 inches long for low 
pressures at room temperature down to 3 inch diameter 
and 1} inches long for high pressures at liquid nitrogen 
and liquid helium temperatures. The chamber for 
experiments in the temperature region 4 to 50°K was 
isolated from the liquid helium bath by a vacuum wall 
and provided with an electric heater. By regulating the 
current through the heater, the heat leak to the bath 


* Part of Ph.D. thesis submitted by T. B. D. to the University 
of Kansas. 

ft Present address: Midwest Research Institute, Kansas City, 
Missouri. 

1—D, A. L. Paul and R. L. Graham, Phys. Rev. 106, 16 (1957). 

2 J. Wackerle and R. Stump, Phys. Rev. 106, 18 (1957). 

3R, Ferrell, Phys. Rev. 108, 167 (1957). 


could be compensated thus controlling the temperature 
of the helium inside the chamber. Before each experi- 
ment, the chamber was repeatedly exhausted to a fore 
pump vacuum and flushed with helium. 

Mean lives were measured in the usual way from a 
plot of coincidence rate versus the delay time between 
the 1.28-Mev nuclear gamma ray emitted simultane- 
ously with a positron and one of the subsequent 
annihilation photons. Gamma rays were detected by 
plastic scintillators mounted on photomultiplier tubes 
whose signals were fed into a delay time to pulse-height 
converter. A 256-channel pulse-height analyzer recorded 
the converter pulses. Signals from the last dynode of 
the photomultiplier tubes were sent to discriminator 
circuits for identification. The discriminators were set 
so that if a nuclear gamma ray was detected by one 
photomultiplier and an annihilation gamma ray by the 
other, then pulses were sent to a 2-way slow coincidence 
circuit which controlled a gate between the converter 
and the pulse-height analyzer. 


RESULTS 


The coincidence rate versus delay time curves usually 
indicated the presence of three components in the decay 
of positrons. First, there was a short-lived (~10-" sec) 
component attributed to the decay of positronium in 
the 1S state. Second, there was an intermediate life due 
to the annihilation of free positrons. Third, there was 
a long-lived (~ 10-7 sec) component due to positronium 
in the *§ state (orthopositronium). The latter two 
components of positron decay are plotted as a function 
of helium density in Fig. 1. The unit of density used is 
the amagat, the density of the helium gas under STP 
conditions. Near the critical point it was not possible 
to make a straightforward density calculation from p, T 
data. It was possible, however, to assign limits to the 
density for these data. To avoid crowding the figure, 
typical standard deviations are shown for only a few 
points. 

The group of points (between light parallel lines) 
which are in inverse proportion to the helium density 
with no temperature dependence being evident is 
identified as belonging to the annihilation of free 
positrons. Of the remaining data in Fig. 1, those taken 
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Fic. 1. Lifetimes in helium vs density. Heavy curve indicates 
calculated values for picked-off orthopositronium. Points P and 
W are from references 1 and 2, respectively, and are included for 
comparison with points D, the data taken in liquid helium at 
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in gas at temperatures from 9°K up to room temperature 
appear to agree moderately well with Ferrell’s calcu- 
lated values (heavy curve) for picked-off orthoposi- 
tronium. For temperatures below 9°K, the data do not 


agree with the theoretical values but are about the 
same as for the liquid. 

The mean life zz of orthopositronium measured in 
liquid helium at 4.2°K is (11.2+0.6) X 10~ sec, in good 
agreement with previous experiments.!* The mean life 
in the liquid just below the critical point at 5.1°K is 
(9.4+0.6)X10-* sec which does not agree with the 
prediction of the bubble theory. 

The annihilation cross section @ is defined by A= nov, 
where \ is the annihilation rate, is the number of 
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helium atoms per unit volume, and »v is the average 
relative velocity of positron and helium atoms. From 
the experimental values of and X, the volume cross 
section ov may be found. For free positrons, the average 
value of o1;v= 2.4 10-" cm* sec over the whole range 
of p and T investigated. The observed annihilation rate 
for orthopositronium is \2=Ayacuum+A’, where Ayacuum 
= 7.14X 10° sec and 2’ is the pick-off annihilation rate. 
The volume cross section for pick-off ov found from 
the experimental values of \’ is approximately 2.4 
X 10-'* cm’ sec! for temperatures down to 9°K. 

It must be remembered that if the orthopositronium 
were not picked off appreciably by the helium and 
instead a quenching agent were present as a contami- 
nant, v2 would still vary in much the same way with 
increasing density. However, a calculation showed that 
to give the observed effect the contamination of the 
helium, even if it were all oxygen, would need to have 
been 10 times the maximum impurity content claimed 
by the supplier. Experimental tests for contamination 
were run at p=534 amagats. This was the highest 
density at which data were taken in the gas and thus 
where the effect of any contamination would be most 
pronounced. The test consisted of four experiments, 
two done with gas straight from the bottle and two 
performed with gas which had been allowed to seep 
into the chamber through a liquid helium cold trap 
which would have frozen out any impurity. The cold 
trap had no perceptible effect on the results, indicating 
that the orthopositronium was indeed being picked off 
by helium and not being quenched by a contaminant. 

Although the immediate goal of this inquiry was to 
settle the questions posed by Ferrell’s explanation of 
the low pick-off rate in liquid helium, the most interest- 
ing result was the discovery that a low pick-off rate is 
not a property of the liquid phase exclusively but is 
also associated with the gas at sufficiently low temper- 
atures. 
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An electrostatic analyzer with a radius of curvature of 2 meters and a deflection angle of 90° has been 
constructed and evaluated. It is used to provide an ion beam whose energy is precisely known and highly 
resolved. The absolute energy calibration is believed to be accurate to +0.05%, and the inherent energy 
resolution is 0.01% per 0.010 in. of input slit separation. Proton bombarding energies have been determined 
for (p,y) reactions on F® at 340.5+0.3 kev, 483.6+0.3 kev, 872.4+0.4 kev; Al?’ at 992.4+0.5 kev; Ni at 
1424.1+0.7 kev, 1843.7+0.9 kev; and C® at 1747.6+0.9 kev. Resonance widths have been measured for 
these reactions. They are 2.4+0.3 kev, 0.9+0.1 kev, 4.5+0.3 kev, 100+50 ev, 50+50 ev, 100-+450 ev, and 
75+50 ev, respectively. The (p,m) thresholds have been determined for Li? at 1881.2+0.9 kev and C™ at 


3237.2+1.6 kev. 


I. INTRODUCTION 


HE two-meter electrostatic analyzer is an instru- 
ment which is used in conjunction with the NRL 
positive-ion 5-Mv Van de Graaff electrostatic acceler- 
ator.! This analyzer provides a method of measuring the 
energy of the ion beam from the Van de Graaff and a 
method of sharply limiting the energy spread of the 
beam. The first of these functions is achieved by 
measuring the geometry of the analyzer and the voltage 
applied to the deflecting plates of the analyzer; the 
second function is achieved by utilizing the direction 
and velocity focusing properties of the electrostatic field. 
Early applications of electrostatic analyzers were in 
the field of mass spectroscopy. The concurrent de- 
velopments in the nonrelativistic theory for the focusing 
of charged particles by an electrostatic field formed by a 
cylindrical condenser were due to Hughes and Rojansky,’ 
Dempster,® and Herzog.‘ Herzog developed the general 
theory for the nonrelativistic focusing of charged par- 
ticles by crossed electric and magnetic fields and then 
applied the equations to the special cases of pure 
electric or magnetic fields. Later, Milfet® considered the 
relativistic focusing of charged particles by crossed 
electric and magnetic fields for the special case where the 
object and image are within the field. 

Several cylindrical analyzers have been constructed 
especially for use with Van de Graaff accelerators. Herb, 
Snowden, and Sala® give a rather comprehensive treat- 
ment of the corrections and errors that should be taken 
into account when using the instrument for absolute 
determination of beam energy. Honnold and Miller? 
have developed a relativistic theory for the focusing of 


! Dunning, Bondelid, Fagg, Kennedy, and Wolicki, report of 
Naval Research Laboratory progress, May, 1955. A detailed dis- 
cussion of the two-meter electrostatic analyzer is contained in 
Naval Research Laboratory Report NRL-5083 (unpublished). 

2 A. L. Hughes and V. Rojansky, Phys. Rev. 34, 284 (1929). 

3A. J. Dempster, Phys. Rev. 51, 67 (1937). 

4R. Herzog, Z. Physik 89, 447 (1934). 

SE. Millet, Phys. Rev. 74, 1058 (1948). 

6 Herb, Snowden, and Sala, Phys. Rev. 75, 246 (1949). 

7V. R. Honnold and W. C. Miller, Technical Report No. 2, 
University of Notre Dame, 1953 (unpublished). 


charged particles through a cylindrical electrostatic 
analyzer for the general case when the object and image 
are outside of the field. The fringing electric field effects 
have been treated by Herzog,‘ Herb ef al.,° Allison et al.,* 
and Monahan? 


II. DESIGN AND CONSTRUCTION 


Voltages of equal magnitude but of opposite sign are 
applied to the two plates of the NRL two-meter 
cylindrical electrostatic analyzer. This means that the 
geometric mean radius is at zero potential. A particle 
entering the analyzer at this point experiences no linear 
acceleration due'to the presence of the deflecting field. 
The orbit of this particle is called the mean-line orbit 
and is used to determine the absolute calibration equa- 
tion of the instrument: 


d 
V'=2Vo (1-7) (1-2). (1) 


The relativistic correction term is y=eVo/2myc’? 
and the internal magnetic field correction term is 
5C= (Bero/c)(2emyVo)~?, where V’ is the total voltage 
applied to the analyzer plates, Vo is the particle voltage 
as determined by the Van de Graaff, d is the plate 
separation, b is the arithmetic mean radius, é is the 
charge on the particle, mo is the particle rest mass, ¢ is 
the velocity of light, B is the vertical component of the 
magnetic field in the region between the plates, and 19 is 
the geometric mean radius. Gaussian units are used. The 
quantities (d/b), y, and 5C are approximations but con- 
tribute errors the order of 1 part in 10°, as discussed by 
Herb ef al.6 Corrections to be applied to the results 
obtained by use of this expression are discussed in the 
section on errors. If the correction factors are ignored, 
then the equation is simply 


V’=V(2d/b). 


8 Allison, Frankel, Hall, Montague, Morrish, and Warshaw, 
Rev. Sci. Instr. 20, 735 (1949). 
9 J. E. Monahan, J. Appl. Phys. 24, 434 (1953). 
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From this it can be seen that if the maximum voltage of 
the particle to be deflected is 5 million volts and the 
voltage applied to the deflecting plates of the analyzer 
is 50 kv, then the ratio 6/2d must be approximately 100. 
The parameter d, the plate separation, has been chosen 
to be 1 centimeter to provide good high-voltage charac- 
teristics and to reduce the percentage error in the 
measurement of this distance (d can be measured to an 
accuracy of +0.0001 inch, i.e., +0.025%). Thus 6, the 
arithmetical mean radius of the instrument, must be 2 
meters. The déflection angle was chosen to be 90° for 
convenience in fabrication, installation, and use of the 
analyzer. 

The object and image distances are chosen after 
consideration of the equation for focusing of ion 
paths*?; 


(l’/—g)(l"—g)= f°, (2) 


where {= (ro/k) csck®, g= (ro/k) cotkb, k=v2, and @ is 
the total analyzer angle. The quantity /’ is the object 
distance, measured from the object to the electrical end 
of the analyzer at the input. The quantity /” is the 
image distance and is measured from the image point to 
the electrical end of the analyzer at the output. Choice 
of either one of these quantities fixes the other. The 
quantity /’” was chosen using the following considera- 
tion: The electrical end of the deflection plates is made 
to coincide with the geometrical end by using properly 
placed shields. The problem of placing these shields is 
the “bend in a Leyden jar” problem with corrections 
applied for finite opening between the shields and for 
rounding of the ends of the plates.‘:°* The electrical 
shields at the output are used to form the image distance 
I’. In the case of the present analyzer this image dis- 
tance is approximately § inch. The object distance 1’ is 
then about 72.8 inches. Electrical shields are placed § 
inch from the geometrical ends of the deflecting plates at 
the input to the analyzer and are used as the collimating 
or limiting aperture. 

The slits forming the input aperture (object) are 
mounted on a stand which is securely bolted to the floor. 
Each slit is adjustable by means of a micrometer head 
screw which is enclosed in the vacuum. 

The deflection plates of the electrostatic analyzer, 
made from type 303 stainless steel, were rolled from a 
straight section 2 inches by 4 inches in cross section and 
machined to the final dimension, 1} inches high by 3 
inches wide. The separation between the plates is about 
0.4 inch. Spring-out occurred during machining due to 
relief of rolling stresses. When the springout leveled off, 
the plates were considered ready for the final machining 
operation. The plates are mounted on a stainless steel 
base plate which in turn is mounted on 25 6-inch square 
pads in a 5 by 5 matrix on an H-beam framework. The 
base plate is bolted loosely at each pad so that it is free 
to slide on the pads during changing temperature condi- 
tions. Supporting the deflection plates are Microy insu- 
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lators 14 inches thick to each side of which are bonded 
stainless steel disks. The bottom disk is then securely 
fastened to the base plate and the deflection plate is 
fastened to the top disk. There are 7 insulators sup- 
porting each plate. Final machining of each deflection 
plate was done in position. 

The deflection plates are covered by a brass box. The 
vacuum seal is obtained by use of a round rubber gasket. 
The vacuum within the analyzer is maintained by a 
4-inch diffusion pump on top of which is mounted a 
liquid nitrogen trap. A 4-inch line between the analyzer 
box and the trap is constructed in a manner such that 
there is no optical path between the cold surface of the 
trap and the deflection plates of the analyzer. 

A radius arm whose center of swing is located at the 
center of the radius of curvature of the analyzer plates 
holds an air gauge which is used to measure the variation 
in radius of the plates and to measure the separation, d, 
between the plates. A high-precision Timken class O 
bearing is used to maintain the center of swing of the 
arm. The air gauge is calibrated with gauge blocks. 
Figure 1 shows the plate separation as a function of angle 
at various gauge positions measured from the top of the 
plates. Over the region from the center of the plates to 
; inch above the center of the plates the variation of gap 
is +0.0002 inch. Because of this relative uniformity of 
gap spacing, the charged particle beam is brought 
through the analyzer in this region. The horizontal 
component of the electrical field over this region varies 
from 0.01% to 0.001% of the total electrical field. No 
correction is made in the final data for this variation. 
Herb et al.® have shown that the average gap, d, can be 
determined from the contour data using the expression 


v2 + 
d= : D(@) sin{v2(#—6) dé, (3) 
1—cosv2® J 


where D(@) is the separation at angle @. 

Various known points have been established on the 
base plate. By using these points and gauge blocks, it is 
possible to determine the radius of curvature of the 
plates and the angle of deflection. The arithmetical mean 
radius, 6, is 78.7468+0.002 inches and the analyzer 
angle ® is 90.00+0.02 degrees. 


Fic. 1. Variation 
in gap over the face 
of the plates as a 
function of depth. 
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III. ENERGY RESOLUTION 


An ion that enters the analyzer at a position /’ from 
the deflection plates and displaced a distance b’ from the 
mean line orbit leaves the analyzer at a distance /” from 
the deflection plates and displaced a distance 6” given 
by the expression! 


b" = (€/2)(1—M)ro+0'M, (4) 


where ¢ is the fractional deviation of the energy of the 
ion from the mean-line orbit energy, and M is the 
magnification given by M=-— (l’’—g)/f. For the two- 
meter analyzer, M is —0.6. 

It is convenient to think of the quantity 6’ as having 
limiting values to either side of the mean-line orbit, i.e., 
an aperture is placed at /’ having a width 2|dmax’|. A 
plot of 6’ vs 6” with ¢ taking on values from 0 to + €max 
will show that in the general case for a uniform distribu- 
tion of energies at /’ the output distribution will take on 
the form of a truncated triangle, and in the special case 
where Bmmax’’=Obmax'M the form of the output distribu- 
tion is an isosceles triangle. 

If now apertures are placed at /’ and 1” of total width 
w’ and w”, and w’’ is set equal to w’| M|, then Eq. (4) 
can be written (since M is inherently negative) 


w= Crass = M \ro—w”. (5) 


Since €max refers to 4 the total fractional energy spread, 
then Eq. (5) written in terms of €to¢ becomes 


€tot = 4y’’/ (1 T M)ro, 


where éto¢ represents the base of the energy distribution 
isosceles triangle. Resolution is commonly quoted as the 
full width at half maximum of a distribution curve. 
Using this criterion, the above equation shows that the 
energy resolution of the two meter electrostatic analyzer 
is 0.01% per 0.010 inch of input aperture opening. 


IV. MAGNETIC FIELD CORRECTIONS 


The earth’s magnetic field induces a magnetic field in 
the analyzer. To a first approximation only the vertical 
component of the field is effective in deflecting the ion 
beam. The component within the analyzer acts to deflect 
the beam in the same direction as the electric field. Since 
the correction to the energy introduced by the magnetic 
field is small it can be treated as uniform over the region 
of deflection. The internal magnetic field as a function 
of analyzer angle is shown in Fig. 2. Equation (1) con- 
tains the correction for the magnetic field within the 
analyzer. 

The magnetic field along the distance /’ causes an 
energy correction to be necessary because it effectively 
changes the direction of entry of the ion beam, which in 
effect shifts the source slightly off axis. The correction 
for this, given by Herb et al.,° is 

l' “lM @M\v1 
—- ro Od } 
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Fic. 2. The internal magnetic field. 


where rg=c(2meV o)$/eB.x_ and M is the magnification. 
Because l’’ is so small, the effect of the magnetic field 
here is neglected. The magnetic field between the plates 
is measured by a device that makes use of the effect of a 
magnetic bias on a core of ferromagnetic material that is 
driven to partial saturation by a sinusoidal voltage.’°" 
If there is no magnetic bias on the core, only odd 
harmonics of the fundamental will be present. If the 
core material is biased as it would be in the earth’s field, 
even harmonic components of current will be present. 
The second harmonic component can then be used as a 
measure of the magnetic field in which the material is 
placed. 


V. ELECTROSTATIC ANALYZER POWER SUPPLY 


The electrostatic analyzer requires a stable source of 
potential for the plates. Since the current drain is very 
low, batteries or a rectifier-filter type of regulated power 
supply could be used.” Batteries were not considered 
acceptable because of their bulk, temperature coeffi- 
cient, and difficulty of getting continuous voltage 
control. 

A rectifier-filter type of power supply, having con- 
tinuous voltage adjustment and adequate regulation, 
can be made. The following discussion gives the require- 
ments of the power supply, a description of the design, 
and evaluation of the power supply. 


A. Requirements 


To meet the design requirements of the electrostatic 
analyzer, the power supply must provide simultaneously 
equal positive and negative voltages up to a maximum 
total potential of 70 kv. The voltage should be continu- 
ously adjustable from zero, preferably in several over- 
lapping ranges. Voltage stability over the range of about 
7 kv to 70 kv should be better than 0.01% of the 
operating voltage for periods of several hours, and some 
means of monitoring the potential should be available. 
The error in voltage measurement should be less than 


10 Bhattacherjee, Waldman, and Miller, Phys. Rev. 95, 404 
(1954). 

1 Adams, Dressel, and Towsley, Rev. Sci. Instr. 21, 69 (1950). 

12. R. L. Henkel and B. Petrie, Rev. Sci. Instr. 20, 729 (1949). 
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IG. 3. Block diagram of electrostatic analyzer power supply. 


operating conditions should be less than 0.01% of the 
operating voltage. Inequality of the positive and nega- 
tive output voltages should be less than 0.3%. 


B. General Circuit Description 


A block diagram of the power supply system is shown 
in Fig. 3. A 400-cycle permanent-magnet generator 
driven by an induction motor is used to supply the pri- 
mary power for the high-voltage rectifier system. This 
motor-generator combination provides a relatively 
stable primary voltage at the constant load used since it 
is fairly insensitive to normal 60-cycle line voltage 
changes. Normal frequency changes do not constitute a 
problem. 

The rectifier system is a conventional full-wave 
voltage-doubler system in which the mid-voltage point 
is grounded, thereby supplying equal positive and nega- 
tive potentials at the output terminals. A capacity filter, 
having sufficient capacity to give a ripple below 0.01% 
for normal loads, is used. 

The precision voltage divider is made up of 68 one- 
megohm wire-wound resistors arranged in two spiral 
columns. Between each group of 3 resistors, there is a 
gradient disk to provide a uniform gradient down the 
column and minimize the possibility of corona discharge. 
At the ground end of each of the two columns, there is a 
series of resistors having a total resistance of 1 megohm, 
with taps at 750 ohms, 1500 ohms, and 2250 ohms for 
each of the three voltage ranges. This makes a total 
resistance of 70 megohms, 35 megohms in each column. 
The high-resistance units are made with Evanohm 
which has a temperature coefficient of resistance of 20 
parts per million per °C, and a thermal emf to copper 
of 2 ” 
wound of Manganin which has a temperature coefficient 
of resistance of 15 parts per million per °C, and a 
thermal emf to copper of 1.5 microvolts per °C. 

To minimize the effects of humidity, all the resistor 
elements are encapsulated and the entire divider as- 
sembly is housed in a sealed enclosure. A fan and 
thermostatically controlled heater are also included to 


microvolts per °C. The low-resistance units are 
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minimize the effect of ambient temperature change, and 
the effect of divider dissipation changes with voltage. 

The voltage reference system is composed of a 
Rubicon type “B” potentiometer, a standard cell, and 
a low-discharge type storage cell. Since the potentiome- 
ter was designed for operation from either a 3- or 6-volt 
source, some of the auxiliary resistors have been changed 
to make operation from a 2-volt cell possible. The 
potentiometer was also remounted for rack and panel- 
type construction, and the back was enclosed in a 
Lucite case large enough to contain the standard cell. 
The Lucite enclosure is thick enough to serve as a 
thermal shield to minimize the effect of rapid changes in 
ambient temperature on both the standard cell and the 
potentiometer. The storage cell, a Willard Type DDS-1, 
200-ampere-hour cell, is enclosed in a Lucite box having 
an additional thermal-insulating liner. A temperature 
compensating resistance of copper wire, series con- 
nected, is wrapped around the battery.’* The storage- 
cell temperature coefficient is plus 410 microvolts per 
°C. For the 11-ma load current required by the potenti- 
ometer, the required copper wire resistance is 9.55 ohms. 
The insulating box is not intended to maintain the 
battery at constant temperature, but rather to cause the 
temperature to change slowly enough for the battery 
and compensating resistance to change in equilibrium. 

The difference between the potentiometer output 
voltage and the voltage on the appropriate tap of the 
precision voltage divider constitutes the error voltage. 
This error voltage is fed to a Liston Becker type 10-A, dc 
Breaker Amplifier. The amplifier output is fed to the 
regulator, as well as two error indicators—one a panel 
meter, and the other a 1-ma Esterline-Angus recorder. 
The recorder provides a continuous record of voltage 
deviations from the reference value. 

The regulator consists of a saturable reactor con- 
nected in series with the primary of the high-voltage 
transformer. The impedance of the reactor is varied in 
accordance with the error signal, which is applied to the 
grid of the control tube in the de circuit of the reactor. 


C. Problems 


The high-voltage transformer secondary had an in- 
ductance high enough that it was self-resonant below 
the 400-cycle operating frequency. This caused the input 
impedance as seen at the transformer primary to be 
highly capacitive. The transformer current due to total 
circuit capacity loading exceeded the ratings. To mini- 
mize capacity the filament transformer for the negative 
rectifier was redesigned to provide as low a capacity 
from the secondary to the core and primary as possible. 

The proximity of 60-cycle circuits caused objection- 
able interference at the input of the breaker amplifier 
primarily due to magnetic coupling in the circuits ahead 
of the amplifier input terminals. To minimize this diffi- 
culty all possible leads were run as shielded twisted 


‘8 F, K. Harris, National Bureau of Standards (private com- 
munication). 
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pairs. The low-voltage resistors in the voltage divider 
were arranged with the ground leads in a way such that 
the area of the circuit loop was a minimum. The tem- 
perature-compensating element on the potentiometer 
source battery was wound as a bifilar winding to reduce 
magnetic coupling at this point. Interference from 60 
cycles that remained was further reduced by means of a 
low-pass filter at the amplifier input. 

In the initial design, regulation was not possible over 
the required range of the two higher voltage tap posi- 
tions. This difficulty was due to series resonance opera- 
tion at the 400-cycle power-supply frequency of the 
transformer-saturable reactor circuit. Near the low- 
voltage end of each of these ranges, the impedance would 
swing from the inductive region through resonance to 
the capacitive region as the operating voltage was 
lowered. This phase reversal, from a lagging to a leading 
impedance, required a reversal of the error signal to 
maintain regulator operation. The problem was not 
encountered on the low-voltage range, because the 
saturable reactor inductance was sufficiently high to 
produce series resonance well below the power-supply 
frequency over the entire operating range. High-in- 
ductance operation of the saturable reactor on the higher 
voltage ranges was not feasible because of the limits in 
the source voltage. The problem was solved by shunting 
the transformer primary with an inductance of low 
enough value to produce resonance with the transformer 
well above 400 cycles over the entire operating voltage 
range. Since the load seen by the saturable reactor is 
now inductive, there is no resonance phenomenon and 
regulation is possible anywhere in the operating range. 


D. Test and Evaluation 


In order to evaluate the power-supply regulator, a 
comparison was made between the terminal voltage 
stability with and without the regulator in operation. 
Without the regulator, the voltage fluctuated about 
+0.75% of the terminal voltage at 10 kv. With the 
regulator, the fluctuation from a cold start was +0.02% 
at 10 kv, an improvement of 38 times. Typical values of 
short-term regulation, after warmup, range from about 
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Fic. 4. Loop gain of regulator system. 


RESONANCE WIDTHS 
TABLE I, Resistance ratios. 


Right stack 
% of NR 


Computed Measured 


Left stack 
% of NR 


Computed Measured 


Tap 


Nominal ratio 
No. (NR) 


99.69 
99.68 
99.69 


99.69 
99.68 
99.69 


99.78 
99.79 
99.78 


99.78 
99.78 
99.78 


46 666.7 
23 333.3 


15 555.5 


+0.003% of the terminal voltage on the 25-kv range to 
+0.007% on the 75-kv range. 

The gain characteristics of the regulator loop are 
shown in Fig. 4. The maximum gain of about 430 occurs 
at de, and falls to 1 at about 1.5 cycles. The phase shift 
is 360° at about 4 cycles, and with the phase shift 
characteristics of the system limits the maximum dc 
gain to about 430, 

When completed, the precision voltage divider was 
sent to the National Bureau of Standards for calibra- 
tion. This calibration was in two parts. The first con- 
sisted of measurement of the divider ratio at voltage 
increments from 5 kv to 35 kv on each of the two 
columns for all three tap positions. Measurements were 
made with the heaters operating normally, and with the 
heaters off. The maximum variation indicated is 0.02% 
and the limit of measurement precision was 0.01%. 
With the heaters operating, the air temperature within 
the enclosure was about 33°C, and varied about 2°C 
from zero voltage to maximum divider dissipation. The 
second set of measurements consisted of calibration of 
the individual resistors at room temperature in a low- 
voltage bridge. To compare the two calibration results, 
the individual values were totalled and the divider ratio 
computed. These ratios were compared with the average 
of the room temperature high-voltage measurements. 
Agreement (see Table I) was +0.01%, or better, for all 
three tap positions on each stack. 

The source battery for the potentiometer was checked 
to determine voltage drift due to the continuous po- 
tentiometer load. Under this load, the voltage drops 
about 225 wv/day, or 9.4 wv/hour. The change in po- 
tentiometer calibration due to source change is then 
about 4.7 10% per hour after potentiometer calibra- 
tion against the standard cell. 

The potentiometer used for voltage reference and 
measurement has a calibration accuracy of +0.01% 
above the 0.3-volt tap. In normal operation, 0.5 volt is 
the lowest tap position used. The standard cell used for 
standardizing the potentiometer is the Weston unsatu- 
rated type. It is housed in the potentiometer enclosure, 
which serves as a thermal shield for both. This shield 
minimizes the chance of rapid and unequal changes in 
the temperature of the cell. Under these conditions, the 
temperature coefficient is negligible. The potentiometer 
standardization sensitivity is 0.269 uv per uv of change 
in source battery voltage. Using the error amplifier as 
the detector for standardization, the potentiometer can 


be set to within 5X 10% of standard cell potential. 
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TABLE II. Precision of measurement of electrostatic 
analyzer voltage. 


Limit of error 
(% 


+0.01 
+0.0005 
+0.005 
+0.005 
+0.005 
+0.01 
+0.005 
+0.018 


Potentiometer accuracy 

Potentiometer setting relative to standard cell 
Setability 

Standard cell accuracy 

Temperature change 

Voltage divider 

Zero drift in amplifier 

Probable error in voltage 


The precision of voltage measurement is affected by 
accuracy of the potentiometer, the voltage divider, and 
the standard cell, as well as the stability of these com- 
ponents. Precision is as shown in Table II, considering 
the errors as random. 


VI. ERRORS IN ABSOLUTE CALIBRATION 


Examination of the equation for the mean-line orbit, 
Eq. (1), shows that the errors for absolute calibration 
arise from the errors in the determination of V’, d, 6, 
and B. Since this equation is based on the supposition 
that the orbit is the mean-line orbit, additional errors 
arise from the deviations from the mean-line orbit. 

As concluded in the previous section, the error in the 
voltage determination is +0.018%. The radius is de- 
termined by piling gauge blocks. From the reproduci- 
bility and estimates of the gauge block error, the 
uncertainty in the radius is taken to be +0.002 inch. 
Since the radius is about 78 inches, the error is +0.003%. 
The gap measurement is made up of 91 readings at 4 
vertical positions for a total of 364 separate measure- 
ments. The reproducibility of the average is better than 
0.00005 inch even after the gauge has been removed, 
then reinstalled and recalibrated. The total error in the 
gap is taken to be +0.0001 inch. Since the gap is 0.4074 
inch, the error is +0.025%. An error introduced by 
vacuum deformation of the base plate is included in the 
stated gap uncertainty. The internal magnetic field is 
measured to a precision of + 10%. Since it contributes a 
correction of about 0.05% for a 1-Mev proton, the error 
is 0.005%. 

The external magnetic field along /’, discussed in a 
previous section, tends to make the source appear to 
come from a position off the mean-line orbit at the 
input. This introduces a correction of about 0.014% for 


or 


TaB_e III. Summary of errors in energy determination (in %%). 


+0,018 
+0.003 
+0.025 
+0.005 
+0.001 
+0.012 
+0,001 
+0.012 


Voltage 

Radius 

Gap 

Bint 

Boxt 

Slits, input, output lateral position 
Slits, relative to plates 

Analyzer angle, ® 


+0.035 


Rms total 
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a 1-Mev proton. Since the magnetic field can be 
measured to +10%, the uncertainty is +0.0014%. 

The lateral positioning of the slits at the plates can be 
done reproducibly to 0.0001 inch. The lateral positioning 
of the slits at the input, or source, can be done to 0.010 
inch. Examination of Eq. (4) shows that an estimate of 
the lateral positioning error (€,) of the slits can be made. 
Assuming a point source and the uncertainties given 
above, €p is 0.01%. The effect of moving the input slits 
laterally by 0.100 inch was checked by observing the 
apparent energy shift in a strong resonance in the 
Al?" (p,y)Si?8 reaction. The data substantiated the above 
calculation. That a lateral positioning error of several 
thousandths of an inch in the limiting aperture at the 
input to the plates is negligible was checked experi- 
mentally by looking for an apparent energy shift in this 
resonance. 

The distance of the shields relative to the ends of the 
plates can be set to +0.001 inch by the use of a specially 
ground plate. An error here has the following effects: 
An error in the end correction changes the effective 
angle of the analyzer and at the output changes /’’. The 
effect of these changes on the focusing condition, Eq. 
(2), is small enough to be neglected. Another effect is on 
the condition of tangency required of the mean-line 


BAFFLE GOLD TRAP INSULATOR VALVE. TARGET 


Fic. 5. The target 
holder. 


orbit at the input to the plates. A positioning error of 
0.001 inch reflects an error of about 0.001 inch at the 
input. The uncertainty is then +0.001%. 

An error in the total analyzer angle, ®, also introduces 
an error in the condition of tangency of the ion beam as 
it enters the analyzer. Equation (4) can be used to 
estimate this error. Half of the error in # is taken to be 
present at the input to the analyzer. The error reflected 
at the input aperture is +0.013 inch or +0.012%. The 
errors are summarized in Table III. 


VII. PRECISION ENERGY DETERMINATION 
OF NUCLEAR REACTIONS 


One of the difficulties encountered in bombarding a 
target with a beam of charged particles is the buildup 
of asurface layer of carbon due to the presence of organic 
contaminants from the vacuum system. To avoid the 
presence of carbon on the target surface, the technique 
employed in the present experiment is to surround the 
target with a surface near liquid nitrogen temperature 
while the target is near room temperature." Since the 
vapor pressure of the contaminants is many orders of 
magnitude less at liquid nitrogen temperature than at 
room temperature, the effect is a rapid exchange of 


4 J. W. Butler and C. R. Gossett, Phys. Rev. 108, 1473 (1957). 
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RESONANCE WIDTHS 


TABLE IV. Summary of energy measurements. 


Reaction energies (kev) 


Li?(p,n) Be? 


Run 
1880.9 


Al?7(p, 7) Si*8 


992.3 
992.3 
992.4 


C3(p,y)N4 


3237.1 
3236.9 
3237.5 


1747.6 


992.4 1747.6 1881.5 


992.4 
992.4+0.5 


1747.6+0.9 


contaminants from the target face to the cold surface. A 
target can be bombarded for several days in this ar- 
rangement and show no effects attributable to carbon 
build-up. The target holder (Fig. 5) has the added 
feature that it is possible either to change targets with- 
out exposing the cold surfaces to air, or to install a 
target without exposing it to air. 

The present experiments have been confined to re- 
actions producing y rays or neutrons. The y rays are 
detected with a NalI:TI scintillator mounted on a 
photomultiplier tube. The neutrons are detected with a 
BF; proportional counter surrounded by a small amount 
of paraffin. A counter of this type is more sensitive to 
threshold or “slow” neutrons (1 to 100 kev) than to 
“fast” neutrons (above 100 kev).!® The electronics is the 
same for both counters. The pulses are amplified and fed 
into an integral pulse-height discriminator, the output 
of which is recorded on a scaler. 

Prior to the start of an experiment using the electro- 
static analyzer, the vacuum cover is removed and the 
interior, including the plates, is cleaned. The gap, d, is 
gauged according to the method described previously, 
the slit zeroes are determined, and the input tangential 
alignment is checked. The target and limiting baffle 
(Fig. 5) are aligned so that the beam must emerge from 
the electrostatic analyzer plates in the proper region, 
from the center to } inch above the center of the plates. 
The cover is then replaced and the system is evacuated. 
After a series of measurements, the gap is again 
measured. 


A. Strong Resonance in the Al*’(p,y)Si®* Reaction 


Herb e al.® have determined the proton bombarding 
energy at a strong resonance in the Al’’(p,y)Si*8 reaction 
to be 993.3+ 1.0 kev. This resonance is known to have a 
very narrow full width at half maximum. Because it is 
strong and narrow, it is an excellent calibration point 
and also provides an indication of the resolution of 
magnetic and electrostatic analyzers. 

A series of observations were made on this resonance 
to determine the operational behavior of the NRL 
electrostatic analyzer. These are the following: 

1. Measurement of absolute energy of bombarding 
protons: Targets of aluminum were prepared by evapo- 


16 Butler, Dunning, and Bondelid, Phys. Rev. 106, 1225 (1957). 


CB(p,n)N% Ni58(p,y)Cu% 


1881.2+0.9 3237.241.6 1424.1+40.7 1843.7+0.9 


Ni8(p,y)Cu® F(pay)O'® F®(pay)O'®  F!9(p,ay)O'8 


1424.1 1843.7 


872.4 
872.4+0.4 


483.6 
483.6+0.3 


340.5 
340.5+0.3 


rating aluminum onto vacuum-heated cleaned disks of 
tantalum. The aluminum was approximately 5 to 10 kev 
thick to a 1-Mev proton beam, a thick target for this 
resonance. The potentiometer reading corresponding to 
the center point of the thick-target yield curve rise is 
used to compute the bombarding energy of the protons, 
Eq. (1). A typical yield curve is shown in Fig. 6. Several 
times during the period covered by the calibration runs, 
changes were made in the resistor stack. Since each 
resistor in the stack was calibrated absolutely, it was 
possible to change resistors and then recalculate the 
divider ratio. By the end of the last run, 8 resistors had 
been changed. The stack ratio was recalibrated by the 
High Voltage Laboratory at the Bureau of Standards. 
This result agreed to within 0.01% of the value based on 
the absolute resistor calibration data. The results of 
several runs on this resonance are presented in Table IV. 
The average of these runs is 992.4+-0.5 kev. This is to be 
compared with the results of Herb et al.® (993.3+1.0 
kev) and Bumiller et al.!® (990.8+0.2 kev). 

2. By taking small intervals over the resonance, it 
appears that short-time stability of the high-voltage 
supply is of the order of 0.002%. 

3. There was a long-time daily drift upward of the 
order of 0.02% in the potentiometer reading corre- 
sponding to the peak of the resonance. On successive 














65950 


METER 


065930 
POTENTI SETTING 


Fic. 6. The Al?’(p,y)Si?* yield curve of the 992.4-kev resonance. 


16 Bumiller, Staub, and Weaver, Helv. Phys. Acta 29, 83 (1956) ; 
Bumiller, Miiller, and Staub, Helv. Phys. Acta 29, 234 (1956). 
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TABLE V. Comparison of present absolute measurements with previous work.* 


Bumillere 


340.5-40.3 


Hunt> 


340.440.4 
483.1-40.5 


NRL 


340.5+0.3 
483.6+0.3 
872.4+0.4 
992.4+0.5 
1424.1+0.7 
1843.7+0.9 
1747.6+0.9 
1881.2+0.9 


Reaction 


F*(p,ary)O"* 
F*(p,ary)O"* 
F(pyary)O"* 
APT (p,)Si?® 
Ni*®*(p,7)Cu® 
Ni*(p,7)Cu® 
C4(p,7)N™ 
Li’ (p,n) Be’ 


871.340.4 
990.8+0.2 


C#(p,n)N¥ 


3237.2+1.6 


© See reference 26. 
! See reference 27. 
« See reference 29. 
b See reference 22. 


* All energies are in kev. 
> See reference 28. 

* See reference 16. 

4 See reference 6. 


mornings the reading was the same. This drift disap- 
peared when the analyzer was operated with the high- 
wattage overhead lights turned off. In connection with 
this drift, other possible thermal effects were investi- 
gated and eliminated. 

4. Changes in slit settings produced a shift in the 
potentiometer readings which substantiated the calcu- 
lations made previously. 

5. Periods of long bombardment (several hours) pro- 
duced no change in the potentiometer reading corre- 
sponding to the peak of the resonance, indicating that 
there was no appreciable buildup of carbon on the face 
of the target. A surface layer corresponding to 50-ev 
change in beam energy could have been observed easily. 

6. Instrument resolution: For a very narrow reso- 
nance, and small contributions from Doppler broadening, 
power supply ripple and regulation, the shape of the 
thick-target yield curve is determined by the distribu- 
tion of energies in the beam out of the electrostatic 
analyzer. The preceding statement will hold true for a 
thin-target yield curve if the thin target has a thickness 
which is much less than the spread in energies out of the 
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Fic. 7. The Li’(p,") Be’ threshold yield curve. 


Herb4 Other investigators 


~~ 340.440.4,° 340.042.0 (rel. to 873.5)! 


873.540.9 872.5+1.8¢ 


993.34+1.0 


1746.9+0.8," 1747.6! 

1881.340.7,) 1881.4+1.1,« 1879.7+1.1* 
1881.2+1.9! 

3236.043.0" 3238.044.0" 


1882.2+1.9 


m See reference 24. 
» See reference 25. 


i See reference 23. 
i See reference 19. 
k See reference 18. 
'See reference 20. 


electrostatic analyzer. The 992.4-kev resonance in the 
Al’ (p,y)Si?* reaction was examined with an input aper- 
ture of 0.060 inch and a thick target, and an input 
aperture of 0.050 inch and a very thin target. Under 
these conditions, the base of the isosceles distribution 
triangle is 0.12% and 0.10% of the nominal energy, re- 
spectively, or for this resonance 1.2 kev and 1.0 kev, 
respectively. The resonance width, the Doppler broaden- 
ing, and the power supply regulation represent a spread 
in energy very much less than this (order of 50 to 100 
volts each). The thin target was estimated to be <400 
ev thick. In both the thick- and the thin-target case, the 
yield curve substantiated the predicted triangular 
distribution. 

7. Linearity: Using thin targets and the mass-2 
proton beam on this and other resonances, the linearity 
of the analyzer has been established to be within 0.02% 
over the energy range from 340 kev to 3 Mev.” 


B. Li’(p,n)Be’ Threshold Bombarding Energy 


This reaction provides a well-known calibration point, 
measured by several investigators.®.18- 

For the present measurement, LiF was evaporated 
onto a tantalum backing. The front face of the target 
was covered with stainless steel mesh to prevent error 
due to charge accumulation on the nonconductive LiF. 
As the bombarding energy is increased, neutrons first 
start to appear when the higher-energy side of the inci- 
dent particle group first reaches threshold. This lower- 
energy portion of the excitation curve will be rounded 
due to target thickness, Doppler shift, and energy 
spread of the incident beam. As the energy increases 
still further, the curve will appear to rise almost 
linearly. An extrapolation of the linear portion then 
provides an approximation to the true threshold.” In 
Fig. 7 is shown the number of neutron counts as a 


17 Bondelid, Butler, and Kennedy, Bull. Am. Phys. Soc. 2, 381 
(1957). 

18 Jones, Douglas, McEllistrem, and Richards, Phys. Rev. 94, 
947 (1954). 

19 W. J. Sturm and V. Johnson, Phys. Rev. 83, 542 (1951). 

2 Shoupp, Jennings, and Jones, Phys. Rev. 76, 502 (1949). 

21 Newson, Williamson, Jones, Gibbons, and Marshak, Phys. 
Rev. 108, 1294 (1957). 
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function of the electrostatic analyzer potentiometer 
setting. The present result for this threshold is summa- 
rized in Table IV. The average is 1.8812+0.009 Mev. 
Previous results®!*- are shown in Table V. The 
weighted average of all results is 1.8811+0.0005 Mev. 

The final error in the bombarding energy includes the 
error in the electrostatic analyzer plus estimated target 
and extrapolation effects.” The input aperture was set 
for an opening of 0.030 inch. 

The resistor stack of the power supply was checked by 
observing the potentiometer settings corresponding to 
this threshold on the tap 2 and tap 3 position. The 
agreement is better than 0.01%. 


C. Strong Resonance in the C'*(p,~)N™ Reaction 


A target of C was prepared by heating a strip of 
molybdenum in an atmosphere of C-enriched (60%) 
CH3I. The resulting target was approximately 5 kev 
thick to a 1.75-Mev proton beam. A strong resonance in 
the region of 1.75 Mev was examined using input aper- 
tures of 0.010 inch and 0.030 inch. Figure 8 shows the 
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Fic. 8. The C- 
(p,vy)N™ yield curve 
at 1.7476 Mev. 
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1.7468 1.7472 1.7476 1.7480 
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1.16050 





y-ray yield as a function of potentiometer setting for an 
input aperture of 0.010 inch. The middle of the rise of 
the yield curve is taken to be the peak of the resonance. 
There may be some doubt about the validity of this 
choice because of the asymmetrical shape of the curve. 
This shape could be caused by nonuniform target com- 
position. It is to be noted that the error this introduces 
in the choice of the resonance peak is of the order of the 
half-width of the resonance, about 0.01%. The average 
proton bombarding energy, as determined from two 
runs on the same target, is 1.7476+0.0009 Mev (see 
Table IV). Marion and Hagedorn” report 1.7469 
+0.0008 Mev relative to the Li’(p,2)Be’ threshold at 
~ #J. B. Marion and F. B. Hagedorn, Phys. Rev. 104, 1028 
(1956). 

23R. M. Williamson and W. Haeberli, quoted by Marion and 
Hagedorn, reference 22. 

*% Richards, Smith, and Browne, Phys. Rev. 80, 524 (1950). 

25 R. A. Chapman and H. Bichsel (unpublished results), quoted 
by J. B. Marion and T. W. Bonner (unpublished report). 

26 A. H. Morrish, Phys. Rev. 76, 1651 (1949). 

27 A. Fowler and C. C. Lauritsen, Phys. Rev. 76, 314 (1949). 

28S. E. Hunt, Proc. Phys. Soc. (London) A65, 982 (1952). 

2K. F. Famularo and G. C. Phillips, Phys. Rev. 91, 1195 
(1953). 
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1.8811+0.0005 Mev. Williamson and Haeberli* give 
1.7476 relative to the same calibration. 


D. C'8(p,n)N" Threshold Bombarding Energy 


A typical neutron yield curve is shown in Fig. 9. 
Table IV summarizes the results of the observations on 
this threshold. The average threshold bombarding 
energy is 3.2372+0.0016 Mev. Richards ef al.** measured 
this threshold relative to the Li’(p,n) Be’ threshold using 
targets of normal carbon which contains 1.1% C. They 
quote 3.236+0.003 when normalized to 1.882 Mev for 
the Li’(p,2)Be’ threshold. Chapman and Bichsel”® re- 
port 3.238+0.004 relative to several reactions. 

An input aperture setting of 0.030 inch was used for 
this measurement. 


E. Two Resonances in the Ni®*'(p,7)Cu®® Reaction 


Two resonances in this reaction were examined. A 
target of Ni** was prepared by electrodeposition onto a 
silver backing. The yield curves are shown in Fig. 10. 





600 an 
500 - Ni*8(P, y)Cu® 


400 


GAMMA-RAY COUNTS 


0.94630 
ee a ee 
0.94600 0.94650 
POTENTIOMETER SETTING 
Sea eS ee ee 
1423 








= 
0.94550 0.94700 


1424 1,425 
PROTON ENERGY (Mev) 


co 2 ay go 








GAMMA-RAY COUNTS 


ee ee ee ee ee ee ee ee 
1.22450 1.22500 1.22550 

POTENTIOMETER SETTING 

bie hee 

1.844 

Y (Me v) 


eS See" 
1.843 


PROTON ENER 


Fic. 10. The Ni*8(p,y)Cu® yield curves at 1.4241 and 1.8437 Mev. 
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Fic. 11. The F*(p,ay)O" yield curves at 340.5, 483.6, 
and 872.4 kev. 


The results (Table IV) are 1.4241+0.0007 Mev and 
1.8437+-0.0009 Mev."4 


F. Three Resonances in the F"’(p,ay)O" Reaction 


Three resonances in this reaction were examined. A 
target of CaF, was prepared by evaporation onto a 
tantalum backing. The yield curves are shown in Fig. 11. 
The widths of the resonances, I’, determined by the 
interquartile separation, are also shown in Fig. 11 and 
summarized in Table VII. A summary of the determi- 
nation of the bombarding energies is in Table IV. They 
are 340.5+0.3, 483.6+0.3, and 872.4+0.4 kev. 
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Table V summarizes all of the absolute calibration 
data and compares them with previous work. 


VIII. MEASUREMENT OF RESONANCE WIDTH 


The shape of a yield curve from a resonance is de- 
termined by the natural shape of the resonance, the 
factors contributing to the spread in the energy of the 
bombarding beam, and the thickness and composition 
of the target. A thin-target yield curve can give a good 
value for the full width at half-maximum of a resonance 
provided that this full width is greater than the thick- 
ness of the target and the factors contributing to energy 
spread of the incident particle beam. When the reso- 
nance width is the order of the thickness of the target, 
the analysis becomes complicated and uncertain due to 
the doubt in the thickness of the target and the effects 
of straggling. The shape of a thick-target yield curve, on 
the other hand, depends only on uniformity of target 
composition, resonance width, and incident particle 
energy spread. In this case, when the resonance width is 
large compared to the incident-beam energy spread it 
can be shown easily that the width of the resonance can 
be found from the interquartile distance on the yield 
curve.” When the resonance width is of the same order 
as the incident beam energy spread, the most satisfying 
method of determining the width of the resonance is to 
compute a family of yield curves with I’ as the parame- 
ter, and comparing with the data. 

Factors which contribute to the energy spread of the 
beam are those which arise from the resolution of the 
analyzer, the modulation of the incident beam due to 
power supply ripple and regulation, and the Doppler 
broadening introduced by the thermal motion of the 
target nuclei. 

To arrive at an expression for the contribution due to 
Doppler broadening, it has been assumed that the 
distribution of component velocities of target nuclei in 
the direction of the incident beam can be represented by 
a probability distribution. Secondly, it has been as- 
sumed that the relationship between mean square 
velocity and most probable velocity is the same in 
solids as it is in gases. Thirdly, the Debye theory has 
been used and it is assumed that the Debye temperature, 
©, is not very different than the target temperature T. 
The distribution of particle energies about the mean 
proton bombarding energy, Ey, due to Doppler broaden- 
ing is represented by the Gaussian 


N 
N(E)dE= nar exp[—(Ep—E)*/2A;°WE, (7) 


(Zr 3 


where 
2m oO \7 
A3= [ear (1 +—)] , 
M 20T? 
m is the mass of the bombarding particle, and M is the 


%” Fowler, Lauritsen, and Lauritsen, Revs. Modern Phys. 20, 236 
(1948). 
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mass of the target nucleus. The principal difference be- 
tween this expression and that found from application 
of the perfect gas law is the assumption that there is an 
equal division between potential and kinetic energy in 
the oscillator, i.e., the target nucleus. 

To simplify the over-all problem, it is assumed that 
the distribution of energy in the beam due to the 
geometry of the analyzer and the factors involving 
regulation and ripple can be represented by a Gaussian. 
With this assumption a new Gaussian representing the 
distribution of energy in the beam is written where the 
A’s are defined as follows: A; describes the instrument 
resolution, A» describes the modulation of the beam due 
to the power supply ripple and regulation, A; is the 
Doppler broadening, and A?=A,?+ A,’+-A;’. The natu- 
ral shape of the resonance (or the cross section curve of 
the resonance) is given by the Breit-Wigner single-level 
expression : 

oy 


o( E,— E)= 


(E,—E)+1/4 


where £, is the resonance energy, (Z,—£) is the devia- 
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Fic. 12. Comparison of experimental data with calculated yield 
curves for various assumed I’. 


tion from resonance energy, and I’ is the full width at 
half-maximum. Assume that an incoming proton pene- 
trates the target a distance x losing energy 6, before 
producing the (p,y) interaction in the slab of target dx. 
The yield for the reaction from this slab of target from 
the segment of beam dE can be written 


y(dE) = N x(dE)nw[E,— (E—6E)], 


where mo is the number of target nuclei per cm? and is 
assumed to be constant. The total yield from the 
element of beam dE at E can be found by integrating 
from E= EF to E=0, or some number small enough so 
that the contribution from the low-energy side of the 
curve is very small. Since E=F at «=0, and E=0 at 


TABLE VI. Parameters used for evaluating Eq. (8) 


Ey Input 
Reaction Mev aperture 
0.010 in, 39 50, 100, 150 
0.010 in, 69 ‘ : 25, 28,8235 
0.030 in, 168 50, 100 
0.030 in. 218 50, 100, 150 


0.9924 
1.7476 
1.4241 
1.8437 


Al??(p,y)Si%8 
C8(p,y)N4 
Ni58(p,y) Cu 
Ni58(p,y)Cu® 
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TaBLE VII. Results of the present measurements of the full width at half-maximum of several (p,7) resonances. Column 3—here [° 
is determined by finding the }-} position on the rise of the thick-target yield curve. Column 4—I is determined by comparison of 


experimental data with theoretical shape. 


Energy r 
Mev Interquartile 


0.3405 2.4+0.3 kev 
0.4836 0.9+0.1 kev 
0.8724 4.5+0.3 kev 
0.9924 
1.7476 
1.4241 
1.8436 


Reaction 


F"*(p,ary)O"* 
F*(p,ary)O"* 
F®(p,ay)O"* 
Al?7(p,y)Si2® 
C4(p,7)N" 
Ni*(p,y)Cu™® 
Ni**(p,y)Cu® 


* See reference 26. 

> See reference 28. 

© See reference 16. 

4 See S. E. Hunt and K. Firth, Phys. Rev. 99, 786 (1955). 
¢See H. H. Staub, Nuovo cimento 6, 306 (1957). 


«=some very large number, 6F can be replaced by kx 
and integration is from zero to infinity, where & is the 
constant energy loss through the target*: 


- ovkdx 
y(dE) = Ne(dt)ne f a . 
» (E—(E—kx) P+12/4 


To find the total yield, integrate the above over the 
energy distribution N g(dE): 


© Ny —(E,—E)* 
Yitot) = mof exn| - | 
0 (2a) tA, 2A? 


= ookdx 
x| f jer (8) 
0 [E,- (E—kx) ?+T? 4 


This equation can be evaluated numerically for the 
proper value of A, obtaining a family of curves for 
various I’’s. By normalizing the experimental results and 
comparing with these curves it is possible to obtain a 
value for I’. Thick-target yield curves are shown in 
Fig. 12 for Al, C, and Ni (p,7) resonances. The solid 
lines are the calculated curves for various I’s; the ex- 
perimental points are indicated. Table VI is a summary 
of the parameters used in evaluating Eq. (8). The 

5t The assumption of a constant energy loss per unit distance, 
equal to k, also involves the assumption that the incremental 
energy loss is very small relative to the width of the resonance so 
that the statistical processes of energy loss theory will apply. For 
a resonance of 100 ev width there may be some doubt in the valid- 
ity of this assumption. 


100+ 50 ev 
75+50 ev 
50+50 ev 
100+50 ev 


Other results and references 

3 +1 kev* 2.9+0.2 kev> 

2.2+0.2 kev» 

4.5+0.2 kev® 
60 +30 ev? 

2.1+0.2 kevé 


5.4+0.3 kev 
100 ev‘ 


f Bender, Shoemaker, Kaufmann, and Bouricius, Phys. Rev. 76, 273 
(1949). 

« J. D. Seagrave, Phys. Rev. 85, 197 (1952). 

b See reference 22. 


widths and the assigned errors, presented in Table VII, 
have been chosen by inspection of the curves.” 
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Levels in Zr’: Experimental 
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The levels in Zr® have been studied by analyzing the radiations 
of Nb® in magnetic and scintillation spectrometers employing 
various coincidence techniques. Multipolarities of most of the 
transitions have been determined from internal conversion co- 
efficients and K-L ratios. A decay scheme (I) for Nb” has been 
proposed which assigns the following excited states in Zr™: 1752 
kev (0+), 2182 kev (2+), 2315 kev (5—), 3081 kev (4+), 3453 
kev (6+), and 3595 kev (8+). Evidence has been discussed for a 
few weak additional transitions potentially involving three addi- 
tional levels (decay scheme IT). Following the suggestion of Ford, 
the levels in decay scheme I have all been interpreted as arising 
from the proton configurations (f1/2)?, (go/2)", and (go/2p1/2). The 


I. INTRODUCTION 


HEREAS many even-A nuclei are known in 
which the low-lying states represent essentially 
pure collective excitations, it is unusually difficult to 
find an even-A nucleus with relatively pure single par- 
ticle states. This difficulty arises because it is unusual 
to find an energy gap in the single particle spectrum, 
both above and below the orbitals which produce the 
single particle states. Only in such cases can we expect 
even the lowest levels to be composed of relatively pure 
configurations. 

A nucleus which fulfills these stringent requirements 
is Zr®. It is a 40-proton and 50-neutron ensemble. Ex- 
perimental evidence! on the levels in Sr®* can be inter- 
preted to indicate that there is 1.85 Mev between the 
approximately degenerate 3/2 and /s2 orbitals and the 
1/2 orbital. The go/2 orbital lies about 0.9 Mev above the 
Piz orbital and then there is a gap of several Mev as a 
result of the completion of a 50 nucleon shell. Therefore, 
for the proton levels in Zr™, there is probably an energy 
gap just below the filled 1/2 orbital and an energy gap 
just above the completely empty go/2 orbital. As origi- 
nally pointed out by Ford,? the only low-energy config- 
urations to be expected from such a situation are the 
(p12)? ground state and (p1/2g9/2) and (gg/2)? configur- 
ations. 

Since these configurations can give rise to 0+, 0+, 
2+, 44+, 6+, 8+, 4—, 5— levels, their population 
requires a decaying nucleus of high spin. Of the two 
possibilities Nb” and Y”, only Nb® has a high spin. 
This is known both from the existence of two meta- 
stable states through which the Mo” 0+ ground state 
must decay to populate the Nb”, and the fact that the 
proton-neutron configuration (41-49) should be gy,2! 
and go! with a corresponding spin of 8 or 9. 


* Fulbright and Guggenheim scholar during the years 1955 and 
1956. Now at Florida State University, Tallahassee, Florida. 

1 Lazar, Eichler, and O’Kelley, Phys. Rev. 101, 727 (1956). 

2K. W. Ford, Phys. Rev. 98, 1516 (1955). 


half-life of the 3595 kev 8+ state has been experimentally deter- 
mined as 3X1077 sec, in good agreement with the half-life ex- 
pected for a 141.5-kev £2 transition between 8+ and 6+ states, 
each involving a (g9/2)* configuration. 

The relative population of the two 0+ states of Zr, both by 
de-excitation of the 2+ state of that nucleus and by the beta 
decay of Y”, indicates that these states result from highly mixed 
(p12)? and (go/2)? configurations. 

Hindrance factors for several transitions indicate that the other 
positive parity states are largely generated from the (go/2) 
configuration. 


Accordingly, Nb” was selected for study. In previous 
studies of Nb” there was a considerable lack of agree- 
ment on the positron end point and the existence of 
certain gamma rays.*-§ The only suggested decay 
schemes®’ contradicted the well-studied Y* decay and 
implied a low spin for Nb”. 

More recently, a revised decay scheme based on 
crystal nuclear spectroscopy has been presented." 
This decay scheme is in agreement with a great ma- 
jority of previous experimental results and with the 
decay of Y”. It involves a high spin (8,9+) for the 
Nb® ground state and confirms the existence of most 
of the states predicted by Ford.’ 

It is the intention of the present paper to reinvesti- 
gate the decay of Nb”, using the more precise measure- 
ments of beta-ray spectroscopy and more complete 
coincidence measurements. Such a reinvestigation is 
justified in view of (1) the large energy difference 
(~6.1 Mev) between the ground state of Nb” and the 
ground state of Zr® suggested in the most recent decay 
scheme studies?" and expected on the basis of empirical 
neutron and proton binding energy data,” (2) the 
potential population of a considerable number of high 
spin states in view of the high spin of Nb” and the 
large energy difference, and (3) the unique properties 
of Zr® outlined above which make it susceptible to a 
straightforward comparison between experiment and 


3G. E. Boyd, Oak Ridge National Laboratory Report ORNL- 
229, 1949 (unpublished), p. 31. 

4D. N. Kundu and M. L. Pool, Phys. Rev. 76, 183 (1949). 

5R. M. Diamond, Phys. Rev. 89, 1149 (1953). 

6 Hok, Kramer, and Meijer, Physica 20, 1200 (1954). 

7H. B. Mathur and E. K. Hyde, Phys. Rev. 98, 79 (1955). 

8 N. H. Lazar and G. D. O’Kelley, Bull. Am. Phys. Soc. Ser. IT, 
1, 163 (1956). 

9 N. H. Lazar and G. D. O’Kelley (private communication from 
G. D. O’Kelley). 

10 R, K. Sheline, Bull. Am. Phys. Soc. Ser. II, 2, 260 (1957). 

4 R, K. Sheline, Physica 23, 923 (1957). 

12 Nuclear Level Schemes, A=40—A=92, compiled by Way, 
King, McGinnis, and van Lieshout, Atomic Energy Commission 
Report TID-5300 (U. S. Government Printing Office, Washing- 
ton, D. C., 1955). 
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Fic. 1. Low-energy internal conversion lines of Nb”. Spectrum 
made with two gaps of the small “orange” spectrometer. Resolu- 
tion ~1%; instrument set to give ~0.8%; difference results from 
source thickness. 


theory. In the present paper, the experimental facts 
are presented with the minimum of discussion consistent 
with this presentation. In the paper which follows a 
more complete interpretation is attempted. 


II. EXPERIMENTAL 
A. Source Preparation 


Isotopically pure sources of Nb” are difficult to 
prepare. By the use of (p,7) reactions on Zr® samples 
enriched to ~97% and 99% it was possible to obtain 
samples which decayed with a 14.6 hr half-life over a 
period greater than 8 half-lives. In order to achieve this 
purity it was necessary not only to have isotopically 
enriched Zr™, but also to wait several hours after 
bombardment before chemically separating the nio- 
bium. Protons of twenty-two and thirty Mev were 
used to give a maximum Nb™® yield, and at these en- 
ergies a considerable amount of Nb® was formed by 
the reaction Zr®(p,2n) Nb®. Fortunately, however, this 
impurity can be very largely removed by the simple 
expedient of waiting, since the half-life of Nb*, 1.9 hr, 
is short in comparison with the Nb” half-life. In spite 
of these precautions, however, and in spite of the purity 
indicated by the half-life measurements, the presence 
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of ~0.2% Nb® and ~0.3% Zr® (the daughter of 
Nb®) was proven by the existence of weak internal 
conversion lines indicative of these isotopes. (See 
Table I.) 

The Zr® was bombarded in the form of the oxide, 
ZrOz, in a hollow stainless steel needle and in an 
aluminum foil. After bombardment, the target was dis- 
solved in a mixture of hydrofluoric and hydrochloric 
acids. The sample was fumed to dryness a number of 
times and then taken up in a minimum amount of a 
solution 9 normal in hydrochloric acid and 1.0 normal 
in hydrofluoric acid. 

The separation of carrier-free Nb” from the target 
material was performed by an anion exchange pro- 
cedure based on the work of Kraus and Moore." This 
procedure involves the use of hydrofluoric acid. Conse- 
quently, columns, pipettes, and beakers of polyethylene 
and crucibles of platinum had to be used. The anion 
exchanger was Dowex 1-10x, 100 mesh. Column di- 
mensions varied from 6 to 10 cm in length and 3 to 5 
mm in diameter. 

The hydrofluoric-hydrochloric acid solution contain- 
ing the target material was placed on top of the column 
and allowed to pass into the resin bed. 

The niobium, forming negative complexes, sticks to 
the resin in a narrow band, whereas the zirconium 
passes through with practically no adsorption. Elution 
with a few column volumes of the same solution elimi- 
nates zirconium quantitatively, without affecting the 
niobium. 

A change of the eluant concentration to 6 normal in 
hydrochloric acid and 0.11 normal in hydrofluoric acid 
makes the niobium appear in the effluent after approxi- 
mately five column volumes. Gamma-ray sources were 
directly produced from the effluent by drying a small 
amount of the solution on a polystyrene backing and 
covering the dried source with a second piece of poly- 
styrene to prevent contamination of equipment. In 
most cases, before making a beta source, the separation 
procedure was repeated using a smaller column (50 ul 
free column volume). The niobium is thereby concen- 
trated by adsorption in the first step and subsequently 
eluted in the second step in a further purified and con- 
centrated form (about 4 drops). Beta-ray sources were 
made by two different techniques. The first produces 
relatively thin sources on a thin backing with, however, 
smaller amounts of total activity. In this method, an 
aliquot from the effluent of the first or second column 
separations is boiled down to dryness. Care must be 
taken not to destroy the chloride and fluoride complexes 
during this procedure. A considerable fraction of the 
activity may then be taken up in absolute alcohol, pre- 
sumably as the halides of niobium. A single drop of this 
alcoholic solution spreads very widely over a piece of 
150-ug/cm? aluminum foil from which a strip, 1 mmX7 
mm, containing a maximum activity may be cut. The 


18K, A. Kraus and G. E. Moore, J. Am. Chem. Soc. 73, 9 (1951). 
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strip of aluminum foil containing the activity is used 
directly in the beta ray spectrometer. A source produced 
in this way is almost invisible, and has an estimated 
source thickness of 50 ug/cm?. The second way to pro- 
duce beta-ray sources has the advantage of concentrat- 
ing a greater amount of activity in the useful source 
area, but has the disadvantage of yielding thicker 
sources. It consists of evaporating the niobium fraction 
of the second column separation (about 4 drops) on a 
polystyrene or platinum backing. A source thickness of 
200-500 yg/cm? resulted. The thickness is believed to 
be due to resin and/or impurities in the reagents. This 
type of source was used for the determination of the 
energies and intensities of the higher energy internal 
conversion lines. 


B. Equipment 


The gamma-ray spectrum and gamma-gamma co- 
incidence studies were made by means of conventional 
scintillation spectrometers, utilizing 131} inch and 
3X3 inch NaI(TI) crystals, Dumont 6292 and K 1197 
photomultiplier tubes, and a 36 channel analyzer of the 
Hutchinson and Scarrot design.'* The continuous posi- 
tron spectra and conversion lines were studied in two 
different six-gap beta-ray spectrometers.!>!6 The second 
spectrometer was essentially a scale-up of the first spec- 
trometer, but had the advantage of operating at higher 
energies and at higher resolution for a given trans- 


4G. W. Hutchinson and G. G. Scarrot, Phil. Mag. 42, 792 
(1951). 

15Q. B. Nielsen and O. Kofoed Hansen, Kgl. Danske Videnskab. 
Selskab, Mat.-fys. Medd. 29, No. 6 (1955). 

16(), B. Nielsen (to be published). 


mission. Figure 1 shows an internal conversion spectrum 
made employing only two gaps of the small beta-ray 
spectrometer, whereas Fig. 2 was made using three gaps 
of the larger spectrometer. The transmission of the 
larger spectrometer under the conditions employed was 
considerably greater and important in detecting the 
weaker lines. 

These six-gap beta-ray spectrometers were also used 
for internal conversion-gamma and positron-gamma co- 
incidences. The coincident spectra from a 1} inchX 1} 
inch Nal(TI) crystal placed immediately behind the 
source were displayed directly on the 36 channel ana- 
lyzer. The high transmission of the six-gap spectrom- 
eters was particularly appropriate for these coincidence 
measurements. This arrangement requires no light 
guide from the crystal to the photomultiplier. Conse- 
quently, the resolution of the crystal and photomulti- 
plier is not impaired. 

In the measurement of the lifetime of the 8+ level, 
the coincidence equipment of the beta-ray spectrometer 
(resolving time, 27 = 10~7 sec) together with a 3.5 X 1077- 
sec delay line were used. 


III. 6-RAY SPECTROSCOPY 
A. Internal Conversion Lines 


The internal conversion spectrum of Nb” consists of 
two low-lying lines of high intensity, a number of rela- 
tively weak lines up to 2200 kev, and an unusually 
strong line at 2315 kev. Two portions of the internal 
conversion spectrum of particular significance are shown 
in Figs. 1 and 2. The separation of the 132.7- and 141.5- 
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TaBLeE I. Conversion electron data for Nb” and impurities. 








Electron Energy sum Energy sum Energy determined Transition en- 
energy this research Hok et al. by external con- ergy* assumed 


Isotope 
Shell (kev) (kev) version in Pt (kev) 


(parent) (kev) 

Nb® 114.7 132.7 132.9 132.7 
Nb® 130.3 132.8 132.9 
Nb® te A cee 133.0 
Nb® 123.5 q 141.5 141.6 141.5 
Nb*® 139.2 L 141.5 141.7 
Nb” see l ee 141.6 
Nb” 353.9 q 371.9 371.9 371.9 
Nb” 369.2 L 371.7 371.4 
Nb” tee ] tee 371.5 
Nb” 978(?) €# 996(?) tee 996(?) 
Nb” 1120 Li 1138 1130 1138 
Nb® 1734 ‘ 1752 1767 1752 
Nb” 1834(?) LI 1852(?) are 1852(?) 
Nb® 1963(?) os 1981 (?) tee 1981 (?) 
Nb” 2119(?) LI) 2137(?) ee 2137 (?) 
Nb® 2164 £1 2182 2182 
Nb” 2297 €F 2315 see 2315 





Nb*® 215. 4 234.7> 5. 236 
Nb*® 233. , 235.4 


Zr 891 LI 908° Y 910 


* The energies are generally believed to be accurate to +0.7%. 
b This line was double. There was a shorter lived component as well as the 90-hr Nb®5. 
© There was an indication this line was double, containing a shorter lived component which may be the internal conversion line corresponding to the 


900-kev gamma line. 


kev lines from each other is sufficient for gamma co- kev were measured with a thin source (~50 yg/cm?*). 
incidence measurements. Several of the other lines were also measured with the 

A previous study of the energies and intensities of thin source. However, the weakest lines could not be 
the conversion lines of Nb® has been published by Hok _ seen with satisfactory statistics until a thicker (200-500 
et al.6 Except for a few weaker lines and the strong line yg/cm*) source containing considerably more activity 
at 2315 kev, the experimental results presented in this was used. For the highest energies (> 1600 kev) a thick 
paper are in agreement with those of Hok et al.® source on a very thick backing was used. Under these 

The energies of the conversion lines of Nb® and circumstances resolution was not much impaired. How- 
impurities present in the sample determined in these ever, external conversion occurs to some extent (see 
experiments and previous work are shown in Table I. Fig. 2). Weak indications (from 2 to 4 times statistics) 
Most of the lines observed have been determined with — of four internal conversion lines corresponding to transi- 
a resolution of about 1.1%. The lines at 132.7 and 141.5 tions of 996, 1852, 1981, and 2137 kev are observed. In 
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TABLE II. Results of events focused in the beta spectrometer-gamma coincidence measurements. 


Events observed in co 
incidence with the 
Events selected in the selected event 
beta spectrometer 


Remarks 


This coincidence is cleanly determined. (See Fig. 4.) 


132.7-kev transition 


The coincidence with the 1850-kev gamma was demonstrated most clearly 
by reducing the 1650-kev sum line by moving the crystal for the gamma 
ray coincident spectra further from the source. (See Fig. 5.) 


141.5-kev transition 511, 1138, 1650, 1850 


The coincidence spectrum was run in the energy range 0-1200 kev, only. 
Apparent coincidences with 142- and 511-kev gammas appeared also, but 
in view of the appearance of a 1140-kev line it is felt that these coincidences 
have not been established. 


371.9-kev transition 


The total intensity of the transition feeding the 1752-kev level is (140.3) 
10-4. (See Fig. 6.) 


1752-kev transition 420 


There was no difference outside of statistics between 450- and 940-kev 
regions of the positron spectra. 

The existence of a weak 511-kev gamma in coincidence with the positron 
spectra may result from pair formation from the 2182-kev gamma. 


450-kev region of continuous 142, 1140, 511 
positron spectrum 
940-kev region of continuous 


positron spectrum 


142, 1140, 511 


and photomultiplier. Since Nb” is a positron emitter, 
there is little difficulty with the continuous background. 
The results of these coincidence studies are shown in 


Table IT. 


view of their uncertainty they are indicated in Table I 
with a question mark in spite of the fact that three of 
the four have been observed in singles and/or coincident 
gamma spectra. A careful search for an internal con- 
version line corresponding to a transition energy of 
~1280 kev (141.5+1138) revealed no evidence for it. 
If it is present it must be less than 1/15th the intensity 
of the 1138-kev internal conversion line. Other features 
of the internal conversion spectra are indicated in the 
footnotes to Table I. 





B. Positron Spectra 


Positron spectra of Nb® showed a group with maxi- 
mum energy 1.49+0.01 Mev. The deviation from a 
straight Fermi plot below 650 kev may indicate a weak 
low-energy component. The uncertainties regarding this 
possible low-energy component are further discussed in 
Sec. V, The Decay Scheme. The Fermi plot for the 
positron spectrum is shown in Fig. 3. 

The experimental points near the maximum energy 
have been corrected for the line width of the spec- 
trometer. There still is a slight deviation from the ex- 
trapolated Fermi plot. Presumably this effect is due to 
scattering in the spectrometer. 
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IV. COINCIDENCE EXPERIMENTS 


A. Gamma Coincidences with Beta Spectrometer 
Focused Events 


Several of the internal conversion lines lend them- 
selves to studies of the gamma coincidences. The 
“orange” type of spectrometer'®!® is especially suited 
for this type of coincidence experiment because of its 
high transmission of the event focused in the beta 
spectrometer and because the gamma-ray coincidence 
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Fic. 4. Gamma spectrum in coincidence with 1327-kev in 


crystal is placed immediately behind the source without 
a light guide. It therefore has a high transmission 
without destroying the intrinsic resolution of crystal 


ternally converted transition. The 1140-kev gamma-ray results 
from incomplete resolution of the 141.5-kev line from the 132.7- 
kev line. 
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Fic. 5. Gamma spectrum in coincidence with the 141.5-kev internally converted transition : 5(a) crystal close to source; 5(b) crystal far 
from source suggests existence of weak ~1850-kev gamma coincidence with the 141.5-kev interns illy converted transition. 


Of particular interest are the gamma coincidences 
with the internal conversion lines representing transi- 
tions of 132.7, 141.5, and 1752 kev. The coincidence of a 
gamma of 2182 kev with the 132.7-kev internally con- 
verted transition is particularly clear, and is shown in 
Fig. 4. Since the 132.7-kev line is not completely re- 
solved from the 141.5-kev line, this gamma-ray co- 
incidence spectrum is contaminated with small in- 
tensities resulting from the gamma coincidences with 
the 141.5-kev line. The gamma coincidences of the 
141.5-kev line are shown in Figs. 5(a) and (b). The 
coincidence with annihilation radiation indicates that 
this line is in coincidence with the main positron 
branch. The apparent coincidence with a gamma ray of 
1650 kev is shown to be a coincidence with a sum line 
(1138+-511 kev) by comparing Figs. 5(a) and 5(b). This 
comparison which involves moving the gamma crystal 
further from the source also seems to establish the 
existence of a fairly weak 1850 kev coincidence which is 
largely obscured by the 1650-kev sum line when the 
gamma crystal is close to the source. The 371.9-kev in- 
ternally converted transition is in coincidence with a 
900-kev gamma, but not with a potential 767-kev 


gamma ray. This fact is important in establishing the 
nature of the 4+ level (see IV, Decay Scheme). 

A particularly interesting coincidence study is shown 
in Fig. 6. The 1752-kev transition is a weak transition. 
However, since it represents a 0— 0 transition (see 
Table IV) it is ~67% internally converted. (The other 
~33% is involved in pair formation.) The only transi- 
tion feeding it is the ~420-kev gamma. Accordingly, 
this very weak cascade is clearly established, as shown 
in Fig. 6. 

Comparison of the gamma coincidences between low- 
energy and high-energy positron spectra do not indi- 
cate significant differences (see Table II). This is un- 
fortunate since differences would have been convincing 
evidence for a weak low-energy positron branch. How- 
ever, failure to observe these differences probably does 
not constitute strong evidence against this potential 
branch. 


B. Lifetime of the State Involving the 
Positron—141.5-kev Cascade 


The presence of attenuated coincidences had been 
noticed previously" between the 511-kev annihilation 
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radiation and the 142-kev gamma transition in gamma- 
gamma coincidence experiments. A coincidence experi- 
ment between the internal conversion line of the 141.5- 
kev transition and other photons was used to measure 
the lifetime of the state involved. The results of the 
measurements are shown in Figs. 7(a) and (b). When 
gamma coincidences with the 141.5-kev internally con- 
verted transition are studied with no delay, a spectrum 
with strong 511- and 1140-kev gamma transitions re- 
sults. When a delay of ~3.5X 1077 sec is placed in the 
gamma branch of the coincidence circuit, the gamma 
spectrum simplifies to a single strong 511-kev gamma. 
If the same delay is placed in the beta spectrometer 
branch of the circuit, so that the 141.5-kev internally 
converted transition is delayed, the resultant gamma 
spectrum shows no coincidence. These measurements 
indicate that the positron proceeds to a delayed state 
with a (3_;+'*) X 1077 sec half-life followed by a 141.5- 
kev transition and then by a 1138-kev transition. 


C. Gamma-Gamma Coincidence Measurements 


Gamma-gamma coincidences were studied in con- 
siderably more detail than in the previous work.'?"' 
The photopeaks of gamma rays studied were selected 
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Fic. 6. Gamma spectrum in coincidence with the 1752-kev in 
ternally converted transition. Intensity of the (420+20)-kev 
gamma transition is (140.3) X 10-4/Nb® disintegration. 
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by means of a single channel analyzer, and the coinci- 
dent gamma spectrum displayed on the screen of the 
36-channel analyzer. Care was taken to eliminate false 
coincidences resulting from Compton background and 
backscatter radiation. All the results obtained in these 
experiments are summarized in Table III. In those 
cases where the statistics were not good or the line 
shape is not normal, a question mark has been placed 
after the coincident event. 

To a considerable extent, the information obtained 
in these measurements only confirms the information, 
summarized in Table II, obtained in gamma-coincidence 
studies with events focused in the beta spectrometer. 
One important piece of additional information is the 
fact that the strong 2315-kev line was not in coincidence 
with any other gamma ray or with annihilation radia- 
tion. This is a strong indication that the transition is 
from a delayed state to the ground state. In addition, 
evidence is obtained in these gamma-gamma coinci- 
dence measurements for a number of weak transitions. 

Often these weak transitions involve poor statistics 
or bad line shape. An example of some of the better 
measurements on these weak transitions is shown in 
Fig. 8. The measurements presented in Fig. 8 seem to 
indicate that an 1850-kev gamma ray is in coincidence 
with both the 142- and 1138-kev gamma rays, whereas 
the 1630-kev gamma is just in coincidence with the 
second member of the cascade, namely the 1138-kev 
gamma ray. The gamma-gamma coincidence measure- 
ments also give weak evidence for a 1960- to 2000-kev 
gamma ray and ~960-kev gamma in coincidence with 
the 1138-kev gamma, but not with the 142-kev gamma. 





Selected event 
(kev) 
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TABLE III. Results of gamma-gamma coincidence experiments. 


Events observed in 
coincidence with 
selected event 


Remarks 








132.7+141.5 


1630 


1850 


511, 1138, 1650, 
1850, 2182 


142, 511, 650, 890, 
1640?, 1960, 2190 


141.5, 511, 650, 
1630, 1860?, 2000 


1140 
1140, 1280 


~960?, 


The relative intensities of the coincidences change as one shifts the gate over 1 the peak. 
The 2182-kev peak is definitely more coincident with the low-energy end of the (132.7 
+141.5)-kev peak. The 1650-kev coincidence is presumably the sum line of annihilation 
radiation and the 1138-kev transition. 

The coincidence with the 890-kev gamma is very clearly observed. The 1640-kev gamma 
coincidence is weak. The 1960-kev gamma coincidence peak is broad, probably indicating 
more than one coincident gamma ray. 


The 650-kev coincidence is presumably the sum line of annihilation radiation and the 
141.5-kev transition. The 960-kev gamma coincidence is very broad. The 1630-kev 
gamma coincidence is definite and cannot be accounted for as the sum line of annihilation 
radiation and the 1138-kev transition. The 1860-kev gamma coincidence is listed as 
questionable because of uncertainties involving intensities in the (2320—511)-kev es- 
cape peak. 


Only coincidences in which E,>1050 kev would have been observed. (See Fig. 8.) 


Only coincidences in which E,>1050 kev would have been observed. Presumably the 


1280-kev coincidence is the sum line of 1140+142 kev, indicating also a coincidence with 


142 kev. (See Fig. 8.) 


1140? 
not very good. 
132.7 


Only coincidence in which E,>1050 kev would have been observed. Statistics were 


The coincident gamma ray peak is definitely at lower energy than the 141.5-kev gamma. 


This strong line (2315 kev) was not in coincidence with any gamma ray. This is one of 
the indications that the transition is from a delayed state to the ground state. 





While the evidence about these weaker transitions is 
in no way conclusive, the evidence on the majority of 


the transitions is quite strong. Among those things 
definitely established by these coincidence measure- 
ments are (1) a cascade involving the main positron 
branch to a 3X10-7-sec delayed state followed by a 
141.5-kev transition and then by a 1138-kev transition, 
(2) a transition of 2315 kev from a delayed state to the 
ground state, (3) a cascade involving a 2182-kev transi- 
tion and a 132.7-kev transition, and (4) a cascade in- 
volving a ~420-kev transition and a very highly con- 
verted 1752-kev transition which, because of the co- 
incidence efficiency, must be a 0 — 0 transition. 


V. THE LEVEL SCHEME 
A. Established Levels 


All data on the stronger transitions and some of the 
weaker transitions are summarized in the decay scheme 
shown in Fig. 9. It should be emphasized that several 
of the weaker transitions are not included in this decay 
scheme, but will be considered separately, because the 
data suggesting their existence and the existence of 
certain additional levels are so much weaker. The decay 
scheme presented in Fig. 9 explains in a very satis- 
factory way the apparent lack of relationship between 
(1) the cascade involving the main positron branch to 
a 3X10~7-sec delayed state followed consecutively by 
141.5- and 1138-kev transitions, (2) a cascade involving 
132.7- and 2182-kev transitions, and (3) a long-delayed 
transition of 2315 kev. There is little doubt that this 
state is identical with the 0.83-sec state at 2320+ 20 kev 





observed by Campbell ef al.” on the basis of inelastically 
scattered neutrons on separated Zr®. Attempts to ob- 
serve this metastable state in the decay of Nb® have 
not been successful.'-"!7 However, this must be ex- 
plained as a failure in the column chemistry resulting 
from the oxidation state of Zr® produced in the decay 
of Nb” which does not behave normally during the 
first several seconds after the decay. 

The 2315-kev transition is shown to be an E5 to the 
ground state. This unambiguously establishes a 5- 
level. If the half-life for such a transition is calculated 
without reference to the statistical factors involved in 
the configurations assumed, a value of 1.21 sec is ob- 
tained. This is in excellent agreement for such a calcula- 
tion with the 0.83-sec half-life measured by Campbell 
et al.’ Therefore, although these experiments have not 
measured the lifetime of the delayed state at 2315 kev, 
it is assigned the half-life 0.83 sec measured by Campbell 
el al." 

Once the delayed state at 2315 kev is established, the 
positron, 141.5, 1138 kev cascade (which is known to 
end at a spin-5 state") determines levels (relative to 
the Zr® ground state) at 6107 kev (ground state of 
Nb”), 3595, 3453, and 2315 kev. With this super- 
structure on which to hang the data, the other two 
cascades fall easily into place. The 132.7-2182 kev 
cascade determines a level at 2182 kev rather than 132.7 
kev for several compelling reasons, among which are 
the competition between the 2315- and 132.7-kev 
gammas and the fact that a 3- level at 132.7 would cer- 


17 Campbell, Peelke, Maienschein, and Stelson, Phys. Rev. 98, 
1172(A) (1955). 
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tainly have been populated in the Y” beta decay. A 
level at 1752 kev previously suggested on theoretical 
grounds by Ford? and confirmed experimentally in the 
decay of Y* by Johnson, Johnson, and Langer'® is 
again confirmed by the 420-1752 kev cascade. The 
371.9-900 kev coincidence suggests a level at 3081 kev 
rather than at 2554 on the basis of the poor competition 
of this cascade with the 1138-kev transition. 
Therefore, there are well-established levels in Zr® at 
1752, 2182, 2315, 3081, 3453, and 3595 kev above the 
Zr™ ground state, and the ground state of Nb® is 
established at 6107 kev above the Zr® ground state. 
The biggest single factor in determining the error in 
these levels is the accuracy of the measurement of the 
2315-kev transition which is believed accurate to 0.5%. 


B. Other Levels 


Whereas the levels just enumerated are well estab- 
lished, there are several other transitions whose validity 
is not so clearly established (see Fig. 2) which are not 
included in this level scheme. Fortunately for the 
method of presentation, these transitions all involve 
three additional levels. It is therefore possible to con- 
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18 Johnson, Johnson, and Langer, Phys. Rev. 98, 1517 (1955). 
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Fic. 9. Levels in Zr® (decay scheme I). The intensities of the 
transitions are given as numbers close to the transition. They are 
total gamma and internal conversion intensities given as the frac- 
tion of all Nb” disintegrations. Beta-decay transitions are given 
in percent. 


struct an additional decay scheme as shown in Fig. 10. 
This decay scheme summarizes all the additional in- 
formation not included in the decay scheme shown in 
Fig. 9. 

It should be emphasized that most of the information 
indicated in Fig. 10 is considerably weaker and, there- 
fore, this part of the decay scheme must be considered 
preliminary and tentative. The evidence for a 1850-, 
141.5-, 1138-kev cascade and the weak evidence for a 
1990-, 1138-kev cascade suggest a level at 5440 kev 
populated only by the orbital electron capture of Nb” 
(see Fig. 8). The evidence for a level at 5080 kev is 
based on 1630-kev coincidence with 1138-, but not with 
141.5-kev radiation. This level should be populated 
principally by Nb” orbital electron capture. 

The main evidence for the existence of the tentative 
level at 4450 kev is the weak low-energy positron group 
with a maximum energy of 650+ 20 kev. Although weak, 
the intensity of this positron group would infer a con- 
siderably greater orbital electron capture branch. Yet 
the only possible transitions from the tentative 4450- 
kev state are much too weak in intensity to account for 
such a positron and orbital electron capture branch. 
The evidence on the 996- and 2137-kev transitions from 
the tentative 4450-kev state to the 3453- and 2315-kev 
states ts extremely weak (see Fig. 2). For these reasons 
the tentative level at 4450 kev and transitions from it 
are indicated as dotted lines in the partial decay scheme 
of Fig. 10. 
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Fic. 10. Three additional tentative levels in Zr® (decay scheme 


II). This is a partial level scheme only. Note that the 8+, 6+, 
and 5— levels are present in the level scheme presented in Fig. 9. 


In addition to these levels, two additional levels have 
been suggested by Day” on the basis of preliminary 
inelastic neutron scattering experiments. These levels 
are at approximately 3200 kev, decaying only to the 
ground state, and at 2760 kev decaying to the 2315- and 
2182-kev states. No evidence for these states has been 
found in the present experiments involving the decay 
of Nb®. However, in view of the very different mode 
of population in the experiments of Day, it would not 
be surprising to find different states populated. 


C. Branching Ratios, Transition Probabilities 
and Lifetimes 


The total intensity of each transition is included on 
the decay scheme shown in Fig. 9 as a number close to 
the multipolarity of the transition. The numbers are 
given relative to Nb” disintegration as 1.000. The total 
intensity is the sum of the gamma intensities previously 
determined" and the internal conversion electron in- 
tensity when it is significant. The actual experimental 
numbers are used in all cases. In some cases this may 
represent a known inaccuracy. For example, the in- 
tensity listed for the 371.9-kev £2 is the experimentally 
measured gamma intensity 0.028 per Nb” disintegra- 
tion. Actually the internal conversion line measurement 
is considerably more accurate than the gamma measure- 
ment for this transition. If one uses this intensity to- 
gether with the internal conversion coefficient for an £2 
of this energy, one obtains a gamma intensity of 0.016 
per Nb® disintegration. 

Of particular interest are the competitions between 
the 371.9- and 1138-kev transitions, the 132.7- and 
2315-kev transitions, and the 2182- and ~420-kev 
transitions. Also of considerable interest is the limit 


9 R. B. Day (private communication, March, 1957). 
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which can be placed on the potential 1280-141.5 kev 
competition. Previously, a limit of 0.02 1280-kev- 
transitions per Nb” disintegration was given." This 
estimate was based on a difficult gamma subtraction 
procedure. Now, however, the limit of 1/15th the 1138 
conversion line indicates (assuming that the radiation 
would be £3) a limit of less than 0.019 1280-kev- 
transitions per Nb” disintegration. These competitions 
will be discussed in greater detail in Sec. VII of this 
paper and the paper which follows. 

The ratio of pair formation to internal conversion for 
the 0-0 1752-kev transition has been calculated to 
be 1 to 3 by Greenberg and Deutsch” using the theory 
of Thomas.”! It has been experimentally determined as 
4.4 » +86 in agreement with the theory, by comparison 
of the pair formation measurements of Greenberg and 
Deutsch” and the internal conversion measurements of 
Yuasa et al. 

The Y” decay scheme has been included in Fig. 9 
because it is so intimately involved in any discussion 
of the levels in Zr®. The beta branching ratio to the 
two Zr” 0+ states is taken from the measurements of 
Greenberg and Deutsch” and Yuasa et al.”*.> The value 
for the logft for the Y® beta transition to the Zr” 
ground state includes the factor suggested by David- 
son* for unique first forbidden beta decays. 

Half-lives are given on the decay scheme for the 
3595-, 2315-, and 1752-kev states. The half-life for the 
3595-kev state is that of these measurements and needs 
no further explanation. The half-life of the 2315-kev 
state is that of Campbell et al.” 

There has been a considerable disagreement about 
the half-life of the 1752-kev state. Originally, Deutsch”® 
proposed a 5.9X 10~*-sec half-life based on his measure- 
ments. More recently preliminary measurements of Day 
and Kloepper*® indicate a half-life of 6.4X10-® sec. 
Still more recently, Alburger®’ has experimentally deter- 
mined a half-life of 6X 10-8 sec, in good agreement with 
Day and Kloepper, but in disagreement with Deutsch. 
Because of this agreement the half-life used in this 
decay scheme is that of Day and Kloepper in spite of 
its preliminary nature. 


VI. MULTIPOLE ORDER OF TRANSITIONS, 
SPIN AND PARITY ASSIGNMENTS 


A. Internal Conversion Coefficients 


Using the relative intensities of K or KLM conver- 
sion lines presented in Table IV together with the 
previously determined relative intensities for the gamma 

*0 J. Greenberg and M. Deutsch, Phys. Rev. 98, 1517 (1955). 

21 P. Thomas, Phys. Rev. 58, 714 (1940). 

” Yuasa, Laberrique-Frolow, and Feuvrais, Compt. rend. 242, 
2129 (1956). 

*T. Yuasa and J. Laberrique-Frolow, J. phys. radium 17, 558 
(1956). 

*4 J. P. Davidson, Jr., Phys. Rev. 82, 48 (1951). 

2° M. Deutsch, Nuclear Phys. 3, 83 (1957). 

26 R. Day and R. Kloepper (private communication). 

*7 1), Alburger (private communication to A. Bohr), 
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TasLe IV. K and KLM conversion coefficients and multipolarity assignments. 





Experimental 
Ror BLM Theoretical conversion coefficients* 
conversion : ‘ ; , 
coefficients Ei E2 E3 M1 M2 

4.2X10? 3.2107 

3.5X10 2.7K107 

2.6X107% 9.61073 

2.1%10 5.510% 4 

9910-5 2.2X10-* 3.9 K10-* 2.3K10 4.6K10-! 7.7K10-* £0 

91X10 14xK10* 2.5 X10*% 1.5X10% 3.3K10°* 45X10“% £2 

E3 E4 E5 M3 M4 M5 
3.9X10-! 6.41074 1.0 K10°* 7.7X10 1.5<X10-° 2.0X10°> £0 
2.1X10-* 3.0K10- 5.0 X10 4.1X10 60X10 89X10 £5 


Relative 

intensity 

of gamma 
lines 


Relative intensity 
of K or KLM 
conversion 
lines 


Assigned 
multi- 
polarity 


Transition 
energy 
(kev) 


132.7 44 ~6.5 

141.5 100 100 
0.084 3.7 
0.104(KLM)4 130 
0.027(KLM)4 <2.6 
0.009(KLM)4 18.4 


3.6 10° 2 
xo E2 
8.2X10 (E£2)¢ 
2.5X107% EI 


 1aSXIP 82x07 64X10 
1.4 X10° 7.010? 
3.3 X107 6.010 


1.1 X10 5.6X10 





2.310 
1.310% 
>2.8X 103 

1.3X10- 


>2.8X 10 
5.5X 10 


0.027(KLM)4 <2.6 
0.226(KLM)4 110 


® The conversion coefficients are those of M. E. Rose, reference 28. 

b The 141.5-kev transition is assumed to be pure £2 with a theoretical conversion coefficient of 2.7 X10~'. Both the K-L ratio and internal conversion 
coefficient determined by a comparison of the internal conversion line and the positron spectrum corroborate the E2 assignment. All other experimental 
conversion coefficients are determined relative to the 141.5-kev line. 

¢ The experimental value, 6-274 1073, for the conversion coefficient of the 371.9-kev transition is within the experimental error either an M1 or an E2. 
The tentative £2 assignment is made on the basis of the relative population of other states in the decay scheme. (See text.) 

4 All theoretical conversion coefficients for transitions of energy >1000-kev listed in the table have been increased by 14%, since they are being compared 


to experimentally determined ax ,;,4 rather than ax. 


intensities,’ it is possible to determine relative con- 
version coefficients for most of the transitions in the 
Nb” decay. The conversion of these relative values into 
absolute conversion coefficients may be accomplished if 
the multipolarity of any of the transitions is determined 
by an independent method. 

In the present experiment, the multipolarity of the 
141.5-kev line was determined by two independent 
methods. The K-Z ratio for this line indicates that it is 
an £2 transition (see Table V). The absolute conver- 
sion coefficient was determined for the 141.5-kev transi- 
tion by taking the ratio of the intensity of the K con- 
version line of the 141.5-kev transition to the intensity 
of the positron spectrum augmented by the theoretical 
orbital electron capture branch minus the intensity of 
the K conversion line of the 141.5 kev. The value for 
the conversion coefficient so obtained, 0.26, agrees very 
well with the theoretical value for an £2 of this energy, 
0.27. This argument assumes that there is no other 
positron or electron capture which ultimately feeds the 
141.5-kev transition. Actually, there is evidence that 
other positron and electron capture branches do feed 
the 141.5-kev transition. However, this feeding is at 
most a few percent and cannot affect the conversion 
coefficient by more than a few percent. 

Accordingly, the 141.5-kev £2 transition is chosen as 
a reference line. It is assigned the theoretical A-con- 
version coefficient, ax=0.27, and the conversion co- 
efficients of all other lines are determined relative to it. 
The K-conversion line of the 141.5-kev transition is 
particularly appropriate for comparisons, not only be- 
cause its multipolarity is independently determined, 
but also because it is an intense line and occurs at high 
enough energy so that there is little tailing. Accordingly, 
there is little ambiguity in its intensity. 

Table IV gives the experimental conversion coeffi- 
cients calculated in this way. The theoretical conversion 
coefficients for £1, #2, £3, M1, M2, and M3 and, in 


some cases, of £3, £4, E5, M3, M4, and M5 radiations 
are included for comparison. 

All conversion coefficients are those of Rose.** The 
K-conversion coefficients corrected for finite size effect 
recently calculated by Sliv and Band* differ little for 
Zr™ from those given by Rose. 

In general, there is little ambiguity in the multi- 
polarity assignment presented in the last column. A 
particular exception to this statement is the #2 assign- 
ment for the 371.9-kev transition. The experimental 
value of ax, (6-s+4)xX10-%, is actually closer to the 
theoretical value for an M1, namely 6.0X 107%, than to 
the value for an #2, 9.6X10-*. However, since the 
371.9-kev radiation originates from a 6+ state and is 
followed immediately by a 900-kev radiation to the 
2182-kev 2+ state rather than the potential 767-kev 
radiation to 2315-kev 5-state, it is much more plausible 
to use the maximum multipole order for the 371.9-kev 
radiation, namely £2. Unfortunately, the K-L ratio 
does not give an unambiguous answer for this transi- 
tion. No attempt could be made to determine the 
multipolarity of 900-kev radiation, because it was con- 
taminated with the 913-kev radiation of Zr®. 

Since a 371.9-kev £2 radiation competes with it, and 
it therefore must be unusually slow, it is tempting to 
believe that there ought to be an M2 admixture in the 
1138 £1 radiation. However, it is possible to calculate 
the half-life of a 371.9-kev transition between the 
assumed configurations (g9/2)64” and (gg/2)4,” (see Sec. 
VIIC). This half-life, combined with the experimen- 
tally determined branching ratio, can in turn be used 
to estimate the half-life of the 1138-kev transition to be 
~1X10-" sec. This estimate can then be used to set 
an upper limit of ~7% M2. However, this possible M2 
should be hindered (see Sec. VII B). Hindrance factors 


28M. E. Rose, in Beta- and Gamma-ray Spectroscopy, edited by 
K. Siegbahn (North-Holland Publishing Company, Amsterdam, 
1955), and privately circulated tables. 

*” ||. Sliv and S, Band (privately circulated tables). 
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are no less than 7 and usually considerably more (see 
the paper which follows) in each of four cases of this 
type. Accordingly, one is led to the conclusion that there 
can be very little M2 admixture. 

The experimental KLM conversion coefficient for the 
1138-kev transition, (2.14+0.4)10~*, would indicate 
<3% M2 admixture. 


B. K-L Ratios 


The spectrometers used had sufficient resolution so 
that it was possible to resolve the K and Z lines of the 
three lowest energy lines with sufficient accuracy to 
determine K-L ratios. The internal conversion lines for 
the 132.7- and 141.5-kev transitions are shown in Fig. 1. 
The line shape for the leading edge of all internal con- 
version lines in this complex is taken from the leading 
edge of the K conversion line of the 132.7-kev transi- 
tion, whereas the shape of the lines at energy greater 
than the assigned energy is taken from the Z conversion 
line of the 141.5-kev transition. Using the usual sub- 
traction procedure and the line shape of resulting lines 
as a check, it is possible to arrive at intensities for all 
the four lines in the complex without too large an error. 

The results are shown in Table V. Whereas the multi- 
polarity assignment from the K-L ratio is unambiguous 
for the 132.7- and 141.5-kev transition, as was men- 
tioned previously, this is not the case for the 371.9-kev 
transition. 


C. Spin and Parity Assignments 


The 1752-kev EO, 2182-kev E2, and 2315-kev E5 
transitions unambiguously assign the spin-parity of the 
1752-kev level as 0+, the 2182-kev level as 2+, and 
the 2315-kev level as 5—. The previously measured 
anisotropy of the 141.5-1138 kev cascade" deter- 
mined the spin values 8, 6, and 5 for the 3595-, 3453-, 
and 2315-kev levels. These spin assignments are 
strengthened and the parities determined by the de- 
termination of the multipolarity of the 141.5-kev radia- 
tion as £2 and the 1138-kev radiation as E1. Accord- 
ingly, the 3595- and 3453-kev levels are assigned spin- 
parities of 8+ and 6+, respectively. The 371.9-kev E2 
radiation strongly suggests that the 3081-kev state has 
spin-parity 4+. This assignment is supported by the 
theoretical calculation of the position of the 4+ level 


TABLE V. K-L ratios. 


Transition Theoreti- Multi- 
energy Intensity® Intensity* Experimental cal> K-L polarity 
(kev) of K line of L line K-L ratio ratio assumed 
132.7 44 15 2.9+0.3 2.95 E3 
141.5 100 17 5.9+0.4 6.23 E2 
371.9 0.084 0.0082 10 +3.5 9.23 E2 
—2.0 


* All intensities are based on the intensity of the K line of the 141.5-kev 
transition which is arbitrarily assigned the intensity 100. 

> The K-L ratios were determined from conversion coefficients given by 
M. E. Rose, reference 28. 
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in the paper which follows this one. The ground state 
of Nb® is assigned the spin parity 8+ or 9+. A justifi- 
cation of this assignment is given in the discussion. 

In view of the tentative nature of the three addi- 
tional levels in Fig. 10, no detailed attempt will be 
made to justify spin and parity assignments, although 
in some cases assignments appear possible. It is suffi- 
cient to say that all three levels must have high spin. 


VII. DISCUSSION 
A. Discussion of the Levels 


The decay scheme presented in Fig. 9 satisfactorily 
explains the experimental data not only of these meas- 
urements but also of previous measurements, with 
certain minor exceptions in which the previous experi- 
mental data have been shown to be in error. It differs 
radically from the Nb” decay schemes of Hok et al.,® 
and Mathur and Hyde,’ which, while essentially in 
agreement with each other, are in serious disagreement 
with the well studied decay scheme of Y®.'*-3! The 
low spin levels at 1130 and 1270 kev in Zr ® suggested 
by Hok ef al. should be populated in the decay of Y” 
in preference to the unique first forbidden ground state 
transition, yet less than 0.0001%” gamma rays are 
observed. 

In addition to the rather compelling experimental 
facts presented in this paper, there are a number of 
additional reasons for believing in the present decay 
scheme rather than that presented by Hok and by 
Mathur and Hyde. Their proposed energy difference 
(3650 kev) between the Nb” and Zr” ground states is 
considerably less than that suggested from the combined 
nuclear data.* Both the shell model (see below) and 
the population of the metastable states in Nb” in the 
decay of Mo® strongly suggest a high-spin ground 
state for Nb” rather than a low-spin state. Finally, 
systematics of first excited 2+ states™ indicate that the 
first excited 2+ should be considerably higher than 
1130 kev. 


B. Configurations Involved in the Levels 


As was pointed out in the introduction, we expect 
the low-lying states of Zr® to be determined by the 
proton configurations (1/2), (gs2)”, and (p1/2g9/2). 0+, 
0+, 2+, 5—, 4+, 6+, and 8+ states have been ob- 
served. The only other state to which these configura- 
tions can give rise is the (g9/21/2) 4— state, which should 
not be seen because its population is strongly forbidden 
by beta and gamma selection rules. 


% Braden, Slack, and Shull, Phys. Rev. 75, 1964 (1949). 

31 Laslett, Jensen, and Paskin, Phys. Rev. 79, 412 (1950). 

% Saraf, Varma, and Mandeville, Phys. Rev. 98, 1206(A) (1955). 

%3 Nuclear Level Schemes, A=40—A=92, compiled by Way, 
King, McGinnis, and van Lieshout, Atomic Energy Commission 
Report TID-5300 (U. S. Government Printing Office, Washing- 
ton, D. C., 1955), p. 187. 

% Alder, Bohr, Huus, Mottelson, and Winther, Revs. Modern 
Phys. 28, 432 (1956). 
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The energy relationship of these states is also grati- 
fying. This will be dealt with in considerable detail in 
the paper which follows. It is sufficient here to indicate 
very briefly the qualitative argument first suggested by 
Ford.’ Since the go/z orbital lies approximately 900 kev 
above the 1/2 orbital, before interparticle interaction 
the (go/2P1j2) and (go/2)? configuration should lie at 
~900 and ~ 1800 kev, respectively. After interparticle 
interaction and the consequent splitting of the con- 
figurations into the states enumerated above, it is not 
unreasonable to believe that the more vigorous inter- 
particle interaction of (g9/2)? 0+ and 2+ states would 
lower them below the (g9/2p1/2) 5— state. Therefore, 
not only the energy relationship of the configurations, 
but also the energy sequence of the 0+, 2+, 4+, 6+, 
and 8+ states and the relationship of this sequence to 
the 5— state are interpreted in a straightforward, un- 
ambiguous way. 

Furthermore, the more vigorous interparticle inter- 
action of the (go/2)? 0+ state may lower it into the 
vicinity of the (1/2)? O+ state whose interparticle 
interaction should be less. Such a situation would mean 
that the two 0+ states would be similar, each involving 
both the (gz)? and (12)? configurations, and that 
they would split apart energywise. 

If the energy of the sequence 2+, 4+, 6+, 8+ is 
plotted linearly against spin, a smooth curve results. 
Since neither of the two 0+ states falls on this curve, 
it seems improbable that either of the two 0+ states 
belongs to the pure (gg/2)? configuration which is almost 
totally responsible for the 2+, 4+, 6+, and 8+ states. 

A qualitative comparison of the relative population 
of the first two 0+ states by the beta decay of the 
(p1/2d5/2) 2— Y™ ground state indicates that both 0+ 
states must have a considerable (f1/2)? component. On 
the other hand, a comparison of the gamma branching 
from the (g9/2)? 2+ state to the two 0+ states indicates 
that they each have a considerable (g9/2)” component. 
Consequently, there is strong experimental evidence 
that the Zr ground and first excited 0+ states each 
involve strongly mixed (1/2)? and (gg/2)? configurations. 

The experimental evidence for the purity of the 2+, 
4+, 6+, and 8+ states is quite striking. This evidence 
is in the form of branching or potential branching from 
a given state to two states. The transition to one of the 
states is allowed by all selection rules, but the transition 
to the other state, while allowed by selection rules based 
on the spins and parities of the states, is forbidden for 
that multipole by the special selection rules imposed by 
the configuration. 

The competitions are between the following transi- 
tions: (1) 141.5 and 1280 kev, (2) 371.9 and 1138 kev, 
(3) 899 and 786 kev, and (4) 2315 and 132.7 kev, where 
the second member should be £3, £1, £1, and £3, re- 
spectively, by selection rules based on the spins and 
parities of the states, but M4 in each case by the selec- 
tion rules imposed by the configurations. Actually, the 
1280- and 786-kev transitions are not seen, whereas 


IN Z2retr EXPERIMENTAL 


1625 





considerably hindered 1138- and 132.7-kev E1 and £3, 
respectively, are observed. Since, with selection rules 
based on spins and parities only, the 1280- and 786-kev 
transitions should be more intense than the 141.5- and 
899-kev transitions, there is evidence for considerable 
hindrance in all four cases. This is qualitative evidence 
for the purity of the configurations in the 2+, 4+, 6+, 
and 8+ states. Quantitative estimates of the amount 
of impurities and of the mixing of the configurations in 
the two 0+ states are contained in the paper following 
this one. 

The two additional states at approximately 3200 and 
2760 kev observed by Day demand some additional 
attention. The state at 3200 kev might represent an 
excitation of the core (Sr**). The probable collective 
nature of such an excitation might then explain the 
exclusive depopulation of the 3200-kev state to the 
ground state.” 

It is also tempting to suggest that the 2760-kev state 
is the missing 4— state. However, the fact! that it 
decays both to the 2315-kev 5— state and to the 2182- 
kev 2+ state argues somewhat against this suggestion, 
since it is to be expected that the 4— state would con- 
siderably prefer depopulation to the 5— state over de- 
population to the 2+ state. A possible alternative 
interpretation of the 2760-kev state is that it is a 
[ (ps/2)"g9/2] 3— state. To a first approximation, such 
a state should be (p1/2— ps2) + (g9/2— P12) or 1850+ 900 
= 2750 kev above the ground state. The excellent agree- 
ment is at least partially fortuitous, even if the con- 
figurational assignment is correct. 


C. Lifetimes of the States 


Three states have experimentally measured _half- 
lives, namely the 3595-kev state with a half-life of 
(3_1.07!-*) K 10? sec, the 2315-kev state with a 0.83-sec 
half-life, and the 1752-kev state with a 6.4X10~-*-sec 
half-life. The half-lives for the first two of these states 
are calculated in the paper following this one. Conse- 
quently, these calculated half-lives are quoted here 
only for comparison. The half-life of the 3595-kev state 
is calculated as 4.27X10~7 sec, in complete agreement 
with experiment. The half-life of the 2315-kev state, 
without considering the involved configurations but 
only the multipole order, is 1.21 sec. When the con- 
figurations are included in the half-life calculation, the 
value 1.65 sec is obtained. This is still good agreement 
for this type of calculation. 

The calculation of the half-life of the 1752-kev state 
is considerably more complex, since the transition in- 
volved is a 0 > 0 transition. Accordingly, it is necessary 
to evaluate nuclear matrix elements. This has been 
done for Zr, both using square well®® and harmonic 
oscillator" potentials. The results of the calculations are 
in complete agreement, giving a half-life of the order of 
2X10~ sec, depending on the choice of the strength of 


ALS. Reiner, Physica 23, 338 (1957). 
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the contact particle interaction in the case of the square 
well potential and the choice of A/wM in the harmonic 
oscillator case. However, they are shorter by a factor 
of 30 than experimentally determined half-lives. At 
present, this is the most serious disagreement between 
theory and experiment in the study of the Zr™ nucleus. 
It is conceivable that this serious disagreement may 
result from the different polarizing effects of go/2 and 
p12 particles on the core of the nucleus.** 


D. Positron Decay of Nb*’ 


The positron decay of Nb” represents a transition 
from the ground state in Nb® whose spin and parity 
must be inferred from experimental and theoretical 
considerations to a state in Zr® which is largely (go/2)? 
8+. The log/t (6.0) indicates that the transition must 
either be allowed or first forbidden. Since the major 
configuration in 41 proton—49 neutron Nb” must cer- 
tainly be (go/2'go/2~'), the parity of the ground state of 
Nb” must be positive. Therefore, we are strongly led 
to believe it is an allowed transition. Furthermore, 
since the positron decay goes very largely to the 8+ 
state in Zr™ (less than 0.1% to the 6+ state), it is only 
reasonable to conclude that the spin-parity of Nb” is 
8+ or 9+. Consequently, one suggests that the major 
configurations in the positron decay of Nb” are: Nb” 
(ground state) : 


[a (Sr7®,p, 2*go/2')v(Sr™®,p1/2"g0 2°) Js, 945 
and Zr” (3595-kev state) : 
[4 (Sr7®,1/2°ga/2") v(Sr™®, pi2"go 2) Jey. 


It is then reasonable to expect a high value of the log ft 
for a positron decay which is allowed by normal beta 
selection rules, but would be strictly forbidden if the 
states involved were absolutely pure because the transi- 
tion involves the simultaneous excitation of two protons 
together with the positron decay. 

One might test the validity of the configurational 
assignments in the Nb® decay by comparison with 
nearby decays. An excellent example of the same 
forbiddenness can be found in the decay of Zr into an 
excited state of Y®. The ground state of Y® has the 
measured spin 1/2. The excited state into which the 
Zr® decays depopulates in turn by a 913 kev M4. This 
determines the spin (9/2) and probable configuration 
of this state. The ground state of Zr®, on the other 
hand, has a high spin as evidenced by its decay into the 


36 Note added in proof. R. B. Day has pointed out an arithmetic 
error in reference 11. The transition probability (W) for an EO 
internal conversion is given by W=Qp*. p=0.107 and Q=2.1 
X 10° sec for the 0+ — 0+ transition in Zr®. This results after 
were corrections for ex, 1, “~ pairs to a half-life of ~2X 10-8 
sec which can be compared with the experimental value of 6.4 
X 10~* sec. Thus the last serious discrepancy between experiment 
and theory for the Zr® nucleus is thereby resolved. 
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9/2 Y* excited state. The shell model together with the 
M4 transition involving the ground state and 4.4-min 
isomeric state of Zr® is further evidence for the con- 
figuration of the Zr™ ground state. On the basis of this 
experimental and theoretical information, it is possible 
to construct the configurations involved in the Zr® 
positron decay which follow: 

Zr® (ground state) : 


[a (Sr7®, p12?go/2)v (St™®, pr/27g0/2") Joy2+5 
and Y® (913-kev state) : 
[a (Sr7®,p1y2°goa!)v(Sr”®,p1j2"go/2") Jo/24- 


When the configurations involved in the Zr® and Nb” 
positron decays are compared, it is obvious that the 
transitions are analogous. Furthermore, the experi- 
mentally determined log ft value for the Zr® decay is 
6.1, very close to the value 6.0 observed in Nb”. 

On the other hand, if one has a simple positron transi- 
tion involving the transformation of a go/z proton into a 
9/2 neutron, as is the case in the Mo” positron decay, 
the log ft is 4.5. Therefore, there must be impurities in 
the configuration(s) of the Nb® ground state and/or 
Zr® 8+ state. The extent of the impurities should be of 
the order of 1/30th the major configuration. No reason- 
able configuration for the Zr® 8+ state will allow a 
single particle transition from the major configuration 
in Nb®. However, if configurations of the type 


[a (Sr, p1/2°go/2*) v(Sr7®,p1/2°¢9/2") Js, 94 


were present in Nb” to the extent of ~3%, the transi- 
tion could take place with reduced probability, implying 
a logft of ~ 6.0. 
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Levels in Zr°®’: Theoretical 
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An attempt is made to describe the seven levels of Zr® below 3.6 Mev in terms of the proton configurations 
(2pi/2)?, (2pis2 1go/2), and (1g9/2)*. The level positions and the compositions of the two 0+ states are deter- 
mined for Gaussian and Yukawa forces of various ranges and exchange characters. The experimental data 
are well reproduced for a reasonable choice of the force parameters, the best fit being obtained with a Serber 
exchange mixture and a range of about 1.5 fermis. The experimental values of the half-lives of the excited 
states can also be reconciled with these simple configurational assignments. The most serious discrepancy is 
in the half-life of the first excited (0+-) state, which we calculate to be 1.35 1078 sec, as compared to the 
observed value of (6.0+1.5) 10-8 sec. The remaining discrepancies in the energies and half-lives are in the 
direction of the effects produced by a slight deformation of the Sr** core. 


1. INTRODUCTION 


ORD! has suggested that the low-lying levels of 
4oZr50” can be understood in terms of configurations 
obtained by distributing two protons in the 24/2 and 
1go/2 shells. It is certainly reasonable to deal with proton 
configurations alone, since the 50 neutrons form a very 
stable closed shell whose excitations are unlikely to 
affect levels below about 5 Mev. Moreover, although 
the single-particle proton levels in this vicinity are not 
known with certainty, a scheme such as that shown in 
Fig. 1 seems reasonable.” We need only note that the 
2p1/2-1g9/2 spacing is probably a good deal smaller 
than the 1fs/2-2p1/2 or 23/2-2p1/2 spacings. Hence one 
might expect that levels involving excitations of the 
1fs2 and 25/2 closed shells lie higher in the Zr® spec- 
trum than those arising from the (21/2), (2p1/2,1g9/2), 
and (19/2)? configurations. In addition, Ford was able 
to show qualitatively that the positions of the levels 
known at that time (0+, 0+, 2+,5—) could easily be 
understood in terms of this picture. 

Since then, much more data on the excited levels of 
Zr™ have become available. In particular, 4+, 6+, and 
8+ levels have been found, and one is tempted to 
regard these as the remaining levels of the (1g9/2)? 
configuration. Accordingly, we have thought it worth 
while to do a detailed spectroscopic calculation based 
on the levels arising from these lowest three configura- 
tions, using finite-range, spin-dependent, central forces. 
An analysis of the observed transition probabilities 
provides a further check on our configurational assign- 
ments. 


* C.E.R.N. Fellow 1957, on leave of absence from The Institute 
for Theoretical Physics, University of Amsterdam, Amsterdam, 
Holland. Present address: The Weizmann Institute of Science, 
Rehovoth, Israel. 

t Member of the Institute of Theoretical Physics, 1955-1958: 
Present address: Chemistry Department, Florida State Uni- 
versity, Tallahassee, Florida. 

1K. W. Ford, Phys. Rev. 98, 1516 (1955). 

2 See, for example, the spectrum of Y* given by Strominger, 
Hollander, and Seaborg, Revs. Modern Phys. 30, 585 (1958). 


2. EXPERIMENTAL FACTS RELATING TO 
THE Zr SPECTRUM 


The experimentally determined level scheme is shown 
in Fig. 9 of the preceding paper,* where the attribution 
of spins and parities is discussed. It is important for 
our purpose to note that we have every energy level 
expected from the (2f1/2)*, (2p1/2 1go/2), and (1gy/2)? 
configurations, with the exception of the (21/2 1go/2) 4 
level.4 However, it is reasonable that such a level would 
be populated very weakly in the processes that have 
so far been used to excite the higher levels of Zr™. 

We must expect a strong interaction between the 
(2p1/2)"0 and (1g9/2)*o levels, and hence the two ob- 
served 0+ levels will be mixtures of these. We will now 
estimate the compositions of these mixtures from the 
relative populations of the observed 0+ levels, both by 
B- decay of Y® and by vy decay of the Zr” 2+ level. 
These compositions are experimental data, independent 
of the positions of the levels, which can be compared 
to the result of a spectroscopic calculation. 

Figure 2 shows the levels involved in this decay, 
together with the configurations that can reasonably 











9% 
Q9 Mev 
—— 
Py, 
Fic. 1, Single-par- 
ticle proton levels in 
the vicinity of Z= 40. 40 
; PROTONS 
~v15S Mev 
-----}----- ty, 
re Py, 





3 Bjgrnholm, Nielsen, and Sheline, preceding paper [ Phys. Rev. 
115, 1613 (1959) ]. 

4The subscript outside the parentheses is the total angular 
momentum to which the enclosed angular momenta are vector 
coupled. 
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be expected to contribute to them. The ground state 
of Y® is (21/2 2ds/2)2-. Transitions connecting two 
“minor” configurations, i.e., configurations which have 
no particles in either the 21/2 or 2d5;2 orbits, should 
have a very small effect on the total 8--decay process. 
The dominant contribution is from the transition 
(2pi/2 2d5/2)2- —> (2p1/2)*0, while the remaining possi- 
bility, (2p1/2 2d5/2)2-—> (2d5/2)"0, would perhaps have 
an effect intermediate, but still small compared to the 
transition between the major configurations. Hence, 
the comparative strengths of the @ transitions to the 
two 0+ levels of Zr® are a measure of the relative con- 
tributions of (21/2)0,? to them. 

In accordance with this picture, we write the wave 
functions of the 0+ ground and first excited states 
(hereafter referred to as 0+ and 0’+, respectively) as 
follows: 


b 
vom VL (2 pr): Volt ———¥_ (1 g9/2)°o ], 


(a2+52)! a?+b?)! 


a 
vo+= VL(2pr2)? lt poems VL (1g 2)°o ]. 


a? +6)! 


The ratio of the two transition probabilities is then 


9 


P Y bl ctlienttn 


Jvtea 2ds5, /2y 2 p12) 2- WgPordr 


Pras? Y"°+Zr?"0’4 


| if V*L (2dsy2,2p1)2)2-Waborad 


2” f,(2250) 
Be fx(498) ’ 


where 6 is the 8-decay operator, and the f,’s are Fermi 
functions for a unique first-forbidden transition. 

Yuasa ef al.’ and Greenberg and Deutsch® have 
determined the number of internal conversion electrons 
and the number of pairs resulting from the population 
of the 0+ level in Zr® as (1.6+0.6)10~ and (0.36 
+0.09) X 10~ per Y® disintegration. These values imply 
that the 0+ level is populated with a probability 


® Yuasa, Laberrigue-Frolow, and Feuvrais, Compt. rend. 242, 
2129 (1956). 


6 J. Greenberg and M. Deutsch, Phys. Rev. 98, 1517 (1955). 
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(1.96+0.61) X10~ times that of the 0+ ground state. 
The rather large uncertainty here is due to that involved 
in the determination of the internal conversion proba- 
bility. The theoretical value of exzu/Pairs=3 can be 
inferred from the calculations of Thomas.’ Taken to- 
gether with the experimental pair probability deter- 
mined by Greenberg and Deutsch, this leads to a value 
of (1.44+0.36) X 10~ for the relative population of the 
0+ level. These populations imply that the ratio in (2) 
equals (5.1_;.2+?*)10* and (6.9_1.+**) 10%, respec- 
tively. 

Using the tables of Feenberg and Trigg,* and making 
the corrections appropriate to unique first-forbidden 
transitions,’ we calculate 


f1(2250)/f1(498) 


The ratio a?/b?, as calculated from (2) is 1.26_0,30*°-5” 
or 1.70_o,33+°-*8, depending on whether the experimental 
or theoretical internal conversion pegency is used. 
This corresponds to te gs = (44,77) )7 in 
the former case, and a?= (63_5+7)%, b?= (3745- Oy % in 
the latter. 

An independent estimate of the compositions of the 
two 0+ levels is afforded by the y decay of the 2+ 
level. Here we have 


=4.05X 10~%. 


J vt rab 
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2182—0 \°§ 
(ety 
2182—1752 
= (b? ‘a?) K 3400. 


Although the very weak 430-kev y transition between 
the 2+ and 0’+ levels is seen, there is considerable 
uncertainty regarding the observed intensity. Therefore 
we estimate this by subtracting the probability of the 
0’+ — 0+ transition from that of the 2+ —-0+ 
transition. The former probability is calculated from 
the well-determined 1734-kev conversion (intensity 
5.5X10-* per disintegration) and the theoretical pair 
probability of Thomas.’ The latter probability is 
measured directly. 

The validity of this procedure depends on the 
assumption that the 0’+ level is populated entirely by 
transitions from the 2+ level. Certainly we can treat 
as insignificant contributions to the 0’+ level from the 
levels at 3081 kev (4+), 3452 kev (6+), and 3595 kev 
(8+). The 5-level should populate the 0’+ level with 
a frequency of about (563/2315)"X0.84=1.5X 1077 
transitions per disintegration. This is also insignificant 

7 P. Thomas, Phys. Rev. 58, 714 (1940). 

*E. Feenberg and G. Trigg, Revs. Modern Phys. 22, 399 


(1950). 
9 J. Davidson, Phys. Rev. 82, 48 (1951). 
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in comparison with the population from the 2+ 
level. We calculate (b?/a?)3400= (1.4++0.2) K 10-!/ 
(7.34+0.7) X10-° leading to values of 6? and a? of 
(36_5+*)% and (64-6+*)%, respectively. This excellent 
agreement with the estimate based on the 6 decay of 
Y® gives us confidence in the validity of our picture. 
The two estimates are independent, not only with 
respect to the experimental quantities involved, but 
also in that one measures the (2/2)? contribution to 
each 0+ level, the other the (19/2)? contribution. 

Our conclusion is that the composition of the Zr™ 
ground state is (635)% (2p1/2)? and (37+5)% (1g¢/2)?. 
These percentages are interchanged for the 1752-kev 
0+ level. 


3. SPECTROSCOPIC CALCULATION 


As stated in the Introduction, we calculate the levels 
of Zr® using a model of two protons in the (2f1/2)’, 
(1go/2)?, and (21/2 1g9/2) configurations. We need, how- 
ever, further assumptions about the single-particle 
radial wave functions and the two-particle interaction. 

We follow here the usual practice of taking the single- 
particle wave functions to be those of a particle moving 
in a spherically symmetrical harmonic oscillator field, 


(4) 


We can estimate w by identifying the expectation value 


Hg. ».= p?/2m+ (mw?/2)r°. 


of r?, averaged over the 90 nucleons in Zr™, with 2R?, 
where R is the nuclear radius. Thus 
((N+3)) whw/2= }mw*(r?) = }mw? (2 R?) 

= }mw* (2) (1.3K 90!X 10-3 cm)?, 


ha~8 Mev, mw/h=a’?~1.9XK 10" cm-?. 


(5) 


The radial functions we need are 


Ug (r) = eae (ar)* expl—3(ar)*], 


(6) 
, 1 } 4 
Uo,(r)=3*a 651)! 
where (2n+1)!!=1XK3X5xX---(2n+1). 
The two-particle interactions we investigate 
the form 
Vie= {—0.605+ 7107} V (riz) +e?/rie, 


(ar) $—(ar)?] exp[—4(ar)?], 


are of 


(7) 
with 
e (ri2/p) 
V(r) =Ve— 


(ri2/p) 


(Yukawa), (8) 
(9) 


Here Q% and Q* are projection operators which select 
the singlet and triplet parts, respectively, of the 


V (712) = Vo exp(—ri22/20?) (Gauss). 


pre: 
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2-proton wave functions. The Rosenfeld exchange 
mixture uses r=}. 

The matrix elements we need can then be calculated 
using the expression 


(jije; J | Vie! pr’ ja’; J) 
= —0.6( (115) i1 (led) 2! (dite) 1 (33) 0) s 
X ((A'3) in! (L2'3)i2"| (Li'Le’) 1 (33) 0) 4 
Xd. (1A EU (LR 12! ; Ly'12) (R1y00| 1,0) 
X (kl2'00| 120) +7 1 (L149) (24) 2! (hile) 1 (34)1) 0 
X ( (113 ar’ (dad) ie" | (Le) 134) 1) 
Xd. (—1)F FFU (RL! ; Lye) 


X (k11/00| 1,0) (Rl2'00| 1,0). (10) 


The 7-7 to L-S transformation coefficients are given by 
Flowers,’ or may also be obtained from the convenient 
table given by Racah." The radial integrals F* were 
calculated, using the method of Talmi,!? as linear 
combinations of the simpler integrals, /;, viz. 


P= dict, (11) 


where 
Voat't3 72\) w 

t= —_(-) f exp —}a°r? |r?42V (r)dr. (12) 
(21+-1) !!Nar 0 


The c;* for the (1g9/2)? and (21/2)? configurations were 
taken from the tabulation by Thieberger. For the 
(21/2 1go/2) configuration and the ((21/2)’o| Viz! (1g9/2)o) 
matrix element, we need in addition the expressions 
given in the Appendix. Explicit formulas for the /; are: 


Coulomb interaction: 
938 /2\3 (21)! 
= —(-) a—— — Mev; 
137\ae/7 [(21+1)!!) 
Gauss interaction: 


2 Wd 
h= v|—] Mev, A=a0; 
1+22 


Yukawa interaction: 


(14) 


2 } el 2) 
L,= v=) 2)! Hhoy41(k) Mev, k= (ap). (15) 


us K 


To calculate a spectrum, we must choose an exchange 
mixture (7), an interaction shape (Gauss or Yukawa), a 
range (A or x), and strength (Vo). The levels (19/2)*2, 4, 6,8 


10 B. H. Flowers, Proc. Roy. Soc. (London) A215, 398 (1952). 
!G. Racah, Physica 16, 651 (1950). 

21. Talmi, Helv. Phys. Acta 25, 185 (1952). 

16 R, Thieberger, Nuclear Phys. 2, 533 (1956-57). 
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Fic. 3. Experimental 
and calculated spectra. 
Solid lines: calculated 
levels as functions of tri- 
plet strength 7. Dashed 
lines: experimental lev- 
els (no 4-level has been 
seen). Numbers above r 
axes: calculated (2)1/2)"o 
contributions to ground 
state, the experimental 
value is (634+5)%. 
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are simply given by ((1gy/2)?y/ Vie! (1gs/2)2y), the levels (2p1/2 1g9/2)4,5 by ((2p1 21go/2) 7! Vie! (2p12 1g9/2) 4) — 9.91. 


For the two 0+ levels, we must diagonalize the matrix 


ce 2) "9 V 12 (1go 2) 9) 
((2p1/2)"o V 12 (1go 2)"0) 


where we have taken the single-particle 21/2-1g9/2 
spacing to be 0.91 Mev (see the discussion in Sec. 1). 
In this way, we have calculated spectra corresponding 
to r=—2, —}, 0, 3, 3, the values of \ and « shown in 
Table I, and for a range of strengths Vo for each choice 
of r and X or x. Figure 3 shows some of the spectra 
obtained with the Gaussian interaction. The results for 
the Yukawa interaction are very similar, except that 
the fit with experiment is slightly poorer. We have also 
indicated in Fig. 3 the calculated compositions of the 
two 0+ states. 


4. EFFECT OF MODIFICATIONS OF THE FORCE 


We will now consider the way the positions of the 
calculated levels depend upon the various assumptions 
described in the previous section. To this end we note 
that the expectation values of our interactions in all 
the states except the 0+ and 0/+ are of the form [see 
Eqs. (10), (11), and (12) ] 


TABLE I. Ranges used in the calculation. 
Yukawa: (p/r)— exp[ —r/p] 
p (cm) 


Gauss: exp[ — (r?/2¢?) ] 
@ (cm) K 


0.3 5X10 
5X 10-8 


1.5 


((2p1/2)?0| Vie! (1gs/2)"0) | 
((2pr/2)?o| Viz! (2pr2)"o)— 1.82] 


(16) 


E;s= » w [—-0.6a,L5,( 1g0/)2 + raiLT,( 1go/2)? 7] Wi, 
l 


=Vo> [—0.6a,S,(1gs/2)2s J+ 7aiLT, (1g9/2)? 7] 
l 


q?2lt3 2 ; va 
%—— ( ) f exp(—}a°r?) V (r)r?"dr 
(214-1) !!N 4 0 


= v,| -00 f V (Pdr >Y aS, (1g9/2)? 7 | 


l 


q2lt3 ar} l 
nearer () exp(—}a°r?) r+? 
(214-1) !!\xr 


+f V har] aLT,(1gs/2)? 7] 


gilts a} 
x— (-) exp(— }a°r?)r?+? | 
(21+1)!!\x 


We can therefore identify the functions 


a4 
PsJ(r\= (-) >» aiLS,(1g9/2)? 7 | 
l 


9]. 
Q2lt3 


x , — exp( _— Say)?! + as 


21-+1)!! 
aiLT,(1g0/2)? 7] 
l 


q?@t3 


<———— exp(— 3a*r*)r 


(21+1)!! 


2v\3 
pr=(-) 
T 


2142 
’ 
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with the probability distributions of relative distance 
in the singlet and triplet parts of the two-particle 
state Yy. Moreover, the symmetry" between position 
and linear momentum in the harmonic oscillator Hamil- 
tonian (4) implies that these functions also give the 
probability distribution of relative momentum, pro- 
vided that we interpret a@ as (mwh)~?. They are plotted 
in Figs. 4 and 5. Using the estimate (5) of Aw=8 Mev, 
we see that our wave functions contain appreciable 
components with relative energies up to 100 Mev. 

The probability distributions for the two 0+ states 
involve an assumption about their compositions in 
terms of (2f1/2)"o and (1gy/2)"o states. Figures 4 and 5 
use the mixtures 


Yor = 0.89 (2p1/2)?o ]—0.6Y[ (1g0/2)"o ] 


(ground state), 


Yo as 0.6¥[. (2p1/2)70 ]+0.8Y[ (1g¢, 2)*o | 


(first excited state). 


(19a) 


(19b) 


Compositions roughly equal to these were obtained for 
any of the force parameters we investigated. Here we 
have 


v3 
P3(r)= (-) > [0.64a:LS ; (2p1/2)?o ] 
T l 
+0.36a,.5; (1g9 2)o | 
—0.96a,.S ; (2p; 12)?9— ( 1g 2)*o] | 
q2tt3 
Wee 
(21+-1)!! 
Ps" (r) -( ) > [0.36ai.S ; (2p1/2)?o |} 
T l 


exp(— 3a?r?)r2442 


+0.64a,[.5; (1g9/2)0 ] 
+0.96a,[.S; (2p1/2) — (1g9/2)?o]] 


q2lt3s 
<—— exp(—}a’r?)r?+2_ (18) 
(2/+-1)!! 
with corresponding expressions for the triplet com- 
ponents. 

We have also plotted in Figs. 4 and 5 the radial 
dependence of the three Gauss potential we have used. 
It is seen that, in each case, the two protons spend only 
a small fraction of their time within the range of the 
assumed forces, and that the strength of interaction in 
any state J depends only on the behavior of Ps7(r) and 
Pr’ (r) for small values of r. Similar remarks apply to 
the three Yukawa potentials. 

These curves enable us to understand the response of 
the energy levels to changes in the force parameters. 
From Fig. 5 we see that the two protons in the triplet 
parts of the 0’+ and 2+ levels spend more time within 


14 We are indebted to Mr. D. Bés for this remark. 
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(b) 
Fic. 4. Solid lines: probability distributions of relative distance 
and momentum implied by the assumed configurations and har- 


monic oscillator radial functions. Dashed lines: the three Gaussian 
interactions used in the calculation. 


the range of a short-range force than they do in the 
triplet part of the lowest 0+ level. We must therefore 
expect the 0’+ and 2+ states to be more responsive 
than the lowest 0+ level to changes in the triplet force 
strength. Since the lowest 0+ level is kept fixed at 
0 Mev in Fig. 3, this causes the 0’+ and 2+ levels to 
rise as the triplet force becomes more repulsive, and so 
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Fic. 5. The low (ri2,p12) region of Fig. 4(b). 


the corresponding curves in Fig. 3 should have positive 
slopes. On the other hand, the two protons in the 
triplet parts of the 5—, 4+, 6+, and 8+ levels spend 
less time close together than they do in the lowest 0+ 
level, so they should be less responsive to changes in 
the triplet force strength, which means they should 
have negative slopes in Fig. 3. Both these conclusions 
are in accord with the actual movement of the levels. 
Moreover, as the range of the force increases, and the 
values of Pr’(r) for larger r become important, we see 
that the sensitivity of the ground state to changes in 
the triplet forces decreases with respect to that of the 
other levels. This leads us to expect the slopes of the 
curves in Fig. 6 to become more positive as we increase 
the force range. This should be especially true for the 
4— and 8+ levels, since Pr*-(r) and Pr%(r) increase 
rapidly in the region of r available to the Gauss inter- 
action of longest range. This feature is also exhibited by 
Fig. 3. 

We can also discuss the movement of the calculated 
levels in Fig. 3 for varying force range and fixed ex- 
change mixture. For r=0, the principal effect of 
increasing the range is to spread the J/=2+, 4+, 6+, 
8+ levels and to diminish the (0+)-(2+) spacing. We 
see here the transition from a level scheme of the 
characteristic short-range “‘seniority” type to one of the 
long-range “rotational” type. 

In the “region of best fit” of Fig. 3, the remaining 
disagreement is greatest for the 2+, 5—, and 0’+ 
levels. It is perhaps suggestive that collective effects 
would depress our calculated 2+ level, and so improve 
the fit with experiment. 
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The calculations with a Yukawa radial dependence 
give essentially the same spectra and dependence on 
range and exchange mixture. Therefore we do not give 
here the actual numerical results, but only remark that 
we obtained best agreement with the observed levels 
using x=0.9 and Vo=42 Mev. Elliott and Flowers® 
used x=0.87 for their calculation of levels in the nuclei 
around O'*, and also found that Vo~40 Mev gave best 
agreement with observed levels. 


5. THE (0’+)-(0+) LIFETIMES 


Zr™ is one of the very few nuclei with a first-excited 
0+ state. This state (which we have designated 0’+) 
can only decay by electron (or pair) emission. The 
transition probability for such a (monopole) transition 
can be written 


W go = B(Z)F(Z,E,R)|M |2. (20) 


Here B(Z)F(Z,E,R) is proportional to the square of 
the electronic transition matrix element, of which 
F(Z,E,R) is a factor common to all models chosen to 
represent the nuclear charge density. B(Z) contains the 
specific properties of these models, and has been given 
elsewhere.'* | M|?* is the absolute square of the nuclear 
transition matrix element of the monopole operator, 
which is defined as 


M= > protons r*[1 —a(r?/R)+::- ]. 


For all but the heaviest elements, only the first term 
is of importance. 

The wave functions of the two 0+ states are given 
by (1), and permit a calculation of M once the param- 
eters a, 6 are known. One easily finds!” 


(21) 


M= 


a? 


ab 
pee (gore) "ol rat (1g9/2)?o) 
—((2py2)*o|r1?+12| (2prj2)o)] (22) 


Lr?) 199/2—(r*) 2x2 J. 


2ab 
a?+}? 


Equation (22) displays clearly the mechanism of the 
transition as a radial oscillation of charge between two 
“orbits.” Using harmonic oscillator radial functions we 


have 
11 9 
[(r?)199/2—(r?)2p1/2 |= (—- ) 
z 2 


mo 


where 
1 3 
p=———_- -= 0.146, 
(N+3)m 5 


according to (5). The ratio 2ab/(a°?+6?) is rather 


15 J. P. Elliott and B. H. Flowers, Proc. Roy. Soc. (London) 
A229, 536 (1955). 

16 A. S. Reiner, Physica 23, 33 (1957). 

17 We are indebted to Dr. B. R. Mottelson for suggesting this 
method. 
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insensitive to a and 6, providing neither is too small. 
Using the experimental values for a and b (see Sec. 2) 
and the values of B(Z) and F(Z,W,R) (the latter for 
K-conversion) appropriate to the transition, one finds 
71/2= 1.80 10-8 sec. The K/L ratio is about 9.5, with 
negligible M-conversion. The value exiu/Pairs=3 cal- 
culated by Thomas then leads to a total half-life of the 
0+ level of approximately 1.3510-* sec. This must 
be compared with the empirical value of (6.01.5) 
X 10-8 sec. Thus, the calculated matrix element is too 
large by about a factor two. 

We next consider the extent to which the above 
calculation depends on the use of harmonic oscillator 
radial functions. To this end we have calculated p of 
(23) using various central potentials, each of which 
implies different radial wave functions. We summarize 
here the results. 


(a) Infinite potential well: p=0.18. 
(b) Finite well with depth D= —40 Mev: p=0.18. 
(c) Finite well with spin-orbit force of strength &. 


D and é were adapted to the binding energy of the 21/2 
nucleon and the observed single-particle splitting 
1go/2-2p1/2. This yields D= —38.4 Mev, £=0.62 Mev, 
and p=0.17. 

(d) Infinite potential well, but all possible admixtures 
to 0+ and 0’+ taken into account using a contact 
interaction (strength 1.2 10-** Mev cm*) in first order 
perturbation theory: p=0.28. 

All models (a)—(d) only worsen the agreement be- 
tween theory and experiment, which seems to indicate 
that the remaining discrepancy is not associated with 
our special choice of radial wave functions. We have 
here the most serious failure of our simple configura- 
tional picture. We are investigating the possibility of 
the influence on this transition of center-of-mass effects. 


6. OTHER LIFETIMES 


We now consider whether the other known transitions 
between the levels of Zr® can be reconciled with our 
description of these levels in terms of the configurations 
(1goy2)", (2py2)*, (2pry2 19/2). 

The probabilities per unit time of an electric 2-pole 
transition of energy ww is given by!® 


4.4(L+1) 


T(EL) = 


3 2 
[(20+1) oy et 


he 2L+1 
x( =—) ReES 10" sec, (25) 
sr 197 Mev 

18S. Moszkowski, in Beta- and Gamma-Ray Spectroscopy, edited 
by K. Siegbahn (North-Holland Publishing Company, Amster- 
dam, 1955), p. 373. 
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where R is the nuclear radius in fermis (1 fermi=10-" 
cm). The statistical factor S is defined by 
4r | ‘ 


S= pa se f OF (J;,My)¥u'*O(J;,M,)aQ| , 
QJ i +1 MiMy M 
(26) 


where O(/;,M;) and @(J;,M;) are the initial and final 
wave functions and /dQ implies integration over angles 
and summation over spins. For the transition 


[ (lj) (1 7’) Ja — to? © _ Cj) ("7") Ix, 
S can be calculated to be 


S= (2L41)(Lj’04| j”4)°U (Li"Ji3 j’ Kk, 


(27) 


(28) 


where k= 1, 2, or 4 depending on whether there are 2, 3, 
or 4 identical orbitals participating in the transition 
(27). We also need S for the transition 


[Uj (Uj) Jy — na al)? +B(U'9")*x. 


In this case we find: 


(29) 


S=2(2L+1)( 103 j8)*| aU (II; 7R) 


a, 
- -) U (L717; 8)| : 


— (30) 
2j+1 


+(-19( 


Using (25) and (28), we calculate the half-life of a 
(1go/2)?s+— 141.5? — (1go/2)?e; transition to be 9.7 1077 
sec. This is to be compared with the experimental half- 
life of (3.0_;.o+!-5)10-7 sec for the 8+ level. If we 
interpret this enhancement as due to collective effects, 
we may say that the deformation of the core effectively 
increases the charge on each proton outside the core by 


_a factor of about 1.8. Elliott and Flowers!® found that 


the effective proton charge for #2 transitions in O" and 
F8 is 1.5, whereas the work of True and Ford” indicates 
that the corresponding value for Pb** is 2.15. Hence 
the effective charge seems to increase systematically 
with nuclear size. Similarly, for a 2315-kev transition 
between the states (1go/22p12)s_ and [0.8(2p1/2)%o 
—0.6(1g9/2)?o ], (25) and (29) give a half-life of 3.7 sec, 
compared to the measured half-life of the 5— level of 
(0.83 sec. The calculated half-life is here too sensitive to 
the assumed nuclear radius to enable us to conclude 
anything about an effective proton charge for E5 
transitions. 

Furthermore, the fact that no (less than 0.02 per 
disintegration) (8+)—1280%*—> (5—) transition com- 
petes with the (8+)—~—141.5¥?— (6+) transition is 
easily understood in terms of our picture. To connect 
the configurations (1gg/2)?s; and (1gi/2 2p1/2)s-, an M4 
transition would be required, with a decrease in proba- 
bility by a factor of about 5X 10~°. However, a contri- 


19W. W. True and K. W. Ford, Phys. Rev. 109, 1675 (1958). 
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bution of (1g9/2 1g7/2)s, to the 8+ level would permit a 
1280-kev 3 transition. The half-life for (1g9/2 1g7/2)s; 
— s280"8 — (1go/2 2pr2)s— is given by (25) and (28) as 
3.1 10~* sec, as compared to 9.7 10~7 sec for (1g9/2)?s+ 
— 341.5"? — (1go/2)"6,. The fact that the latter transition 
is observed to be at least 50 times stronger than the 
former indicates that the contribution of (1gg9/2 1g7/2) 8+ 
to the observed 8+ level is less than 1%. This is a 
confirmation of the extreme j-j coupling model, whose 
breakdown would allow mixtures such as (1g9/2)*y and 
(1goye 1g7/2) J. 

Of course we have not ruled out the possibility of 
appreciable contributions to both the 8+ and 5— levels 
of configurations such as [ (1f6/2)*7(2p12)0?(1go/2)?y" Js, 
or [(1fs/2)°sy2(2p1/2)"o 1go/2 ls-, i-e., core excitations. 
Since the statistical factors for such configurations are 
not appreciably smaller than those given by (28) and 
(30), we would then have to interpret the absence of an 
(8+ )—j289%8 — (5—) transition as due to accidental 
phase cancellations. 

Turning to the 6+ level, we must recognize that the 
observed (6+)—1133"!— (5—) transition contradicts 
our description of these (1go/2)"6, and 
(2p1y2 1g0/2)s-. But we also observe that 2.8% of the 
depopulation of the 6+ level occurs by means of the 
transition (6+ )— 371.9"? (4+). The existence of such 
a competition indicates that the 1138-kev 1 transition 
is very strongly retarded. In fact, (25) and (28) yield 
for (1gg/2)"6;— 71.9"? — (1go2)244 a half-life of 2.4 
xX 10-* sec, whereas a (6+)— 1133”! (5—) transition 
should have a half-life of about 10~'® sec, assuming a 
statistical factor of unity. Hence the observed ratio of 
about 40 for the two transitions indicates that the £1 
transition is enormously retarded, and that very small 
admixtures of other configurations in the 6+ and 5— 
levels would account for the observed /1 transition. 
We cannot make this argument much more quantitative 
because E1 transitions are frequently found to be 
hindered relative to the half-life given by (25) and (28). 
We can, however, conclude that the very strong 
hindrance we have here is due either to the almost 
complete absence from the 6+ and 5— levels of con- 
figurations which would allow an £1 transition, or to 
phase cancellations of the type that must be invoked if 
we wish to explain the absence of (8+) — 1280%* — (5—) 
in a similar way. 

An analogous, but weaker, argument shows that the 
dominance of the (4+ -)— 900"? — (2+) transition over 
the too-weak-to-be-observed (4+) —76"! — (5—) tran- 
sition implies a severe retardation of the latter, and is 


levels as 


consistent with our assumptions about the compositions 
of these levels. We might further expect from (25) 
that the transitions (5—)—j3.7"%— (2+) and (5—) 
—2315”° — (0+) would compete on equal terms, where- 
as the latter is observed to be stronger by a factor of 
seven. 

We see, therefore, that transitions which are incon- 
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sistent with our simple configurational assignments are 
all retarded compared with those that are not. As men- 
tioned above, ‘this does not prove the “purity” of these 
configurations because of the possibility that these 
retardations are due to interference effects. However, 
it seems unlikely that the phase relations of the com- 
ponents of the levels would always be such as to produce 
such large cancellations, and therefore there appears to 
be good reason to believe that the contributions of 
other configurations to the low-lying levels of Zr® are 
not large. 


7. DISCUSSION 


The previous sections show that one can come quite 
far in understanding the positions and compositions of 
the low-lying levels of Zr, and the transitions between 
them, in terms of the lowest configurations and simple 
two-particle interactions. However, there remain some 
real discrepancies between our calculations and experi- 
ment: 


(1) The observed #2 and £5 transitions are enhanced 
by a factor of about 3 or 4 over our calculated values. 

(2) The observed 2+ level is higher, and the observed 
5— level is lower, than our calculated positions by about 
200 or 300 kev. 

(3) The observed 0+ — 0+ monopole transition is 
retarded by a factor of 4 compared to our calculated 
transition probability. 


The discrepancies in the £2 and E5 lifetimes and the 
2+ level position can be interpreted as manifestations 
of the deformability of the Sr** core. The monopole 
transition may be influenced by collective motion of a 
different type, i.e., center-of-mass effects. 

The interactions which best reproduce the experi- 
mental level spacings are of Serber exchange character, 
or have a weakly attractive triplet component. The 
best “standard deviation” for the Gaussian radial 
dependence is about 1.7 fermis (depth 47 Mev), with a 
shorter range, 1.2 fermis (depth 79 Mev), being only 
slightly poorer. These latter parameters are similar to 
those used by Redlich” in his calculation on nuclei near 
O'* (1.06 fermis, 70.8 Mev). However, the force used by 
True and Ford near Pb”® (1.3 fermis, 32.5 Mev) is 
considerably weaker. Our goodness of fit deteriorates 
rapidly if we increase the force range beyond about 
2 fermis. 
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APPENDIX 


The Slater integrals for the (21/2 1gs/2) configura- 
tion are 


Pape = ff ues ruutlrd Mornin, 


GH 2pAg)= ff usplraduap(rde( raul) 


X f*(nire)ri?redridre. 
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F°(2p,1g) = (1/3840) (3857 o+5071,+120972— 5017; 
+18997,—23175—1573/¢+2145/;), 


F2(2p,1g) = (11/768) (357o+210,+15/2—23175 
+249] ,+1575—2991¢+195/2), 


G(2p,1g) = (7/3840) (38570—811;+12897.—1707/; 
+ 1867/4—416975+529114—2145/;), 


G*(2p,1g) = (121/3840) (357o— 1497,+415/.— 10657; 
+186574—185575+94975—195/7). 
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Elastic Scattering of N“ by Be’ 


M. L. HALBERT AND A. ZUCKER 
Oak Ridge National Laboratory,* Oak Ridge, Tennessee 
(Received April 20, 1959) 


Nitrogen-beryllium elastic scattering was measured over an angular range from 32 to 144 deg in the center- 
of-mass system with an angular resolution of about one degree. The mean energy of the incident nitrogen 
ions was 27.3 Mev. To distinguish elastic scattering from other events, both the scattered and the recoil 
particles were detected in coincidence by thin CsI (TI) scintillation counters. The elastic scattering differential 
cross section is 550 mb/sterad at 32 deg c.m. It decreases monotonically and more rapidly than csc*(@/2) toa 
shallow minimum of about 5 mb/sterad at 106 deg c.m., rises slightly, and then falls to about 2.5 mb/sterad 
at 144 deg c.m., the largest angle measured. The data are compared to the predictions of a sharp-cutoff 
model for elastic scattering, but no agreement is found between this theory and the experimental results. 


INTRODUCTION 


LASTIC scattering is of considerable interest in 

the study of nuclear reactions and can be a useful 
tool for constructing nuclear potential models of all 
degrees of sophistication. Heavy-ion elastic scattering 
is susceptible to some simplifications because of the 
classical nature of the particles. The parameter 7 
=Z,Z.e?/hv indicates the degree to which a particle 
may be regarded as classical and is ordinarily larger than 
unity in heavy ion experiments. 

Previously N''— N" elastic scattering was studied at 
this laboratory.! More recently, the elastic scattering 
angular distribution of C” on Au’ was measured by 
Goldberg and Reynolds.’ In both of these experiments 
it was found that the one free parameter of the semi- 
classical sharp-cutoff model proposed by Blair® can be 
chosen so that good.agreement with the data is ob- 
tained. Theoretical considerations and experiments on 
elastic scattering of alpha particles from heavy elements‘ 
indicate that the Blair model provides a good fit if 
n>1, and ¢/ocou>1/n. Both heavy ion experiments 
cited above fulfill these requirements. 

In the present measurement 27.8-Mev nitrogen-14 

* Operated for the U. S. Atomic Energy Commission by Union 
Carbide Corporation. 

1H. L. Reynolds and A. Zucker, Phys. Rev. 102, 1378 (1956). 

2 E. Goldberg and H. L. Reynolds, Phys. Rev. 112, 1981 (1959). 

3 J. S. Blair, Phys. Rev. 95, 1218 (1954). 

4 Wegener, Eisberg, and Igo, Phys. Rev. 99, 825 (1955). 


ions were scattered from beryllium. This corresponds 
to n= 3.2, a value closer to unity than was encountered 
in previous experiments. Furthermore, it was found that 
over practically the entire angular region investigated 
a/ocour<1/n. Thus, although previous measurements 
of heavy-ion scattering lay in the domain of validity 
of the Blair model, it is not to be expected that this 
model will fit the results reported here. 


DESIGN OF EXPERIMENT 


In this experiment it was necessary to consider certain 
characteristics of 28-Mev nitrogen ions, the beryllium 
targets, and the kinematics of the scattering process 
in order to develop a workable design. Care had to be 
taken to avoid confusing nitrogen-beryllium elastic 
scattering with many possible similar events. These 
include inelastic scattering and transfer reactions® 
from beryllium and elastic scattering from impurities. 

Self-supporting beryllium foils prepared by vacuum 
evaporation were used as targets. Typical thicknesses 
were about 0.18 mg/cm’, representing approximately 
1-Mev energy loss for the incident nitrogen beam. 

Early in the course of this experiment it was found 
that the beryllium foils contained oxygen, and scatter- 
ing from it competed seriously with the desired scatter- 
ing. For example, at 20 deg laboratory angle the 


5 Halbert, Handley, Pinajian, Webb, and Zucker, Phys. Rev. 
106, 251 (1957). 
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counting rate from oxygen was about equal to that 
from beryllium. Only a small percentage of target 
impurity is required for such a large effect, since (1) 
the oxygen is of higher atomic number than beryllium, 
(2) for the heavier oxygen atoms a given laboratory 
angle corresponds to a smaller center-of-mass angle, 
(3) the reduction in the scattering cross section due to 
nuclear effects is probably smaller for the oxygen since 
the energy is closer to the Coulomb barrier. There was 
no indication that other impurities were present in the 
targets, or accumulated on them during the course of 
the experiment. 

Scattering from impurities can be distinguished from 
N—Be scattering if the detector has sufficiently good 
energy resolution. In this experiment, the detector was 
a Lucite-mounted 0.005-inch thick CsI(TI) crystal® 
on a DuMont-6291 photomultiplier. Partly because of 
poor light-collection efficiency, its resolution was no 
better than about 10% for the most favorable particle 
energies. At forward angles this was inadequate to 
resolve nitrogen ions scattered by oxygen from nitrogen 
scattered by beryllium. Similarly, inelastic scattering 
from low-lying states or certain transfer reactions could 
not be cleanly separated from the elastic events. 


The Coincidence Method 


A reliable way of eliminating the unwanted counts 
is to take advantage of the differences in the kine- 
matics for the various processes. For any choice of the 
N"™ scattering angle, the angle at which the recoil 
particle is emitted during a given scattering process is 
known and may therefore be used to identify the 
process. In practice then the procedure was to detect 
either the nitrogen or beryllium atom in one scintil- 
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Fic. 1. Kinematics of the N'*-Be® elastic scattering for 27.2-Mev 
laboratory energy. Laboratory angles of both particles are given 
as a function of the nitrogen c.m. angle. Typical energies of 
nitrogen and beryllium at various angles are shown on the curve. 


® Obtained from Isomet Corporation, Palisades Park, New 
Jersey. 
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lation counter, and to place a second counter at the 
proper angle to accept the other particle involved in 
the scattering event. Pulses from the first counter were 
displayed on a 20-channel pulse-height analyzer, which 
was gated by pulses from the second counter. In this 
way both coincidence and pulse-height information 
was obtained for each event. Figure 1 shows the 
kinematics of the elastic scattering. 

An example of the usefulness of the coincidence 
method may be seen in Fig. 2 which shows nitrogen 
spectra taken at a laboratory angle of 20 deg. The 
impurity scattering peak is eliminated in the gated 
spectrum, and the low-energy continuum (probably 
due to reactions leading to multiparticle breakup) is 
also greatly reduced. The improvement is even more 
dramatic at larger angles, where the ungated spectrum 
usually shows no more than a slight bump at the pulse 
height expected for elastic scattering events. 


TN.-GATED BY 
COINCIDENCES 
—— 3 deg 


CHANNEL NUMBER 


Fic. 2. Pulse-height distributions of counts in the defining 
counter gated and ungated by a second counter in coincidence. 
= 64.3 deg is the laboratory angle at which Be recoils are expected 
for elastically scattered nitrogen detected at 20 deg in the labora- 
tory system. 


Figure 3 shows 20-deg spectra gated by the second 
counter at various angles. With this counter at 64.3 
deg the peak is due to coincidences with recoil Be ions, 
as shown by the kinematics (Fig. 1). The asymmetric 
shape of the peak was probably caused by nonuniform 
scintillation response of the CsI crystal since the peak 
became symmetric with a smaller collimator in front 
of the crystal. The peak at the 70.3 deg position is 
probably N*™ scattered by O'%. The energy ratio 
inferred from the two pulse heights’ is just what one 
expects for nitrogen scattered from beryllium and 
oxygen, respectively. The area of the N—O peak is 
smaller than might be expected from the ungated 
spectrum of Fig. 2, probably because some of the O* 
recoils were stopping in the target. Special care was 
taken, as described below, to avoid such losses in the 
scattering from Be. 

In any such coincidence arrangement one must be 


7M. L, Halbert, Phys. Rev. 107, 647 (1957). 
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certain that the second counter accepts every particle 
associated with a desired particle entering the first 
counter. It is essential to know which counter is 
defining the angular acceptance of the coincidence 
system. The counter with the smaller aperture in the 
center-of-mass system will hereafter be called the 
defining counter, while the other will be referred to as 
the conjugate counter, in accordance with the nomen- 
clature of Mather and Swan.’ In estimating whether 
the conjugate aperture is sufficiently large, one must 
take into account the finite size of the beam spot and 
multiple scattering in the target. Whenever the latter 
effect might have been serious, experiments were 
conducted to insure that no counts were lost. 

Accurate alignment of the counters is important. To 
prevent loss of coincidences the aperture of the conju- 
gate counter must always remain in the plane deter- 
mined by the incident beam direction and the defining 
aperture. By careful mechanical construction it was 


(x16) 
4 


5 
20 


15 


| 
| 
| 


counts /channel 


10 


| | pe aie OR 

Ol : 

45 50 60 65 70 B 85 
CHANNEL NUMBER 


Fic. 3. Pulse-height distributions in the defining counter gated 
by the conjugate counter at various angles, ¢, in the laboratory 
system. 


assured that the plane of motion of the conjugate 
counter was parallel to that of the defining counter. 
To be certain that the planes actually coincided, a 
piece of paper masking tape was mounted over the 
aperture of each counter. The defining counter was 
moved into the direct beam and the orientation of the 
chamber was adjusted until the position of the charred 
spot on the tape indicated that the beam was striking 
at the center of that aperture. Then the conjugate 
counter was moved into the direct beam. The charred 
spot was found to be well-centered in the conjugate 
aperture, showing that the alignment was correct. This 
measurement was also used to determine the zero- 
degree setting of each counter. In a similar set of 
measurements with tape on the target holder, it was 
verified that the beam passed through the center of 
the chamber. 

In spite of the coincidence technique, at certain 


8K. B. Mather and P. Swan, Nuclear Scattering (Cambridge 
University Press, Cambridge, 1958), Chap. III. 
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Fic. 4. Kinematics of some reactions which may be confused 
with elastic scattering events. The numbers identify the products 
of N+ Be? reactions according to the legend in the diagram. Two 
curves are required for each reaction since either of the products 
may be counted in each detector. The primed curves give the 
required angle settings if the heavier product is to enter the 
defining counter. The unprimed curves are for the inverse case. 


angles confusion with unwanted events was possible. 
Figure 4 shows how this may arise. For simplicity only 
a few of the possible reactions are shown.’ The curves 
identified by primed numbers are those in which the 
heavier particle enters the defining counter; the 
unprimed curves describe the opposite situation. The 
points where these curves cross the elastic scattering 
curves (1 and 1’) then give the angle settings at which 
confusion may occur. At nearly all intersections of the 
elastic scattering curves with the other primed and 
unprimed curves the pulse height in the defining 
counter was quite different for the two processes and a 
clear distinction was easily made. However, where 
curve 1 crosses curve 5 the particle energies differ by 
only about 10%, and a clean separation of the Be® from 
the B" could not be expected. From the shape of the 
spectra obtained at these angles (128 to 132 deg c.m.) 
there was no evidence for any B" pulses. The data were 
analyzed on the assumption that all counts were due 
to Be’. 

The reactions leading to Be* deserve special mention. 
By virtue of the transverse momentum which may be 

®So many reactions may occur that hand calculation of the 
kinematics becomes exceedingly time-consuming. A program 
for the IBM-704 computer was developed to calculate certain 
kinematic quantities for nonrelativistic reactions resulting in two 
particles. The machine prints out the following information for 
each reaction product as a function of the center-of-mass angle: 
(a) laboratory angle, (b) laboratory energy, (c) solid-angle 
transformation from laboratory to c.m. system, (d) time of flight 
in nanoseconds/meter (10~* sec/meter), and (e) magnetic rigidity. 
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imparted to the alpha particles produced when the Be® 
breaks up, one of them might be detected even if the 
counters are not at the correct angles for (Be’+N") 
detection. In most situations of interest in this experi- 
ment, the Be® kinetic energy is large compared with its 
distintegration energy. The maximum angle between 
the alpha-particle direction and the initial Be’ direction 
is therefore only a few degrees, and most of the alpha 
particles do not deviate even this much from the initial 
direction. By broadening the appropriate lines in Fig. 
4 to represent the maximum possible deviation of the 
alpha particles from the Be® angle of emission, it can be 
shown that of the reactions leading to Be® in its ground 
state only those resulting in N® excited to the 6.33 Mev 
(curve 4) or higher states might be mistaken for elastic 
scattering events. Fortunately, the Be® pulse height is 
sufficiently different from that of the alpha particle 
to permit a good separation. 


The Scattering Chamber 


Figure 5 is a simplified diagram of the scattering 
chamber. Its interior diameter is 24 inches and its 
depth about 8 inches. Each counter and its associated 
magnetic shielding is mounted on a rigid aluminum 
arm whose angle can be changed manually from 
outside the chamber. A worm-gear drive for each 
counter with a 100-to-1 reduction makes it possible to 
move the heavy iron shielding against the magnetic 


forces and also provides a brake against the pull of the 
field on the counters. 
The counter arms are mounted on concentric shafts 


with rotating O-ring seals and can be moved in- 











Fic. 5. Schematic drawing of the scattering chamber. 
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dependently of each other. A stationary tube inside the 
smaller shaft provides support for the target-carrying 
rod and an unobstructed annulus for electrical connec- 
tions. The target can be positioned in and out of the 
incident beam and be rotated on the axis of the counter 
motion. These adjustments can be made without 
breaking the vacuum. 

All mechanical support and electrical connections 
are thus provided through one wall of the chamber. 
The opposite wall of the chamber is a 2-inch thick, 
27.5-inch diameter laminated glass disk. Through it the 
entire interior of the chamber can be seen. This feature 
proved its worth many times. The location and condi- 
tion of the target, the electrical leads, the collimators, 
and the counters can all be checked at a glance or 
adjusted easily. An angle scale engraved on the inner 
face of the supporting wall is also easily visible. By 
means of a reference marker attached to each arm, the 
counter angles can be reproducibly set to +0.1 deg. 

Access to the inside of the chamber is provided by 
three 4.5-inch diameter ports. One serves as the beam 
entrance port, another is opposite the entrance port, 
and the third is at 90 deg to the incident beam direction. 

The scattering chamber was located near the cyclo- 
tron, in a fringing field of about 1000 gauss near the 
entrance collimator and decreasing to several hundred 
gauss at the defining counter. Each photomultiplier 
was shielded simply and effectively with a close-fitting 
mu-metal cylinder and two concentric iron cylinders. 
The effect of the magnetic field on the particle trajec- 
tories constituted a serious problem, particularly when 
it was necessary to detect low-energy particles. To 
minimize curvature of the paths and separation of the 
charge states, the particles traveled inside an iron pipe 
installed on each counter between the target position 
and the counter aperture. A third pipe was used to 
shield the path of the incident beam before it reached 
the target. 

All apertures were circular. The entering N“ beam 
was collimated by a }-inch diameter hole 10} inches 
from the target. The beam was slightly divergent ; when 
it reached the target it was at most 3°; inch in diameter. 
The divergence of the beam was thus no larger than 
+0.1 deg. 

The collimator of the defining counter was either 
5 inch or jg inch in diameter depending on the experi- 
mental requirements. Located 7 inches from the center 
of the chamber, these collimators subtended a cone 
of half angle slightly less than 0.5 deg or 0.25 deg. The 
true angular resolution was somewhat larger, due to 
factors such as the finite beam size and multiple 
scattering. 

The conjugate counter was normally used with a 
g-inch diameter aperture placed about 3} inches from 
the center of the chamber so that it subtended a cone 
of half angle approximately 2.9 deg. The size of the 
collimator could be varied for experimental checks of 
multiple scattering effects. 
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EXPERIMENTAL PROCEDURE 


The electronic apparatus for this experiment was 
fairly standard. Pulses from the two scintillation 
counters passed through cathode followers and were 
amplified by linear amplifiers. An integral discriminator 
following the conjugate-counter amplifier was used to 
gate a 20-channel pulse-height analyzer on which the 
pulses from the defining counter were displayed. The 
resolving time of the circuit was about 2.8 usec. 

As mentioned previously, the coincidence method 
requires that one take special care to be sure that no 
counts are lost. Experimental tests were devised to 
check that: (a) the conjugate counter was at the proper 
angle, (b) all coincident particles had sufficient energy 
to leave the target and be detected by the conjugate 
counter, (c) the aperture of the conjugate counter was 
large enough to accept all coincident particles, and (d) 
all coincident particles entering the conjugate counter 
actually gated the pulse-height analyzer. 

To check (a) and (c), the coincidence rate for a given 
setting of the defining counter was measured as a 
function of the conjugate-counter position. Figure 6 
shows such a curve with the defining counter at 20 deg. 
The expected angle for detection of recoil Be particles 
is 64 deg. The curve immediately checks condition (a), 
while the existence of a plateau is evidence that require- 
ment (c) is satisfied. A similar curve was obtained for 
every setting of the defining counter. 

If a good plateau was not obtained it was usually 
because coincident particles were multiply scattered as 
they passed out of the target. Tilting the target so that 
these particles traversed a shorter length of target 
material would in most cases restore the flat top. 
When this failed to give a good plateau, a direct check 
on multiple scattering was made. The usual 3-inch 
diameter collimator on the conjugate counter was 
replaced by a 37-inch collimator. If no reduction in 
coincidence rate was observed, the 3-inch collimator 
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Fic. 6. Coincidence rate as a function of conjugate counter 
angle, ¢. The defining counter was fixed at 20 deg in the labora- 
tory system. The small peak at 70 deg is probably due to oxygen 
impurity in the target. 
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was clearly large enough to satisfy (c). On the other 
hand, if a counting loss was observed the data at this 
angle were not used for a cross-section determination. 
This condition limited the smallest angle to 12.5 deg, 
laboratory system. A larger aperture on the conjugate 
counter might have permitted additional measure- 
ments, but other difficulties with the low-energy 
recoils would have precluded their extension to angles 
significantly smaller than 12.5 deg. 

The pulse-height analyzer was gated by pulses in the 
conjugate counter above a certain threshold. To check 
(b) and (d) the coincidence rate was measured as a 
function of this threshold setting. Ordinarily the curve 
obtained showed a broad region in which the coincidence 
rate was independent of pulse-height setting. This was 
taken as sufficient evidence that (b) and (d) were 
satisfied. If the coincident particle was of low energy 
the curve tended to have a shorter plateau. In such 
cases tilting the target toward the conjugate counter 
lengthened the plateau. However, at no angle permitted 
by requirement (c) was there any doubt that all the 
coincident particles were reaching the conjugate 
counter and turning on the gate circuit. 

For the cross-section measurements from 32 deg c.m. 
to 90 deg c.m., the defining counter was used to detect 
the scattered N“. Near 90 deg c.m., however, several 
difficulties manifested themselves. (1) Background in 
the gated spectra became a problem. As the large- 
aperture conjugate counter was moving further forward 
many more gate openings not due to the desired elastic 
scattering events occurred. (2) The defining aperture 
corresponded to a very large angular acceptance in the 
center-of-mass system. (3) Multiple scattering of the 
N* became appreciable. The last two effects tended to 
decrease the effective angular resolution. 

All of the above difficulties were alleviated simply by 
interchanging the roles of the counters. That is, the 
Be’ recoils entered the defining counter while the 
coincident N' ions were detected by the conjugate 
counter. Eventually, however, as the measurements 
were extended to much larger angles, the decreasing 
cross section and increasing background combined to 
make the data uncertain. For example, at the largest 
angle studied (144 deg c.m.) the uncertainty in sub- 
traction of the background continuum was of the 
order of 20%. 

In an attempt to study the origin of the background, 
a pair of large-angle runs was taken with full beam 
intensity and with one-fourth of full beam. The two 
runs gave identical spectra. This showed that the 
background was not due to random coincidences, but 
to real nonscattering events. These may have been 
many-body disintegrations in which two of the charged 
reaction products happened to enter the counters. The 
angular distribution was not extended beyond 144 
deg since the background would have been extremely 
difficult to eliminate, and it was likely that multiple 
scattering losses due to the low energy of the N ions 
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would have prevented measurements at angles not 
much larger than 144 deg. Furthermore, at these angles 
the counting rate in the conjugate counter was highest 
and therefore the pair of runs at high and low beam 
level showed that the contribution of chance coin- 
cidences to the elastic scattering peak was negligible, 
as was blocking of the gating circuit by a high counting 
rate. 

To measure the scattering cross section it is necessary 
to know the number of nitrogen ions striking the target 
per unit time. This was done by measuring and inte- 
grating the beam current after it passed through the 
target. An insulated cover plate on the forward port 
was used for this purpose. A check with a Faraday cup 
showed that no errors in these measurements were 
caused by escape of secondary electrons. 

To check for proper overall operation of the detection 
system and also for possible vaporization of target 
material, frequent runs were made at certain standard 
angles, at least once every three hours. These standard 
angles (66.1 deg c.m. when the N“ was detected by the 
defining counter and 85 deg c.m. when the Be’ entered 


+ 





da /dw(mb/sterodian) 








90 
NITROGEN C.M, ANGLE (deg) 


Fic. 7. The differential cross section of N'*— Be’ elastic scatter- 
ing as a function of the nitrogen c.m. angle. All experimental 
points are shown except at 85 deg where many points were taken 
and only an average is shown. The upper curve is the cross section 
for Coulomb scattering. 
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this counter) were selected for their high counting rates 
and the absence of the experimental difficulties de- 
scribed above. It was found that these runs for a given 
target were consistent within the statistical error. 

The energy of the beam was determined by placing a 
hydrogen-filled chamber at the forward port of the 
scattering chamber, and measuring the range in 
emulsion of recoil protons at zero degrees. This method 
has been described in detail previously."° No con- 
tinuous check on the beam energy during the scattering 
was made; however, past experience indicates that the 
beam energy was probably constant to within +200 
kev. Since the targets were relatively thick, usually 
about 1 Mev, no effects due to small beam-energy 
fluctuations could have been observed. All cross-section 
calculations are based on a mean energy in the target 
of 27.3 Mev. 


RESULTS 


We present the results of this experiment graphically 
in Fig. 7. All acceptable experimental points are shown 
in this figure. The angular resolution in the center-of- 
mass system was about one degree up to 77 deg, and 
somewhat larger beyond this angle. We also show for 
comparison the differential cross section for Coulomb 
scattering calculated from the relation docou/dw 
= (Z,Z2e"/2yuv?)? csc'(0/2). Here uw is the reduced mass, 
v the relative velocity, and 6 the angle of scatter of the 
N* in the center-of-mass system. 

The results are also presented in tabular form for the 
convenience of those who might want to analyze the 
data further. Table I gives the elastic-scattering 
differential cross section and its ratio to the Coulomb 
cross section. The percent errors listed are estimates of 
the standard deviation and include such effects as 
statistical fluctuations, uncertainties in background 
subtraction, errors in angle setting, and random errors 
in beam integration. At large angles the increased 
error is due mainly to uncertainties in background 
subtraction. 

The measured cross section in the region from 128 to 
132 deg c.m. may be too large because of the previously 
mentioned confusion with the reaction Be®(N",B)C}*, 
We estimate that the contribution of this transfer 
reaction is probably not larger than 40%. 


Systematic Errors 


In addition to the relative errors listed in Table I, 
there are systematic errors which do not change the 
shape of the angular distribution but do affect the 
absolute value of the cross section. These errors arise 
from uncertainties in (1) the target thickness, purity, 
uniformity and tilt; (2) the solid angle subtended by 
the defining counter; (3) the average charge of nitrogen 
ions emerging from the target; and (4) the beam- 


1 Reynolds, Scott, and Zucker, Phys. Rev. 102, 237 (1956). 
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integrator calibration. All percent errors given below 
are estimated standard deviations. 

The target thickness was determined by weighing 
the beryllium foils and measuring their area. These 
measurements were subject to an error estimated to be 
+10%. Appreciable oxygen contamination was present 
in the foils, and the weight of oxygen had to be sub- 
tracted from the gross foil weight. The amount of 
oxygen in one target was estimated in the following 
way. From the ungated spectrum at 20 deg lab (Fig. 2) 
the relative number of nitrogen ions scattered by 
oxygen and beryllium was determined. The N—O 
elastic cross section at this angle was estimated to be 
three quarters of the Coulomb cross section. This 
number is probably accurate to 30%. The total per- 
centage of oxygen in the target calculated on this basis 
is 7.4+2.2 by weight. The same figure was used to 
correct the weight of the other targets. 

The uniformity of the targets is more difficult to 
evaluate. No measurable difference in the counting 
rate was found when various portions of a foil were 
used, and data from four different targets, when 
normalized to the same target weight, gave the same 
results. The effect of nonuniformity is thus probably 
small compared to the 10% standard deviation due to 
the thickness measurements themselves. The target 
tilt which was necessary for small and large angle 


TaBLE I. Differential cross section for N'4-Be® elastic scattering 
and its ratio to Coulomb scattering at 27.3-Mev incident nitrogen 
energy. 


Nitrogen 
center-of- 
mass angle 

(degrees) 


32.2 551 
35.5 317 
38.7 213 
42.2 119 
45.4 81.1 
48.9 61.2 
52.1 39.8 
55.7 23.9 
59.0 15.4 
62.7 12.7 
66.1 9.67 
69.9 6.79 
74.0 4.59 
13 3.51 
85.0 2.71 
93.0 1.64 
98.0 1.03 
103.0 0.571 
108.0 0.537 6 
112.0 0.640 6 
116.0 0.610 10 
120.0 0.502 10 
124.0 0.460 10 
128.0 0.378 10 
132.0 0.284 15 
136.0 0.293 20 
140.0 0.323 20 
144.0 0.254 20 


Standard 
deviation® 
(percent) 


da/dwe.m 


(mb/sterad.) da/docoul 


0.366 
0.308 
0.290 
0.224 
0.202 
0.202 
0.166 
0.137 
0.116 
0.104 
0.0961 
0.0812 
0.0676 
0.0602 
0.0634 
0.0508 
0.0374 
0.0241 
0.0259 
0.0340 
0.0354 
0.0317 
0.0314 
0.0277 
0.0223 
0.0243 
0.0283 
0.0233 
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oo 


® The standard deviation is for random errors only. For systematic 
errors see text. 
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measurements produced an additional uncertainty 
of 3%. 

The solid angle subtended by the defining aperture, 
computed from its diameter and its distance from the 
target, could have been in error by about 4%. The 
average charge was calculated from the equilibrium 
charge distribution for nitrogen in plastic foils." It 
also might have been in error by several percent. The 
absolute calibration of the beam integrator was un- 
certain by a similar amount. Fortunately these three 
uncertainties may be eliminated by measuring the 
counting rate for some process with a known cross 
section from a target of accurately known thickness. 
In effect one substitutes one error, that in the thickness 
of this secondary target, for the three uncertainties 
given above. 

The process chosen for the absolute calibration was 
elastic scattering of nitrogen from a thin nickel foil. The 
available nitrogen energy is well below the Coulomb 
barrier for nickel, and the scattering cross section can 
therefore be readily calculated. It was found that over 
the entire range of defining counter settings used in the 
nickel scattering the cross section exhibits the expected 
csc!(@/2) shape, thus providing a check on the measured 
laboratory angles. The N— Be absolute differential cross 
section calculated from the measured solid angle, 
estimated average charge of the beam, and integrator 
calibration agreed with the cross section based on the 
nickel calibration to within 2%. 

The uncertainty in the nickel thickness measurement 
was +3%. The root-mean-square error from all 
systematic factors was thus about +11%, due mostly 
to the uncertainty in the Be target thickness. 


Inelastic Scattering 


The cross section for inelastic scattering to the 2.31- 
Mev state in N“ was measured at 66 deg and 98 deg 
c.m. At 66 deg a peak for this process was found and the 
cross section is 0.04+0.01 mb/sterad. At 98 deg no 
inelastic peak was found, and an upper limit to the cross 
section is 0.005 mb/sterad. 


DISCUSSION 


The sharp-cutoff model proposed by Blair for elastic 
scattering postulates that the scattering nucleus absorbs 
completely all partial waves from /=0 to a certain 
cutoff /-value, while all the partial waves with />Jcutore 
are Coulomb-scattered. This model has shown good 
agreement with data from elastic scattering of alpha 
particles by heavy nuclei! provided »=Z,Z2e"/hv is 
large compared to unity and if ¢/ocou>1/n. In the 
two experiments cited in the introduction both these 
conditions hold: in the N*—N" scattering, n= 6.1 and 
o/ocoui Was always greater than 4, while in the nitrogen- 
gold scattering » varied from 24.5 to 31.0, and the 


1 Reynolds, Wyly, and Zucker, Phys. Rev. 98, 474 (1955). 
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Fic. 8. The heavy curve represents the elastic cross section 
divided by the Coulomb cross section. The lighter oscillating curve 
is the result of a sharp-cutoff calculation using /max=8. 


experimental measurements were limited to the angular 
region where o/acoui> 7y- 

In this experiment, however, n=3.2, and @/ocoui 
<1/3.2 over the entire measured portion of the angular 
distribution. Hence one should not expect the sharp- 
cutoff theory to agree with the experimental results. 


AND A. 


ZUCKER 


The heavy line in Fig. 8 is the ratio of the experimental 
cross section to the Coulomb cross section, and the 
rapidly oscillating lighter curve is calculated from the 
Blair model according to the formula 


do my ; 
— = |————- exp[— 7 In sin?(0/2) Je? 
dw |2k sin?(@/2) 


2 


8 
——), (2/+-1)e?*:P 1(cos8)| , 


2ik i= 


where @ is the c.m. angle, & is the wave number of the 
relative motion, and 6,=I'(/+1+in)/T (l+1—in). The 
cutoff /-value of 8 was obtained from the semiclassical 
relation 

1(1+1)#?=2yR?(E— Ez), 


where p=reduced mass, E=center-of-mass energy, 
Ep=Z,Z2e?/R, and R=1.5(A,'+A,') fermis. Calcu- 
lations using cutoff /-values of 6, 7, 9, and 10 were also 
made and show no better agreement with the data. 

No further attempt has been made to interpret the 
results of the present experiment. Other approaches, 
such as a phase-shift analysis or complete optical-model 
calculation may well provide valuable information 
about the behavior of nuclei in close collision. Such 
calculations are, however, beyond the scope of this 
work. 

ACKNOWLEDGMENTS 


The authors wish to thank J. A. Northrop for kindly 
providing the precious beryllium foils. We are indebted 
to H. L. Reynolds, and E. Goldberg who made the 
IBM-650 code for the Blair model calculation available, 
and to B. D. Williams who developed the kinematics 
program for the IBM 704. Our thanks go also to J. G. 
Harris for the construction of a large part of the ap- 
paratus, to A. W. Riikola and H. L. Dickerson for their 
untiring efforts in operating the cyclotron, and to C. E. 
Hunting and G. A. Palmer for their valuable assistance 
in taking data. 





PHYSICAL REVIEW VOLUME 


115, 


NUMBER 6 SEPTEMBER 15, 1959 


Quantum-Mechanical Three-Body Problem* 


LEONARD EyYGES 
Lincoln Laboratory, Massachusetts Institute of Technology, Lexington, Massachusetts 


(Received November 3, 1958) 


We treat the quantum-mechanical problem of three spinless particles, with the boundary condition that 
the logarithmic derivative of the wave function be a prescribed constant at each of the three boundaries 
|\ri—re| =a, |r2—r3| =a, |r1—r3| =a. This boundary condition is discussed; it is roughly equivalent to an 
interparticle potential which consists of a hard core plus a strong short-range attractive part. The eigen- 
functions and eigenvalues of the system are given by the solutions of an infinite set of coupled homogeneous 
integral equations. The equations involve partial wave expansions in the interparticle distances and can often 
be truncated with good approximation by taking only a finite number of partial waves. We discuss the 
solution of these equations for the ground state of the system, taking relative S-waves only, for which case 
the infinite set of equations reduces to a single integral equation in one variable. 


I. INTRODUCTION 


T goes without saying that we would like to solve the 
following problem. Three (or more) particles with 
arbitrary spin, statistics, and interparticle forces inter- 
act. What are the eigenfunctions and eigenvalues of the 
possible bound states of the system? It also goes without 
saying that the problem in this generality is very diffi- 
cult and that we must begin by simplifying it. In this 
paper we present a method for treating a simplified 
version of this problem, but a version which retains 
much of the essential difficulty inherent in the fact that 
it is a three-body problem. The method seems gener- 
alizable to more than three bodies. 

We begin by making the simplifying assumptions that 
all the particles are identical and spinless, and by 
neglecting statistics, in the sense that we make no 
special requirements on the symmetry or antisymmetry 
of the wave function. The method appears applicable, 
but is of course more complicated, if these assumptions 
are not made. There is a more important simplification 
that we make. We would like at least to be able to solve 
the problem for central forces, i.e., when the potential 
energy, as a function of the position vectors 1m, f2, r3 of 
the particles, is of the form 


V 12(%'12) + V13(113) + V 23 (23), (1) 


where the V ;; are arbitrary functions and r,;;= |r;—r;|. 


The simplification in the problem we discuss consists of 
replacing these potentials by a boundary condition on the 
wave function at each of the three boundaries 


%12=64, Leg=a, i3:=d. (2) 


We use boundary conditions instead of potentials of the 
form (1) mainly because the boundary conditions are 
easier to handle mathematically. We do not pretend 
that they correspond to any real physical problem, 
although they have been extensively used in nuclear 
dhysics.! From our point of view they are a mathe- 
ij 

* The research in this document was supported jointly by the 
Army, Navy, and Air Force under contract with the Massachusetts 
Institute of Technology. 


matical artifice, a stepping stone which enables us to 
take a fairly long step toward the solution of more 
complicated, and more realistic? three-body problems. 

If W is the total wave function of the system, the 
boundary condition that we use is 


(1/¥) (W/an) = —y, (3) 


at each of the boundaries in (2). Here y is some constant, 
the same for all boundaries and 0/0n is just 0/0r;; for 
the boundary 77. Although this condition is mainly a 
mathematical device it does correspond, in an imprecise 
way, to the following interparticle potential: an infinite 
repulsive core of radius a, surrounded by a deep attrac- 
tive well of very short range. We elaborate on this now. 

Suppose we have a particle bound to a spherically 
symmetric potential which is of arbitrary shape except 
that it has a finite radius ro: V=0 for r>ro. Let the 
particle be in the ground state with binding energy E£. 
The logarithmic derivative of the internal (r<7o) and 
external (r>r0) wave functions are then equal to each 
other at r=7o. Given the potential we could, at least in 
principle, always calculate the value of this logarithmic 
derivative. Conversely, if one were given only the 
logarithmic derivative of the wave function at r=ro and 
were completely ignorant of the form of the potential 
for r<ro, one could reconstruct at least the external part 
of the wave function since the general form of the ex- 
ternal solution is known and only the constants in it 
need be determined. But we would know nothing about 
the wave function for r<ro, since we could not recon- 
struct the interior solution from a knowledge only of the 
logarithmic derivative at r=1. 

We can, however, imagine a kind of degenerate po- 
tential for which the knowledge of the logarithmic 
derivative at a certain radius defines the wave function 
everywhere. Suppose the potential V is defined by 
V=+ for r<a, V=—Vpo for a<r<a+d, V=0 for 


1H. Feshbach and E. Lomon, Phys. Rev. 102, 891 (1956) and 
the reference cited therein. 

* For a solution of a three-body scattering problem involving 
potentials of infinitely short range see G. V. Skorniakov and K. A. 
TerMartirosian, J. Exptl. Theoret. Phys. U.S.S.R. 31, 775 (1956) 
(translation: Soviet Phys. JETP 4, 648 (1957) ]. 


1643 





1644 


r>a-+d. Then it is well known that there is a multi- 
plicity of wells, i.e., different values of V» and d, that 
will give a binding energy £ for the ground state, only 
the quantity Vd? being determined by the energy. For 
r>a-+d, all these wells have the same wave function; 
they differ only in the region a<r<a+d; all the solu- 
tions vanish of course for r<a. Now if we imagine d to 
diminish and Vo to increase in such a way that Vod? is 
constant, we get in the limit a degenerate well for which 
there is no interior and for which the exterior solution 
extends down to r=a. In this limit then, and for the 
ground state, the degenerate potential is equivalent to a 
boundary condition at r=a, for either the one or the 
other uniquely defines the wave function over all space. 

The discussion above deals with a particle in the 
ground state. What makes boundary conditions never 
exactly equivalent to potentials, even to degenerate 
ones, is that the logarithmic derivative of the wave 
function, evaluated at whatever radius, depends on just 
which state we treat. Thus, even if we limit ourselves to 
exterior solutions for nondegenerate potentials, we can- 
not reproduce these solutions for all states of any true 
potential by an energy-independent boundary condition. 
This kind of boundary condition must, as we have said 
before, be considered mainly as a mathematical device 
for uniquely defining a boundary value problem which 
is, however, very suggestive for the three-body problem 
involving true potentials. 

For later comparison with the three-body problem we 
write the relation between y and the energy for a bound 
state of fwo particles whose center of mass is at rest, with 
the boundary condition (3) at r=a, where r is the 
interparticle distance. The solution for r>a of the 
Schrédinger equation is e~*'/r, where K?=m| E| /h?, 
E=-—|E|, and mis the mass of each particle. The above 
boundary condition then gives Ka=ya—1. There is a 
bound state only for ya>1. 

Perhaps we should make clear from the outset what is 
the nature of the solution we shall develop. The wave 
function V is a function of rj, re, and rs. It is defined 
either as a solution of a 9-dimensional Schrédinger 
equation or of the corresponding integral equation. This 
integral equation as it first appears is not in a form 
which is practical for numerical computation. Our 
treatment of the problem consists of deriving an equiva- 
lent infinite set of coupled integral equations, which can 
be truncated with good approximation and which are 
‘solution”’ 


‘ 


practical for numerical computation. The 
thus consists of replacing an intractable integral equa- 
tion by a set of tractable ones. 


II. THE BASIC IDEA 


With the assumptions discussed above, we have a 
nonseparable boundary value problem involving a par- 
tial differential equation in 6 variables (after separating 
off the center-of-mass motion). It is of a special kind in 
that it is nonseparable, not because the individual 
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boundaries are nonseparable, but because there is more 
than one such boundary. We have discussed elsewhere* 
problems which are mathematically similar in three 
dimensions, e.g., the scattering of a wave from a number 
of scatterers or the problem of a particle bound to more 
than one spherical potential well. We found that these 
problems could be handled if one began by writing 
general solutions to the wave equation in a special way. 
This way was as follows: For each potential or boundary, 
one wrote a separate solution of the wave equation in 
coordinates appropriate to that potential or boundary, and 
took the total wave function to be the sum of such 
solutions. Then boundary or matching conditions at any 
given potential, say the ith, were satisfied by the use of 
formulas for expressing all the other solutions except 
the ith in the ith coordinate system. This form of the 
solution was motivated by looking at the integral 
equation formulation of the problem; it is discussed in 
reference (3). 

We use the same point of view for the three-body 
problem. Here the boundaries of Eq. (2) are of course 
“surfaces” in the nine-dimensional space of m1, fe, fs, but 
this is irrelevant for the application of the general idea. 
We write the wave function as a sum of solutions of the 
wave equation, each solution appropriate to one of the 
boundaries. Then for satisfying boundary conditions at 
a given one of the three boundaries we use transforma- 
tion formulas which express the solutions appropriate to 
the other boundaries in terms of the coordinates of that 
boundary. 

Natural sets of coordinates appropriate to the 
boundaries (2) are [in addition to the center of mass 


R= 4(rytret+rs) | 


M2=M1— fe (4a) 


and o3=4r3—}(mt+r.2), 


and 92.=fm—}3(m+rs), (4b) 


and O1=M1— 3 (Fet4s). (4c) 


These sets of coordinates are of course not independent. 
The linear relations among them are of the form 


ri3= +14 313, 
ae 1 
e2>= +4023— 301, 


rio= +e1— 3823, (5) 
Os= — iT 23— 201, ” 
and so on. The Schrédinger equation, with the center-of- 
mass part split off has, of course, the same form in all 
these three pairs of coordinates. For example, in coordi- 
nates Tes, 0; it is, for particles of mass m, 


(Vros?-+3Vor2)~= KY, (6) 


where for a bound state we have set E=—|F| and 
K?=m| E|/h?. Note that there is no factor of two in the 
definition of K?. 

Now we write a general solution to the wave Eq. (6). 
This equation has elementary separable solutions which 


3 L. Eyges, Ann. Phys. (N. Y.) 2, 101 (1957); Phys. Rev. 111, 
683 (1958), 
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are functions of rz; times functions of o:. For satisfying 
boundary conditions at r23;=<a, it is clearly appropriate 
to imagine Vro;* expressed in spherical coordinates 123, 
623, ¢23, and to work with solutions which are spherical 
Hankel functions of 72; times spherical harmonics in 
Q23, (where Q23 stands for 623, ¢23). Since there is no 
boundary condition on @; we simply try exponential 
functions of @; for that part of Eq. (6) which involves 
Vor’. Thus we are led to try as an elementary solution‘ 
of (6) 

hy(ikr23) V im(Qes) exp(ix: 01), (7) 
which is a solution if 


R= K?+- 3x2, (8) 


The elementary solutions (7) are almost, but not 
quite, of the correct form. To see what changes must be 
made in them let us reconsider the two-body problem 
for a moment. There we found that the radial wave 
functions for relative motion were /,(ikr) for r>a and, 
because of the hard core of the potential, were zero for 
r<a. Formally, we can express this by saying that these 
wave functions were h(ikr) where 


hi(ikr)=0, r<a 


C 
«kittie. (9) 


r>da. 

The difference between the functions ; and h, really 
becomes important only if we take the Fourier trans- 
form of the wave functions. This was not necessary to 
solve the two-body problem. But for the three-body 
problem, as we shall see, it is essential to the analysis to 
Fourier-analyze the wave function and it does make a 
profound difference whether or not we use /;, or h; to 
build up the elementary solutions. In fact, using the 
former functions, one is led to inconsistent integral 
equations. Using h; on the other hand, one is led to a 
consistent set of equations with reasonable solutions, 
which is a pragmatic, if not mathematical proof for the 
necessity of using them. To state it differently, we 
assume that the boundary value problem we define has 
a unique solution and that sufficient justification for 
using the h, is that with them we shall be led to a 
function that satisfies the wave equation and the 
boundary conditions, namely to a solution. 

To continue then, we use the functions h; to form the 
elementary solutions 


hi(ikros) Y tw (Qo) exp(ix : 01) ° ( 10) 


Then we build up a general solution to Eq. (6), for 
23>, by multiplying (10) by arbitrary coefficients and 
summing (and integrating) over the complete set of 
functions of which (10) is a member. That is, we 
multiply (10) by an arbitrary function gim (x), sum 


‘We use ikro; instead of kro; as the variable of the Hankel 
functions in anticipation of the fact that we shall be dealing with 
wave functions involving decaying exponentials, since we have to 
do with bound states, and the spherical Hankel functions of 
imaginary argument are of just that kind. 
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over / and m, and integrate over x. We call the resulting 


function p23. 
Yo3= = £m? (x)hi(ikres) 
l,m 
X V im(Qe3) exp(ix: o1)dx, (11a) 


¥23 is a solution of the wave equation, provided Eq. (8) 
is satisfied. In accordance with what we have said 
before, we write similar solutions in ry. and @; coordi- 
nates, which solution we call ~2, and also in rr; and @2 
coordinates, which solution we call 13. 


gum ® (a)hy (ikris) 


X Vim (Qs) exp(ie- o2)da, 


Yi= L 


Um’ 


(11b) 


Lim (a)h v( tkr 2) 


Yi2e= } 


pe 

X Vem (Qi) exp(io-e;)de. (11c) 
In these last two expressions we have for later con- 
venience called the integration variable @ and the 
summation variables /’ and m’. Of course, k= K?+ {o”. 
With these definitions the total wave function y is taken 
to be 


V=PrtYistyrs, (12) 


and it is this W which we shall require to satisfy the 
boundary condition Eq. (3), at each of the bound- 
aries (2). 


Ill. ANGULAR MOMENTUM CONSIDERATIONS 


For the boundary conditions (3) the Schrédinger 
equation has solutions which are not only eigenfunctions 
of the energy but of the total angular momentum (and 
its g component). It is then clear that the forms of the 
Zim‘) (%) are not completely arbitrary but are limited 
in some way special to each angular momentum state. 
We discuss the form of this limitation in this section. 

The usual operators for the components of angular 
momentum in the coordinates mr, re, r3 are, €.g., 


3 0 0 
L.=~i8d ( ——3; ), 
i=l 02; OY; 


with similar forms for the y and z components. We can 
express this operator in any of the three sets of coordi- 
nates (4a), (4b), and (4c). Let the components of rz; be 
(223,283,223) and of o; be (£1,m,¢1), with similar notation 
for the other pairs 12, 93 and 113, @2. The operator for the 
x component of angular momentum in 23, @; Coordinates 
is then 


0 0 0 0 0 
—_—Z—+ 
, Of) 


OZ ay 
with similar expressions for L, and L, and similar ex- 
pressions in the other two sets of variables. Since none 


0203 


On 1 
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of the wave functions (11a), (11b), and (11c) depends 
on R, we see from the form of the commutation relations 
that the angular momentum in r»3, 9; coordinates can be 
looked upon as being that due to the vector composition 
of angular momentum carried by the “rs particle’ and 
that carried by the “p; particle.” Now the angular 
momentum properties of the “‘r23 particle” are given by 
its spherical harmonic expansion, which is explicitly 
exhibited in (11a). To investigate the angular mo- 
mentum properties of the “p; particle,” we expand 
£im™(%) in spherical harmonics of 2,, where 2, stands 
for the angle variables in spherical coordinates in x 
space. 


© hi 
£im?® (x) = > Gimiym?*® (Kk) V tym (Q,). (13) 


11=0 mjy=—1) 


If we put this into (11a) we exhibit the wave function 
W23 as a sum of products of spherical harmonics, one 
with 23 as variable and the other with Q, as variable. 
By taking the appropriate combinations of these 
products, according to the vector coupling model, and 
the known Clebsch-Gordan coefficients, we can make 
the wave function (11a) an eigenfunction of any LZ? and 
L, we choose. Thus we have, using 

exp(ix-o1)=49 S> 7!2j12(kp1) V tome(Qo1) V tome* (Q,) 

lome 


and 


dx=«x"dQ.dk, 


2349 | 2 Gimiymy (x) V ym (Qh (ikres) 


imilim,leme 
x V tm(Q23)t"* jl2(Kp1) V tomo(Qp1 ) V tome*(Q,)x2dQ dk. 


We do the integrals over dQ, to get 


Gimlym, (x)hil tkros) 


Yo3= 4 ; 


imlym, 


x ¥ pm(Q3)4" jy (Kp) V tym y(Qp1)K7dk. 


Now suppose we want y2;3 to be an eigenfunction of a 
given L and M=L,. We can effect this by writing 


Gimym (x) = Fi? (K)Ciy (LM ; mim), 


where the Ci,(L,M;m,m,) are the Clebsch-Gordan 
coefficients, in the notation of Blatt and Weisskopf.® 
For then we have 


Fi (x)h 1( tkr3)t" 71, (Kp1) 


Yo3=4r y r 


l=0 l1=0 


l 4 
x( a ZZ Cu,(L,M > m,m,) V tm(Qo3) V tymj(Qpy)) xd. 
m=—I1 mj=—l; 
5 J. Blatt and V. Weisskopf, Theoretical Nuclear Physics (John 
Wiley & Sons, Inc., New York, 1952). 
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This clearly exhibits 23 as an eigenfunction of L and M’ 
and it is plausible, though unproven that this is the 
most general way we can solve the problem. 

We write similar expressions for ¥i2 and 13, using 
arbitrary functions Fi*)(x) and Fi," (x) and the 
wave function W defined by Eq. (12) will then be an 
eigenfunction of L and M. 

Let us consider the case L=0 in a little more detail. 
For this case the Clebsch-Gordan coefficients vanish 
unless /=1,, m=—m,; moreover, those that do not 
vanish are independent of m and in fact are just equal 
to unity except for a factor (—)™. If we use these facts 
we get for Wo; 


Yo23=4r x ih, (ikro3) F 11° (x) j(Kp1) 
l=0 


l 
x ( > V tm(Qe3) V im* (Qo1)) x2 


m=—l 


(14) 
ith 1 thro) F y,° (x) J1(Kp1) 
X (21+1) Pi(cosy)xdk, 


> 


l=0 


where ¥ is the angle between res and 9. It is clear from 
first principles that this form of the wave function is 
correct. For L=0 the wave function can depend only on 
the scalars formed from ro; and 9, that is, it can depend 
only on |@:|, |r| and rm-o;. The function (13) is a 
general function of just that form. 

One can make a similar analysis for other L and M, of 
course, using the appropriate Clebsch-Gordan coeffi- 
cients. We shall not do this here since this problem in 
any case is rather academic. For practical problems, one 
has to deal with the symmetry requirements involved in 
taking spin and statistics into account. Rather, in this 
paper we shall take the case L’=0 as leading to integral 
equations which are more or less representative, and 
discuss them in some detail. We turn to this now. 


IV. THE COUPLED INTEGRAL EQUATIONS: L?=0 


The discussion of the last section can be summarized 
by saying that for L°=0, the gim‘'?(%) that appear in 
Eqs. (11) must be of the form 


Lim (x) = F p(k) V im* (Q4), (15) 
where F;,‘) (x) is an arbitrary function of x. This result 
came about because the summation in Eq. (13) reduced 
in this case to a single term. We now wish to determine 
the functions Fy,‘'(k) so that the wave function 
satisfies the boundary condition Eq. (3), at each of the 
three boundaries r}2=4d, f23;=4, f13=4a. 

In general, one must expect that the forms of the 
functions F,,"‘” (x) are, as the notation indicates, actu- 
ally different for different i and 7. On the other hand, 
there may well be states for which all the Fy,“ (x) are 
of the same form, i.e., for which the total wave function 
is completely symmetric among all the particles. It is 
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clear that this will be true for the ground state, which is 
a state of maximum symmetry. In this section we shall 
write the integral equations for states of this symmetry. 
The same techniques we use can be carried out for other 
states with some additional complication. 

For a symmetrical state of the kind we discuss, the 
superscripts in the function Fy,‘ (x) are in fact super- 
fluous. We then simplify the notation by writing for this 
case 


Fu (x)= filk). (16) 


Now it suffices to apply the boundary conditions at only 
one boundary; it follows from the symmetry of the wave 
function that the conditions will automatically be 
satisfied at the other two boundaries as well. Consider 
first the boundary r2;=a. To satisfy boundary condi- 
tions there, we must express the functions Yi2 and yrs 
in the coordinates r23 and 9;. The general technique for 
this is to express these functions as Fourier integrals. 
Then all the space dependence, that is the dependence 
ON f12, 3 aNd F413, 2 is in an exponent, and transforming 
these functions to the coordinates r23, 92, corresponding 
to the boundary r2;=a, simply means shuffling the 


Yetyis= ™ 


l’=0 =—l! 
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variables in this exponent, using the linear relations (5). 
With this approach we get a set of coupled integral 
equations relating the f;(x) to integrals over themselves. 
These integral equations are homogeneous; hence they 
have solutions only for special values of K, the eigen- 
values, and have corresponding eigenfunctions. 

We begin to express ¥i2+ Ys in f23, 01 Coordinates. 
To do this we use the integral representation 


hy (ikrie) V ym’ (Qi) 


1 
~ (On)! J Wim’ (kya) explid-tie|da. (17) 


i (29)! 


On inverting this transform it is found that uy m(k,d) 
can be written in the form 


tym’ (Rd) = (2m) 40, (k,A) V ym’ (Qh). 


Expressions for the functions vy (,A) are derived in the 
Appendix. We put Eq. (17) and the similar equation 
involving 713 into Eqs. (11b, c) and use Eqs. (15) and 
(16) to get 


V 
J ffeoreaar 5. V vm * (Qe) V vm: (Qa) {expla (A- rite: 03) |+exp[i(A-ris+e- 2) |}dade. 


We use the transformation Eqs. (5) to rearrange the exponents 


exp[i(2-rie+e@- 03) |+expli(-rist+e-o2) |= expLior: (A—}e) |{expLires: (444+ fe) ]+exp[—ire;-(444+ 40) J}, 


and we go from the variables 2, @ to new variables x, «’ by the transformation 


~A—}0=K, 


Then we have 


Yet vis= L [ [seen ((K2+ gx?)4, | x+3x' |) 
l’=0 
i 


4 oe 


m’=— |’ 


To exhibit the spherical harmonics of Qe; we use 
] hibit tl I | harn {Q u 


L l 
exp[ife;: (x/+3x) |+expl—ires: (x +3) ]=4r DD 


l=0 m= 


and 
” 


+m Vm * (Qa) V vm (Qu 


m’=—| 


where 


Then we have 


virtva= ff teen cart ae 
V’=0 


, 
o=tk. 


V pm * (Qe) V vm (Qepye)) expLior: « ]{ exp[ires(«’+ 3%) |+expl—ites(%’+ 3x) }}dx’dx. 


iC 1+ ( —)"|jrlres| x’ +4!) V m(Qe3) V inc (hers hu)y 
l 


2l’+1 
= P (cos), 


bf utdx’| )(2’+1) Pv (cosp) explior: « | 


L l 
X{E DY LIA —) Di ilrre3| e+ 3 |) V im (Qos) V om* (Qe 440) }dudr’, 


l=0 m l 





1648 LEONARD EYGES 
and the total W=yy2+¥13+¥2; is just 


“ l 
v=> > 5 exp[ioi: % ]V im(Q23) fil) V im*®(Q,)hi(ikres) +i'L1+(—)] 


l=0 m=—l 


x fz fir («’)vy ((K2+ 3x) 4, | et de’ | (21 +1) Pv (cos) JV im* (Qe-440) jr(res| x’ + 4%] dx’ pdx. (19) 


=O 


This is the desired result. V is explicitly expressed in terms only of r23 and 9. 

Now we apply the boundary condition Eq. (3) at r2;=a-(0/dn) is, of course, 0/0r23 which we form by differ- 
entiating under the integral sign. We insert the expression (19) for WY into this equation. Then we equate the 
coefficients of the V im(Q2s) on each side of the resulting equation; and, since this boundary condition holds for all 
0; we equate the integrands of the dx integration. This gives the set of equations which holds for all / and m. 


=f file V im*(Q,)h (tka) +i'[1+(—)*"] 


K PCL fle’ )ov((K2+ fu?)4, | e+ 3%! | (2-1) Pv (cos) JV im* (Qe440) jr(a| x’ +3] de’ 


l’=0 


=ikf r(x) ¥ im*(Q)hi (ika) +é'[1+(—)"] 


x 
K PLL fle )ov (R24 fe?)4, | etx’ | )(2U’+-1) Pv (cosp) JV tm* (Qe444) | we’ + 3x] 71 (a] we’ +4] de’, 
l’'=0 
The prime on the functions 7; and h, denotes differentiation with respect to their argument, and we have replaced 
hi by hy. 

This set can be reduced somewhat further by noting that the integrals in them are functions only of | «|, | «’|, 
«-«’. Thus we can in the usual way imagine x is along the z axis. This simply amounts to a rotation of the vector x’ 
and since eventually we integrate over the whole solid angle of x’ this makes no difference in the final result. With 
x along the z-axis, Vim*(2,) becomes [(2/+-1)/4m ]!P1(0)=[(2/+1)/4a]} for all 1. Moreover, «,’=x’ cos6’ and 
V im* (Q4er44e) becomes just [ (2/+-1)/4a }!P1(Qe44.) =[(214+-1)/42 }!P (cosa) where a is the angle between x«’+3x 
and the z-axis. That is 

x: (x + 5x) x’ cos0’ + 3K 
cosa = : (21) 
[el fa’ 3u!  (4x2+K%’+xx’ cosd’)! 


Similarly we have 
x cosO’+ 3x’ 


cosp = =, (22) 
(4x?+K?-+ xx’ cos6’)! 
With these definitions, and the further definitions 
s= (3+ + xx’ cod’), (23) 
t= (4«+12+ xx’ cos6’)}, (24) 
the equations are 
—y} filx)hiika)+2mi'[1+(—)] 
x f Ld f(x’ )ov ((K2+ 3x”), 1)(21’+1) Pv (coss) JP (cosa) j (as) sin6’k’2d0’dk’ 
0 0 l’=0 (25) 


= ik fy(x)hy (ika)+2mi'[1+(—) 9] 


l’=0 


xf f es ful’ )ov ((K2+3x”)), 1)(2I’+1) Py (cos8) |P i(cosa)s j;/(as) sin8’’2d6'dk’. 
0 0 ‘= 
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UANTUM 


Setting /=0, 1, 2--- successively gives the infinite set 
of coupled integral equations in which each fi(k) is 
related to integrals over all other fi-(x). Since the set is 
homogeneous, or if one likes, since it is equivalent to the 
original Schrédinger equation, it can have solutions only 
for certain eigenvalues and corresponding eigenfunctions. 
That is, for each eigenvalue K, the form of all the func- 
tions f;(x) is determinate from these equations. 

Although this set of equations is an infinite one, it is 
natural to try the approximation of truncating it and 
keeping only a finite number of partial waves, i.e., of 
functions f;(x). The dimensionless parameters that de- 
termine the nature of the solutions of these equations 
and the degree of approximation required are of course 
Ka and ya since these are the only such parameters that 
enter the equations. These are in fact not independent, 
since the eigenvalue Ka is determined from ya. The 
magnitude of Ka affects the nature of the solutions of 
Eqs. (25) in that it sets the scale for some of the 
functions that appear in the integral equations as for 
example, the function /,(i(K?+{x*)!a) whose behavior 
is quite different according as Ka is much greater or 
much less than xa. 

Now we ask why, for given Ka, we can get away with 
keeping only a finite number of partial waves. Speaking 
generally, this comes about because of the asymptotic 
properties of spherical Bessel functions, namely, that 
those of high order are small when their arguments are 
small: j,(~)~0 for «<i. This is, of course, the same 
property that limits the number of partial waves in 
ordinary scattering theory. Physically, we imagine that 
any two particles have a relative ‘wavelength and that if 
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this relative wavelength is of the same order as the 
radius a, we need only a few relative partial waves. For 
a small number of particles we expect this relative 
wavelength to be roughly the deBroglie wavelength 
corresponding to the total energy of the system. Mathe- 
matically, we can see from Eq. (25), by a rather circular 
argument, how a limited number of partial waves might 
suffice. In the integrands of these equations are the 
functions f;(x). Now it seems likely that each of these 
functions is of finite range, i.e., is essentially zero beyond 
a certain x. If this is in fact the case this means that 
there is, speaking roughly, a maximum value of the 
argument (jx?-+x’?+«x’ cos6’)! of the Bessel functions 
which comes into play ; for « greater than this argument, 
the 7; vanish, which is consistent with the fact that the 
functions f(x) have finite range. Even if one has an idea 
of the number of partial waves required for a given Ka, 
in applying these equations it must be remembered that 
Ka is not given in advance, but comes out of the solu- 
tion. In practice, then, one must guess at Ka, choose an 
appropriate number of partial waves, and then having 
found Ka, verify that it was indeed permissible to 
neglect higher order ones. These points are discussed in 
somewhat more detail in Sec. Ve. 
V. THE TRUNCATED EQUATIONS 
a. The Equation for S-Waves Only 


In this section we discuss in some detail the solution 
of the infinite set Eq. (25), when we truncate them and 
retain only fo(x) as being different from zero. The 
equations then reduce to the single homogeneous inte- 
gral equation 


-y blialk+ 9 fe) dk’ ,. fo(x’) jo(as)vo((K2+ 9x’?)}, t) sna | 


1(K? Pf 342 -) thy’ (ial (K? +3 aK" ”) 4) fo(x) ae a foln’ )s ji (as) vo(( (x= +3 K?)), t) sind’dé’, (26) 


We have used jy’=— /, here. 

We reduce this equation in the following way. We 
make the substitution y=sin6’, introduce dimensionless 
parameters Ko, Yo 

Ka = Ko, 
and the definitions 


oon Ro NS EN" 


ya= Yo, 


yo—1 
foro] (438) 


2 ra 1 
= ff jortenerae f 
wT “9 1 


| cos(KoleH4 £24 £e’y)})+ (14 2¢”)! 


t= Ko(jé?+2+ Et’) (28) 


We use the definitions of the Hankel functions, the 
expression for v(,A) from the Appendix and introduce 
a new function 


fo*(&) = fo(é) exp| — Kol 1+38))] ‘( 1+ 3 £2) 4 


Then after some rearrangement the integral equation 
for fo*() is, written out in full, 


(29) 


Yojo(Kol g*+- i e+ ft’y)4)— Kol §/2-4- i f+ &t’y) May (Ko! §3+4- i + tt’y) ') 
£24 £4 EE’y+ 1 


sin(Ko(?+ 4 ag + &é’y)*) 
(30) 


(£2- 12 tt’y)} 
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When an eigenvalue Ko and corresponding eigenfunction /o*(€) of this integral equation is found then the (un- 
normalized) wave function of the system corresponding to this Ko and fo*(é) is 


v= F (112,03) +F (113,02) +F(r23,p1), 


where 


eran 


_ —1 Ko( i+ } (Ko 
F(rp)= f fr expL r/a ) ae) J sin opé/a) it 


Before discussing the solution of Eq. (30) it is inter- 
esting to note that it is the right-hand side of Eq. (30) 
that reflects the fact that we have a three body problem. 
This side comes from the transformation of ¥i2+y13 to 
23 coordinates. The left-hand side comes from y2; itself. 
If the right-hand side were zero, then the left-hand side 
would describe the system of particles 2 and 3 and if the 
center of mass of this system were at rest fo*(£) would 
simply be a delta function of &, i.e., the left-hand side 
would be zero except for £=0. Thus, for Eq. (30) to be 
satisfied for this case, for all ¢, including £=0, we would 
have to have 1+Ko=v‘o0, which is of course just the 
relation we found previously for the two-body problem. 


b. Qualitative Discussion of the 
S-Wave Equation 


In this section we shall present a qualitative discussion 
of the integral Eq. (30). First, we shall try to show why 
it can have solutions only for certain values of Ko, the 
eigenvalues. The point is clear in principle, of course, 
since the integral equation is equivalent to a Schrédinger 
equation which we know has solutions only for certain 
eigenvalues. Nonetheless, it is illuminating to see from 
a direct analysis of the integral equation how this comes 
about. We shall also try to find the qualitative form of 
the eigenfunctions and make a rough estimate of the 
eigenvalues, as a guide to the more refined treatment in 
the next section. 

Before we proceed, it is useful to have fixed in our 
minds the range of values of the parameters that enter 
the equation. First we consider Ko, the eigenvalue we 
seek. As we have pointed out before, the magnitude of 
Ko, among other things, determines the number of 
partial waves one must retain in the analysis. Ko cannot 
turn out to be too large if the truncated Eq. (30) is to 
make sense, that is if it is really justifiable to go from the 
coupled set of Eqs. (25) to the single Eq. (30). Just how 
large is a moot question, but we are probably safe in 
retaining only S waves when Ko is, say, smaller than 
about 3. We shall think of Ko as being in this range. 
This means that the values of yo we must consider are 








2YoJol }K o£) 
fo*(E)= F 
wt (1+3E?)!— (yo—1), 


Now we consider the y integral in this last equation, 


f dy f 
E+E E +1 JL, 


a £2 f, fo*(#’) Jol Ko8))L1+ Kol 1+3 ey f 


(P+E+ ILL Ee y/(B+E2+1)] 


rp 


necessarily close to unity. For we have found for the 
two-body problem the relation Ko=yo—1 showing Ko 
increasing with yo. Now, for yo=1.5 we already have a 
Ko of 0.5. And it must certainly be true that the ground- 
state binding energy is greater for the three-body than 
for the two-body problem; thus to avoid dealing with a 
Ko greater than 0.5 in the three-body problem we must 
take yo considerably less than 1.5. We shall in fact 
discuss Eq. (30) for yo between 1 and 1.2, although it 
should be noted that there are bound states of the three- 
body problem for yo less than unity. These are qualita- 
tively similar to those for yo greater than unity. 

The technique of the present qualitative discussion is 
to show that Eq. (30) is roughly equivalent to a degener- 
ate equation with a separable kernel. That is to say, we 
replace the integral equation with a crude approxima- 
tion to itself which approximate equation is, however, 
exactly soluble. In doing this we shall in general begin 
by simply stating the approximations we use and will 
discuss and, insofar as is possible, justify them only 
later. 

First consider the function 


*  jo(Ko(€?+42+ &€’y)*) 


which appears in the integrand of Eq. (30). This can be 
expanded by the addition formula 


jo(Kol(E?+42+ &8'y)!)= jo(8) 


=X (21+1)(—)"ju( Kok!) ji(Kok/2)Pi(y). (31) 
i=0 
Now for Ko small and neither é nor &’ too large we can 
take only the first term in this formula, i.e., we use 


Jol §) = jo( Kot’ )jo(Ko&/2 < (32) 


In like manner we could expand the other similar func- 
tions §7;(8), cost, sint/t. We shall, however, approxi- 
mate these even more crudely, i.e., still assuming that 
Ko& and Koé’ never get very large we replace these 
functions by their values at the origin, i.e., for = £’=0. 
Thus we set §7:(8) equal to zero and set cosé and sint/t 
equal to unity. The integral equation then becomes 


—__—— (33) 
1& Tee y+1 


dy 
(34) 
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We observe that ¢’y/(#’+ £+1) is always less than unity, and we can expand the denominator in powers of this 
quantity. We can then conclude that the integral (34) is represented to sufficient accuracy (within 10%) by the 


function 
2/(?+é?+1). 
With this approximation we reduce the integral Eq. (33) to 
- 4y0jo(Koé/2) f £? fo¥ (&’) jo(Kok’)LI+Ko(1+$8")! Jae’ 
w{(1+3£)'— (yo 1)/Ko} Yo i . 


(35) 








fo*(é) (36) 


Now the equation is almost separable. It can be made exactly so by one last and most drastic approximation. We 
consider the possibility of replacing the kernel 1/(#+£?+1) by the separable kernel 1/(1+£)(1+£”). This 
approximation is excellent when either é or é’ or both are small. It is worst when ¢~ é’ and both are large. Even so, 
for £ and £’ not too large it is tolerable. For example, for = ’=1 the exact kernel has the value $ and the approxi- 
mate kernel }. On the other hand the approximation rapidly becomes poor for & and ¢’ large. Thus for = ¢’=2 
the exact kernel is § and the approximate one 1/25. Nonetheless, if we make this approximation, crude as it may 


be, we have the great advantage of being led to an equation whose solution is transparent. 


4y0jo(3K of) 


fo*(¢) =——_— ore 
m{ (1+24)!— (yo—1)/Ko} (1+) 


fF Costas Rasen 


(37) 


1+é” 


The essential form of the function fo*(£) stands in front of the integral sign; all that remains to determine it com- 
pletely is to find the eigenvalue Ko. To do this we multiply the equation by 


# jo(Koé)L1+Ko(1+i2)#)/ (1+) 


and integrate with respect to & We get 





4y0 f £ jo(Ko€) jo($K of) [1+ K o( 1 +42)" ] i 
iS; —— _ —— =" 
‘ [ (1+3£?)!— (yo—1), Ko ](1+é)? 


For a given yo the eigenvalues are determined from 
this equation. It is illuminating to see qualitatively why 
at least one eigenvalue must exist. The right-hand side 
of Eq. (38) must equal unity for. the eigenvalue Ko. 
More generally, let us consider the magnitude of the 
integral on the right-hand side as a function of Ko, 
which we imagine for the moment to be a variable 
parameter. From our previous discussion of the two- 
body problem Ko must be greater than yo—1 (for Yo 
greater than one). If for a moment we imagine Ko close 
to its minimum value of yo—1, we see that the denomi- 
nator of the integrand almost vanishes for small £ Such 
a small value of Ko maximizes the integral; if in fact one 
calculates the integral for this case one finds that the 
right-hand side of Eq. (38) is greater than unity. On the 
other hand, for large Ko it is easy to see that the integral 
vanishes, essentially because of the oscillatory character 
of jo(Koé) and jo(Ko&/2). There is some intermediate 
Ko then for which the right-hand side is unity. Nu- 
merical calculation gives the results: for yo=1.1, 
Ko=0.22; for Yo A Ko=0.46. 

With these values of Ko at hand let us reconsider the 
approximations we have made in deriving the approxi- 
mate integral equation (38). For the sake of being 
specific we take the case yo=1.1. Then the function 


fo*(&), normalized to unity at the origin is just 
0.5 


fo*(é) = angen goat 
[1+ 3#)!—0.45](1+2) 

This function has the value 0.316 at &=1, 0.071 at &=2, 
and has dropped to 0.026 at &=3. Now, we have already 
discussed the approximations involved in truncating the 
expansion in Eq. (35). The next approximation was that 
of Eq. (32), the expansion of jo(8) retaining only the 
1=0 term. To get an idea of its validity we evaluate it 
for those parameters of interest for which it converges 
least well, say £=£’=3 and Ky=0.40, to choose an 
arbitrary value. Then the series (31) is 0.731—0.199P, 
+(0.0101P:—0.00024P,; which is not too bad for this 
worst case, since the convergence becomes rapidly better 
for either — or &’ smaller than 3. Less justified was the 
third approximation: the dropping of the term §7;(8) 
and the setting of cosé and sint/t equal to unity. Our 
reason for so doing was heuristic; if we had expanded 
them in the manner of Eq. (31) we would not have been 
led to a separable equation. However, we see that these 
last mentioned approximations make no qualitative 
difference in the integral equation. That is, these func- 
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tions are essentially functions which oscillate faster and 
faster the larger Ko is. As such they supplement the 
roles of jo(Kof) and jo(Ko&/2) as convergence factors, 
without changing anything essential. It is difficult to 
assess quantitively the last approximation, the replace- 
ment of the kernel 1/(#+ £+-1) by 1/[(1+&) (1+ €”) ]. 
Roughly speaking, it is good up to &=£’=1 and quite 
bad by the time it gets to = ¢’=2. Now the value of 
fo*(&) has dropped appreciably by &=1, but it is still 
not negligible at = 2, so the error in the kernel may be 
quite important. All told it is hard to assess the ap- 
proximations, except to repeat that they seem almost 
certain to be qualitatively correct and to give the right 
general behavior of the eigenfunction and the correct 
order of magnitude of Ko. 


c. Quantitative Discussion of the 
S-Wave Equation 


Having established the general features of the solution 
of Eq. (30), we turn to a more quantitative, although 
still approximate, discussion. We start in a way similar 
to the last section, by expanding the functions 7o(8), 
8j1(8), cost, and sint/t in Legendre polynomials in y and 
keeping only the first term in the expansion. These 
expansions are easily got by differentiating the expan- 
sion Eq. (31) for jo. Thus we have, for example, 

. 4 
*() 
r{(1+38)!— (9 


where 


A(E,E") = jo( Kok) jo( Kod’ 2| 


B(E,£') = jo( Kok’) jo( Kok/2) | (yo-2)+ 


LEONARD 


ees 1 ) Ko} 


jo( Kok’ 2) 


EYGES 


_ fd sin(at) d 
cosi=| ec | -|-aiu(a) 
da t a=1 da a=1 


1 x 
= (aE (214-1)(-)!4Aa8) jd08'/2)PA9)) 


da l=0 | oust 
This gives, keeping only the /=0 term, 


cost = jo(K ot) jo Koé’, be ) “pe: 3 Kot’ jo( Kok) ji(Kot’/2) 
— Kot jo(Kot’/2) j:(Koé). 


In a similar way we can expand 87,(8) and get 
871(8) +}. Kokjo(Kok’) j(Koé/2) 
+ Kok’ jo(Koé/2) j1(Koé’). 
Putting these approximate expansions into Eq. (30), 
we can do the y integral; this integral is: 


. dt’ 2 
f ses eS eee (39) 
E+E +E VEL E+E 

Here we have used the same approximation as in the 
qualitative discussion of the last section, and as we have 
seen there the average error this introduces into the 
kernel is of the order of a few percent. The integral 
equation (30) is now reduced to 


; P4241 


f fo*(E)A (EE) BEE) Ede’ 


cos(Kyé’/2) cosKok 


+-—- —1+Ku(t3e9)|, 
Jo( Koé) 


cos( Kot/2) cos(Koé’) 
Jol K o&/ 2) jo( Kok’) 


From Eq. (40) we see that fo*(&) is given by the function standing in front of the integral times that function of 
£ represented by the integral itself. We discuss this latter. We observe first that for the values of £ and &’ which we 


are likely to be interested in 


and which are, from the results of the previous section between, say, 0 and 3 for each 


variable—the functions A (£,£’) and B(é,£’) are slowly varying compared to the function 1/(#@+£?+1). The main 
£ dependence of the integral in Eq. (40) derives from this factor. This suggests that a reasonable approximation to 


the true solution is a function of the form 


fo*( &) ~ 


((1+28)!— (0 1)/Ko) (182) 


1 
(41) 


where ¢ is a constant still to be determined. If we try this and put it into Eq. (40) we get the following equation to 


be satisfied for all &. 


x 


Since the functions A (£,£’) and B(é,¢’ 


4 A (Ef) B(E,£) Edt! 
r 0 (C1488)! (yo—1)/Ko} (1 tet) (€2-+¢2-+1) 


(42) 


are slowly varying compared to the other factors in the integrand, we 


shall in the argument that follows imagine them as evaluated at some average value = &. With this approximation, 
and a little rearranging of the denominator Eq. (42) becomes 


4 


4 : (&,f’) B(E,E) £7’ 
i= ate) f —____—— — 
7 0 {(143E")'— (yo—1)/Ko} [14+ 82/ (€2+1) (14-82) (1+ €€”) 


(43) 
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Now the left-hand side of this equation is a constant; 
the right-hand side ostensibly depends on ¢. How is this 
to be reconciled? To answer this question let us antici- 
pate a bit, and take e of the order of 3. Then we observe 
that the factor 


£°/{((14+-28)#— (yo— 1)/Ko](1+-£?) (1+e€)} 


is one which is peaked fairly sharply around ¢’=1. If it 
were very sharply peaked to the extent, say, of being a 
delta function, at some £’= £0’, we could do the integral 
and we see that by choosing e=1/(&°+1) the integral 
equation would be satisfied exactly, i.e., the dependence 
of the right-hand side on ¢ would cancel out. To the 
extent that the integrand has a finite width, the pro- 
posed solution (41) is necessarily approximate and the 
right-hand side is not strictly a constant independent of 
£, but we shall see that it is so to a good approximation. 

Although it is clear that, since the integrand peaks at 
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Taste I. As a function of yo: the eigenvalue parameter 
Ko=m|E| /h? and the parameter ¢ which appears in the function 


0° (&) =1/{((1+38)4— (yo— 1)/Ko](1+e8)}. 


vo Ko € 
0.27 
0.30 
0.40 


0.27 
0.50 
0.60 


1.00 
1.10 
1.20 


t’ about unity, ¢ is about 3, we would like to determine 
€ more closely and of course determine the eigenvalue 
Ko as well. To determine these two constants we must 
have two equations; we can get these by demanding 
that the integral Eq. (40) holds exactly at two points 
and if the representation of Eq. (41) is reasonable it 
will hold approximately at intermediate points as well. 

The requirement that the integral equation be satis- 
fied at =0 leads to the equation 


(44) 


This is an equation very similar to the eigenvalue Eq. (38) we found previously and a similar qualitative discussion 
applies, remembering that e~ 3. It is convenient to choose x= 2 as the other point at which we demand the integral 
equation be satisfied. At this point the function fo* has dropped appreciably from its value at the origin, without 
being excessively small. The condition that the integral equation be satisfied exactly at «=2 gives the equation 


A 


4 oo 
1=-(1+40 f saunas : | 
ui 0 (LA RE”)! (yo 1)/Ko} (1+) (S+E”) 


From Eq. (44) we get a set of possible allowed values of 
Ko for a given e, i.e., a curve in the Ko, € plane. From 
Eq. (45) we get a different curve; the intersection of 
these curves gives then unique values of Ko and ¢, and 
we must then verify that these values lead to a function 
which satisfies the integral equation at least approxi- 
mately at other points than the two chosen above. 

The results of calculations along these lines are given 
in Table I. Because of the various approximations, in 
particular that represented by Eq. (39) we do not 
consider the eigenvalues Ky to be accurate to, say, 
better than 10 percent. 

The results of Table I are quoted for a minimum value 
of yo equal to unity, because our main purpose is to 
compare the three body binding energy with the two- 
body and the two-body system is not bound for ¥o less 
than unity. As we have mentioned before, there are in 
fact solutions of the integral Eq. (30) for yo less than one 


(E,2) BEE) Ede 


(45) 


(although not for yo too small). From Table I we see 
that the ratio of three-body binding energy to two-body 
binding energy is infinite for yo= 1, nine for yo= 1.1, and 
four for yo= 1.2. This seems reasonable. 

There is one final point we would like to discuss, 
namely, to show qualitatively why the higher order 
partial waves we neglected in truncating the general 
integral Eqs. (25) are in fact negligible for small Ko. To 
discuss this, we observe that if the amplitudes of these 
waves are in fact small we can estimate them from the 
integral Eqs. (25). From these equations we see that the 
factor [1+ (—)"] precludes relative P waves. Thus the 
next amplitude after fo(«) which we have to discuss is 
the D-wave one, f2(x). We get an approximate equation 
relating fo(x) to fo(«) from Eqs. (25) by setting /=2, 
‘=(), i.e., by neglecting all partial waves except S and 
D. This gives 


nf f fo(x’)v0(k,t) P2(cosa)[y j2(as) +5 jo’ (as) |x’? sind’d6’dx’ 
0 0 


fal) = —————— 


As before, we introduce dimensionless parameters Ka= Ko, ya 


ikhs' (ika) +~yh2(ika) 


xa/ Ko, &'=«'a/Ko, and define 


Yo, new variables & 








1654 LEONARD EYGES 


a new function 





fo(t) expl—Ko(1+$2?)*] 


2*(£)= 
sites (1+9£)! 


Then we get 





* mee “ P2(cosa)[Yoj2(Ko8) +Ko8j2' (Kos) Jé7dé'dy 
wv¥o 7_\ + £-+ Et’y+1 


f:*(Q=—_—___— as 
[1+ (4/p)+ (9/p*)+ (9/p') ]—rL (1/P)+ (3/P9)+ (3/9)] 


2 f f 7 f eee eee 
#(py Ne ae 





where p= Ky(1+ $£)!. Now we can see the basic reason that, for small Ko, f2*(£) is small. It is that we have to 
integrate in this equation fo*(£), a function sharply peaked for small ¢, with j2’(Ko8) and j2(Ko8), which are func- 
tions which vanish for their argument small. Thus for Ko small the overlap integral is small and f.* is small com- 
pared with fo*. 


APPENDIX: INTEGRAL REPRESENTATION FOR hi (ikr) ¥in(Q) 


We want to represent the function hi(ikr) Y 1m(Q), defined by Eq. (9) of the text, as a Fourier integral. We write 


1 
hy(tkr) V im(Q) = (2m)! fvrntda9 exp[ia-r]dd. 


2x 


Inverting this we get 


1 , 
4 im( ky») =—— fhutieey ¥en exp[ —74-r |dr 
(2zr)! 
1 - 
“Onl hilikr) ¥ im(Q) (4 d vm’ ( —1)" jy(Ar) V pm (QY) V vm *(Q))rdrdQ, 
ug 


or 


thim(k,») = (2 mr) —1)'V mia) f hilikr) j(dr)rdr. 
0 


We define a function 2;(&,A) by 
U.1m(R,d) = (2) #01(k,A) Y im(Qy). 


whence 
(~#)? 2” | 
v,(k,A) =——— hi(ikr)7i(Ar)r'dr 
r 0 
(—i)! a 
= — f hy(ikr) 7(Ar)r°dr. (A-1) 
2dr? J, 


The integral in Eq. (A-1) is an example of a well-known general integral® 


x? 

J secs)gu(px)stas= pag filer) er-sBx) —afr-s(ax)gr(Be) | 
a’?—8? 

where f, and g, can be spherical Bessel, Neumann, or Hankel functions. This gives 


—4\ba2 
01(k,A) =—————_[—hr, (ik) j 1-1 (Aa) —ikhy(ika) j,(a) J. 
2n?(A2-+ 2) 


6 See, for example, P. Morse and H. Feshbach, Methods of Theoretical Physics (McGraw-Hill Book Company, Inc., New York, 1951 5 
p. 1574. 











QUANTUM-MECHANICAL 


The case /=0 is, written out explicitly, 


THREE-BODY PROBLEM 
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v0(R,A) =_— 
2m*knr 


e-*@ (\ cosAa+ sinda) 


(R?-+)?) 
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The cross sections for the elastic scattering of protons by nitrogen have been measured in 105 angular 
distributions ranging in angle from 53° to 155° and in energy from 1.05 Mev to 2.93 Mev. Resonances have 
been observed at 1065+5 kev, 1557+6 kev, 174347 kev, 1803+7 kev, 2344+10 kev, and 2468+10 kev. 


INTRODUCTION 


HE elastic scattering of protons by nitrogen in the 
energy range covered by electrostatic accelerators 
shows, in addition to a number of well-established reso- 
nances, a large background of potential scattering. It 
has been reported from 0.6 Mev to 4.1 Mev in several 
recent papers.'~ Spin and parity assignments have been 
found for the resonances at 1.065 Mev and 1.557 Mev. 
The lack of knowledge of the background scattering, 
on which the emphasis of the present work rests, has 
been the principal obstacle to establishing the assign- 
ments for the others. An early report® that the P-wave 
potential phase shifts were small below 2.0 Mev has 
been found incorrect. This conclusion arose from an 
incorrect normalization for the cross sections, which 
were about 10% too low. The cross sections of Tautfest 
and Rubin® are approximately the same as those of 
reference 5. Hagedorn ef al.? have compared these 
results with the more recent data and show that they 
are consistently low. The results of the present work 
are available in tabular form in an unpublished report.’ 
The present data, which are in agreement with the 
1 Bolmgren, Freier, Likely, and Famularo, Phys. Rev. 105, 210 
(1957). 
2 Hagedorn, Mozer, Webb, Fowler, and Lauritsen, Phys. Rev. 
105, 219 (1957). 
3 Bashkin, Carlson, and Jacobs, Bull. Am. Phys. Soc. Ser. II, 1, 
212 (1956). 
4Olness, Vorona, and Lewis, Bull. Am. Phys. Soc. Ser. II, 2, 
53 (1957). 
5 Gove, Ferguson, and Sample, Phys. Rev. 93, 928(A) (1954). 
6G. W. Tautfest and S. Rubin, Phys. Rev. 103, 196 (1956). 
7 Ferguson, Clarke, Gove, and Sample, Atomic Energy of 
Canada Report PD-261, 1956 (unpublished). 


recent data! cannot be satisfactorily analyzed with 
S-waves only and thus imply the presence of P-waves 
and possibly higher ones. A phase shift analysis of these 
results which includes P-waves is described in the 
following paper.® 


APPARATUS 


The proton beam for the work was supplied by the 
Chalk River electrostatic accelerator. The upper limit 
to the proton energy available was 3 Mev. The lower 
limit was about 1 Mev, which was the minimum energy 
where a resolved peak in the spectrum from the scintilla- 
tion proton counter could be obtained. The energy of 
the proton beam was measured by deflecting it through 
90° in a uniform magnetic field which, in turn, was 
measured and controlled by a proton gyromagnetic 
resonance detector. 

The scattering chamber is shown in Figs. 1 and 2. 





acon 

J 

71 
A 





Fic. 1. Gas scattering chamber. A, B, and D are collimating 
apertures. Z and F are thin nickel windows for beam entry and 
exit. F, G, and H are the beam catcher assembly. J is the counter 
collimator and J a small scintillator for particle counting mounted 
on the rotating cover. 


8 A. J. Ferguson, following paper [Phys. Rev. 115, 1660 (1959) ]. 
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Fic. 2. Vertical section of the gas scattering chamber perpendicular to the beam axis showing the 
photomultiplier, AK, and counter collimator mounting. 


It is essentially a large scattering chamber similar to 
those described by Blair et al.° and Herb ef al.” Its 
main feature is the use of a large rotating lid which 
allows electrical connection to be made directly to the 
photomultiplier. This avoids the difficulty of electrical 
discharges from the high voltage lead to the counter 
occurring in the low pressure target gas. 

Figure 1 shows a horizontal section of the chamber at 
the beam axis. Apertures A and D collimate the incident 
proton beam. These are circular holes of diameter 
0.089 in. in 0.04 in. thick tantalum sheet and are 
separated by 20 in. Aperture B is a “scraper” to 
suppress particles scattered by the edges of the first one. 
A gas tight metal foil is located at E about one in. from 
the center of the chamber. This foil separates the target 
gas from the vacuum of the accelerator. The foils used 


“a. + 8 T err « S te 
\ 
% 
ih & 
dao* x N Meters 
dos : y | 
‘ ~~ a as paaeed 
A_-— —~Jf | 
N mS } 
oO us 2.5 30 


Ep Mev 
Fic. 3. Yield curve at the maximum angle of scattering. The 
yield is normalized to that from a pure Rutherford scatterer. 
E, is the energy of the incident protons in the laboratory system. 
* Blair, Freier, Lampi, Sleator, and Williams, Phys. Rev. 74, 
553 (1948). 
Herb, Kerst, Parkinson, and Plain, Phys. Rev. 55, 998 
(1939). 





were of nickel, 40 ug/cm? or 80 wg/cm? in thickness. 
Earlier work with 250 ug/cm? mica foils indicated that 
these introduced a straggling of the beam of about 
15 kev, which is intolerable for the present work. Even 
the thin nickel is believed to cause 1 or 2 kev of 
straggling. C is a flap valve which permitted the 
chamber to remain under vacuum when disconnected 
from the beam tube of the accelerator. 

The counter assembly is mounted on the rotating lid. 
It consists of a small potassium iodide scintillator J 
about ;’¢ in. thick by } in. square mounted on a Lucite 
post. The light pulses from this are picked up by a 5819 
photomultiplier, K, shown in Fig. 2. Below the scintil- 
lator is a polished aluminum hemispherical reflector to 
improve the light collection. J is the counter collimator 
which defines the target region by the intersection of its 
acceptance angle with the beam. The two extreme rays 
defined by the collimator in the plane of scattering 
have an angular separation of 6.4°. The mean spread in 
scattering angle will be somewhat less than this. An 
Q-ring vacuum seal between the glass envelope of the 
photomultiplier and the housing for it permits the 
photosensitive cathode to be near the scintillator. The 
counter collimator was aligned optically. For this, the 
counter and beam catcher were removed and the lid 
was rotated to the zero angle. The alignment of the 
collimator could be checked and adjusted by sighting 
through it into the beam collimator. The zero setting 
was found to be reproducible to within +0.1° by this 
procedure. The angular distribution of a pure Ruther- 
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ford scatterer provided another check of the accuracy 
of the angle measurements. The lid is supported at M on 
a complete ring of ball bearings. Just within this is the 
O-ring seal to maintain the internal vacuum. Two hand 
wheels, Z, turn the rotating lid through worms and 
sprockets that engage a gear cut around the top flange 
of the chamber. 

After traversing the target region, the proton beam 
enters the beam catcher through another nickel 
window F, of thickness 200 yg/cm?, situated 2 in. 
from the center of the chamber. The catcher has two 
electrodes in it. The inner, H, is a Faraday cup to 
collect the beam, the end of which is a quartz disk 
lightly aluminized on one side. This made the proton 
beam visible through the Lucite end plate of the 
catcher and was helpful in getting it lined up along the 
axis of the collimator. The outer, G, is an electrode 
biased to —300 v to suppress the escape of electrons 
from the cup and the collection of electrons from the 
gas and window. The entire catcher assembly was 
evacuated by a small fractionating oil diffusion pump. 
A current to the negative electrode was observed which 
was normally about 5% of the current to the collector 
cup. Since ion currents in the electron trap can affect 
the measurement of the beam current, a test was made 
for this source of error. A comparison was made between 
the counts of scattered particles at a fixed angle and 
energy and for a specific accumulated proton current 


first when the beam catcher was evacuated by the 
diffusion pump and second with it at forevacuum 
pressure. The difference between the counts was neg- 
ligible, indicating that the contribution from ion cur- 
rents was not significant. 

Calibration of the energy scale was done by placing a 
thick LiF target in the center of the chamber and 
evacuating all gas but retaining the entry foil at £. 
Then from the observed threshold of the Li’(p,n)Be’ 
reaction, the energy scale provided by the beam 
analyzing magnet was calibrated, using 1.882 Mev for 
this threshold.'! A correction was made for energy loss 
in the gas between the window and the target region. 
The energy of the 1.743-Mev resonance was fixed in 
this scale and it was then used as a secondary standard 
in other runs. The accuracy of the energy is about 0.3%. 
The stability of the beam for the duration of an angular 
distribution measurement was +2 kev. 

The pressure of the gas in the chamber was measured 
on a manometer containing buty] phthalate vacuum oil. 
Pressures ranged from 20 cm of oil used in a few early 
measurements down to 4 cm and 2 cm in the later work. 
The error of reading the manometer was deemed to be 
0.1 mm of oil, which constitutes generally the largest 
source of error in the work. Typical values for the 
errors arising from various sources of uncertainty are: 
counting statistics % 


2%, pressure measurement 3%, 


1 Herb, Snowdon, and Sala, Phys. Rev. 75, 246 (1949). 
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5. Yield curves at six angles near the 2.344-Mev 
and 2.468-Mev resonances. 


argon calibrations 3%, beam measurement 2%, gas 
impurities }%, and the angle measurement 3%. The 
tabulation of reference 7 gives the estimated error of 
each point. 


EXPERIMENTAL RESULTS 


The bulk of the experimental data consists of 105 
angular distribution measurements taken for incident 
proton energies ranging from 1.05 Mev to 2.94 Mev. 
There are, in addition, a number of yield curves taken 
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Fic. 6. Typical angular distributions between 1.05 Mev and 
2.32 Mev. The solid line is the theoretical curve from the phase 
shift analysis of reference 7. 


CLARKE, 


AND GOVE 
at the maximum available scattering angle of 155.2° 
(center-of-mass system). All of the cross sections have 
been normalized to the Rutherford differential cross 
section as described below. The accuracy of the results 
is generally about 5%. A yield curve at the largest 
angle is shown in Fig. 3. Open circles denote the data 
taken from yield curves, and crosses those taken from 
angular distributions. The energy of the incident pro- 
tons, E,, is always given for the laboratory system. In 
Figs. 4 and 5 are plotted data for the region near 1.79 
Mev and 2.4 Mev as a set of yield curves for the center 
of mass angles 53.1°, 73.9°, 94.1°, 113.9°, 133.1°, and 
155.2°. These show the constant angle contours for the 
resonances 1.743 Mev, 1.803 Mev, 2.344 Mev, and 
2.468 Mev. Figures 6 and 7 show some typical angular 
distributions. The solid lines are theoretical curves 
based on the phase shift analysis of the following paper. 
The angular range covered was from 53.1° to 155.2° 
(c.m.) in approximately 10° steps for most of the runs, 
although a few runs extend in angle down to 27.8°. In 
measuring a distribution, the procedure was to start 
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Fic. 7. Typical angular 
distributions between 2.4 
Mev and 2.93 Mev. The 
solid line is the theoretical 
curve from the phase shift 
analysis of reference (7). 
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at the maximum angle and to proceed to the minimum 
taking every second angular position and then to return 
to the maximum filling in the omitted steps and finally 
repeating the measurement at the maximum. This 
procedure was designed to reveal systematic changes 
that occurred in the course of a distribution, and a 
number of runs have been rejected from inconsistencies 
revealed in this way. 

The yield of scattered particles in the laboratory 
system may be written 
FntiA dor 

G dQ’ 
where F is the incident flux in particles per second, 
n the number of scattering centers per cm’, / is the target 
thickness in cm, A=da/dor, the ratio of the actual 
differential cross section to the Rutherford cross section, 
G is the ratio between the yield in the center-of-mass 
system and the laboratory system,’ and dop/dQ is the 
Rutherford scattering cross section. 


Y= 


12 J. P. Marion and A. S. Ginzbarg, Atomic Energy Commission 
Report NP-6241 (unpublished). 





BLASTIC 


We write /=w/sing, where ¢ is the scattering angle 
in the laboratory system and w the width of the beam 
accepted by the detector. We have also 


9 


dor 


Zee \?2 
<-( ) csc4(3/2), 
dQ 2M? 


where Z is the charge number of the target element, 
M the reduced mass of the system, v the velocity of the 
incident particle relative to the target and # the scat- 
tering angle in the center-of-mass system. It is evident 
that the quantity (Y/A)E’ sing sin‘(8/2) is indepen- 
dent of angle and the energy, E. By substituting in the 
scattering chamber a gas known to obey the Rutherford 
law, this constant, and consequently A, can be evalu- 
ated. If the quantity YE°G sing sin*(#/2) is independent 
of E and 9, then this is virtually conclusive evidence 
that pure Rutherford scattering is obtained, i.e., that 
A=1, since the departures that can occur vary irregu- 
larly with energy and angle. Argon has been found 
satisfactory at energies below 1.9 Mev. We have ob- 
served a cluster of narrow resonances at 1.9 Mev and a 
broad one at 2.5 Mev, making these regions inacceptable 
for calibration purposes. Figure 8 shows a typical 
angular distribution for argon at 1.573 Mev in which 
the angular dependence has been corrected as in the 
foregoing. 

The resonances observed in this work have energies: 
(1065+5) kev, (1557+6) kev, (17437) kev, (180347) 
kev, (2344+10) kev, and (2468+10) kev. These agree 
well on the whole with the recent work.!” 

There is some disagreement with Hagedorn ef al.? 
regarding the width of the 1.55-Mev resonance. On the 
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Fic. 8. Angular distribution for argon gas at 1.573 Mev. Each 
point has been corrected by the appropriate angle dependent 
factor required in normalizing to the yield of a Rutherford 
scatterer. The errors shown are those due to counting statistics. 


basis of fits at 154°, 125°, and 90° these workers give 
34+4 kev for this width. Gove, Ferguson, and Sample’ 
reported 53 kev based on a fit at 155° only. Both of 
these have ignored the variation of the nonresonant 
scattering. Some attempts have been made to fit the 
155° cross section data assuming that the S-wave 
potential scattering is given by the average course of 
the 6*/?+ phase shift curve derived in the following 
paper.® These have not been satisfactory for either 34 
kev or 53 kev for the width, but they suggest that the 
first figure is too low. 
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A phase-shift analysis has been made of a set of angular distributions for the elastic scattering of protons 
by nitrogen in the energy range 1.0 Mev to 3.0 Mev. Two independent S-wave phase shifts and one P-wave 
phase shift of hard-sphere type have been assumed. Moderately good agreement with the nonresonant 
scattering below 2.3 Mev and with the scattering at the 4+ resonance at 1.557 Mev has been obtained, 
indicating that the gross features of the scattering can be represented in this way. Between 2.3 Mev and 
3.0 Mev the fits are poor. The results indicate a broad + resonance at 2.32 Mev with a width of 0.55 Mev. 


INTRODUCTION 


PHASE-SHIFT analysis of the low-energy elastic 

scattering of protons by nitrogen is of interest in 
two ways. In the first place it appears to be necessary 
for the establishment of spin and parity assignments for 
some of the observed resonances.' In the second place 
it will provide some information about the average 
interaction between a nucleon and the N™ nucleus 
through the behavior of the nonresonant scattering. An 
early report’ stated that contributions from P waves 
were small below 2.0 Mev. However, subsequent work 
showed that the experimental cross sections used there 
were low by about 10%, and with the revised normali- 
zation good fits using S waves only could not be 
obtained. The next logical step in studying the problem 
is to introduce P waves. However, due to the spin of 
nitrogen, the introduction of P waves with full general- 
ity involves a large number of arbitrary parameters, 
substantially more, in fact, than can be obtained from 
an analysis of angular distributions. It is thus necessary 
to introduce some simplifying assumption. The present 
work has been carried out using the assumption that 
all of the phase shifts describing the P waves are equal. 
Equality between the two S-wave and between the 
various P-wave phase shifts will result if the scattering 
is described by a potential well without spin-orbit 
coupling. The assumption thus has some resemblance 
to the “cloudy crystal ball” model*~® although it does 
not include the complex part. Since this part serves 
to transfer some of the particles from the pure ‘shape 
elastic” process to “compound elastic” and reaction 
processes, its neglect will be justified if the latter are 
small or absent. For the present case particle reactions 
are energetically impossible. Compound elastic processes 
will be represented by transitions between different 
channel spins which result from off-diagonal elements 
of the R matrix. It is argued by Teichmann and Wigner® 


! Hagedorn, Mozer, Webb, Fowler, and Lauritsen, Phys. Rev. 
105, 219 (1957). 

2 Gove, Ferguson, and Sample, Phys. Rev. 93, 928(A) (1954). 

3 Melkanoff, Nodvik, Saxon, and Woods, Phys. Rev. 106, 793 
(1957). An extensive bibliography is given here. 

4 J. Bowcock, Proc. Phys. Soc. (London) A70, 515 (1957). 

5G. E. Brown and C. T. de Dominicis, Proc. Phys. Soc. 
(London) A70, 668 (1957). 

6 T, Teichmann and E. P. Wigner, Phys. Rev. 87, 123 (1952). 


that such terms will be small in the nonresonant regions. 

Independent phase shifts for the two states formed 
by S-waves have been used. Although this departs from 
the requirements of a simple well, any differences found 
between the phases can be attributed to compound 
elastic effects and the procedure has consequently 
somewhat greater generality. An S-wave resonance will 
be fitted with these parameters, and if the fits are good, 
they will identify the spin of the state. 

The experimental data used are those described in 
the preceding paper.’ Fitting of the angular distribu- 
tions has been done by the least-squares method, the 
required computations being carried out on the 
Datatron computer at Chalk River. While the em- 
phasis of the analysis is on the nonresonant scattering, 
the fits have been carried through most of the resonances. 
The immediate neighborhood of the resonances at 
1.746 Mev and 1.803 Mev have been omitted as these 
are unquestionably not S-wave ones. Below 2.3 Mev 
the results are reasonably good, although with some 
qualifications discussed later in detail. Above this they 
are poor and suggest that a simple potential well is an 
inadequate description of the situation. 


THEORY 


A discussion of charged-particle scattering analysis, 
together with remarks on the number of parameters 
involved, has been given by Christy.* For the case of 
nitrogen in the energy range considered, reactions do 
not occur, and it is found that nine parameters are 
required to include S and P waves with full generality. 
Of these, seven are involved with the P waves. If 
these are known to be absent, the formulation in terms 
of S waves is simple, and fitting by hand calculation 
relatively easy. That this simplification is inadequate 
has been noted so that a single phase shift has been 
introduced to represent the P waves. Three adjustable 
parameters are thus used, two S-wave phase shifts for 
the 1/2+ and 3/2+ states and the single phase shift 
for the P waves. 

It is of interest to consider the information content 
of charged particle scattering. When S and P waves are 

7 Ferguson, Clarke, and Gove, preceding paper [Phys. Rev. 
115, 1655 (1959) ]. 

§R. F. Christy, Physica 22, 1009 (1956). 
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included in the most general way, the ratio of the dif- 
ferential cross section to the Rutherford cross section 
is found to be a linear combination of the seven linearly 
independent functions of angle (1—y)*Po, (1—s)?Pi, 
(1—y)*P2, Mo, No, M;, and N, which are defined below. 
An identity between the coefficients reduces the number 
of effectively independent functions to six. Clearly no 
more than six parameters describing the scattering can 
be determined from an angular distribution. However, 
it has been found that the situation is not as favorable 
as this. Although the six functions are theoretically 
linearly independent, when the angular range is 
restricted approximate linear dependencies may arise. 
Such a situation has been found near 2.3 Mev where it 
was found that two of the functions could be repre- 
sented to an accuracy of about 1% by linear combina- 
tions of the other four in the range 50° to 150°. Here 
it is effectively impossible to determine more than four 
parameters. This degeneracy may be removed by 
extending the angular range and may disappear at 
other energies. Its occurrence emphasizes the desir- 
ability of measurements over as wide a range of angle 
as possible. 

The theory used is based on the work of Blatt and 
Biedenharn’:” in which the summations over magnetic 
substates required when polarization is not observed is 
effected with the aid of Racah coefficients. Under the 
assumptions described above, the scattering cross 
section is given by 


da (u)/dor(u) = 1+2M o(sin?6'/2++-2 sin*6*/?+) 
— 2No(sind/?+ cosé'/*++- 2 sind*/*+ cosé*/**) 
+18M;, sin*¢;— 18, sing; cos; 
+ (1—p?)nPo(p) (3 sin?6/++-§ sin’6*/+ 
+3 sin’p;)+2(1—p?2)n-?P (uw) sing,[sind!*+ 
X cos(¢i+ 2yi—6"*+) +2 sind*/* cos (git 21-69) ] 
+6(1—p?)n?P2(u) sin’. (1) 


Here do(u)/dor(u) is the ratio of the observed differ- 
ential cross section to the Rutherford cross section, 
evaluated in the center-of-mass system, u is the cosine 
of the angle of scattering, 6'/?+ and 6*+ are the phase 
shifts for the states of spin and parity 1/2+ and 3/2+, 
respectively, and ¢; is the phase shift for the P waves. 
n=Ze?/hv, Z being the atomic number of the target, 
e the electronic charge, # Planck’s constant, and v the 
relative velocity of proton and target. yi=tan™'y. 
Po(u), P1(u), and P2(u) are the Legendre polynomials 
of degree 0, 1, and 2, and Mo, No, Mi, and Nj, are 


9]. M. Blatt and L. C. Biedenharn, Revs. Modern Phys. 24, 
258 (1952). 

1 A, J. Ferguson, Atomic Energy of Canada Limited Report 
No. 157 (unpublished). This extends the charged-particle scat- 
tering formulas of Blatt and Biedenharn to the case where a 
general phase-shift representation is used for the nuclear wave 
functions. Complicated cases such as those where resonances 
overlap may be treated in this formulation. The formulas contain 
some small errors. 


functions" of angle and energy given by 
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The fitted parameters are 6'/*+, 6*/?+, and ¢;. The 
fitting has been done by the least-squares procedure. 
Since the phase shifts occur nonlinearly, an iterative 
procedure is required. The standard linearizing method” 
was set up initially, but this procedure was found not 
to converge on most of the distributions. Convergence 
was secured by computing a correction to the normal 
matrix involving the second derivatives of the theo- 
retical cross sections with respect to the unknown 
phases. It was found that for those cases that required 
the correction term, the convergence was to a point 
representing a singularity of the uncorrected matrix. 
This clearly is the reason for the failure of convergence 
in these cases, and it is believed that it arises from the 
fact that an adequate fit is not available. This point 
will be discussed subsequently. 

The significant correlations which exist between the 
errors in the points of any one distribution due to the 
uncertainty of normalization have been ignored. To 
accommodate these would be a substantial complication 
in the routine and the results would probably not be 
significantly altered. Two solutions for the phase shifts 
are ordinarily expected; in the cases of the singular 
normal matrix noted above, the two solutions de- 
generate into one with 6!/+= 69+, 

5'/2+, 63/2+, and ¢; are given as functions of £ in Figs. 
1, 2, and 3. When the two solutions for 6'/2+ and 6*/?+ 
are distinct, the preferred one is shown as a dot and 
the second as a cross. The remaining points where the 
solutions are coincident are shown as dots. Typical 
angular distributions calculated from these phase shifts 
are indicated by the solid lines in Figs. 6 and 7 of the 
preceding paper.’ No points are shown at the sharp 
resonances at 1.067 Mev, 1.746 Mev, 1.803 Mev, 
2.343 Mev, and 2.458 Mev. At the S-wave resonance at 
1.067 Mev the measurements have been distorted by 
inadequate energy resolution and the phases were not 


These differ slightly from the similar functions given in 
reference 10. 

12 See, for example, E. T. Whittaker and G. Robinson, Calculus 
of Observations (Blackie and Sons, Ltd., London, 1932), Chap. IX, 
Sec. 109. 
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Fic. 1, Energy dependence of 5'/*+. The crosses are alternative 
solutions. The series of solid and dashed lines labelled r=0, —0.6 
and —1.0 represent lines of constant logarithmic derivatives. The 
solid line extending from 1.8 Mev to 2.9 Mev is the phase shift 
used to fit the broad resonance at 2.3 Mev. 


meaningful. The four higher ones are believed not to 
be S-wave resonances. 

The connection between the causality principle and 
the resonance formulas is discussed by Wigner." This 
imposes a lower limit on the energy derivative of the 
phase shift which is governed by the size of the scat- 
tering system. Wigner gives for the derivative of this 
phase shift relative to the wave number, , 


: a [-—I'=R R 


I-I'R [#—I"=R 
R 
remnant, 
\I—T'R|? 


Ec \T—I'R|? 
(2) 


Here J is the radial factor of the wave function for an 


N"(pp) N“ 
S-WAVE PHASE SHIFTS 
FOR J- 


5 %+ 


OEGREES 





oe ase 
Mev 
Fic. 2. Energy dependence of 6*/*. The solid and dashed curves 
indicate lines of constant logarithmic derivatives as in Fig. 1. 
The solid line running from 1.8 Mev to 2.9 Mev shows the S-wave 
hard-sphere phase shifts used in fitting the 2.32-Mev resonance. 


183. P. Wigner, Phys. Rev. 98, 145 (1955). 


incoming particle and R is the reciprocal of the loga- 
rithmic derivative. The dot signifies differentiation with 
respect to k, and the prime signifies differentiation with 
respect to the radial coordinate. The asterisk indicates 
the complex conjugate. 59 is the & derivative of 6 for 
R independent of &, i.e., independent of energy. The 
essence of the development is that R>0, which im- 
mediately gives 6>6. The limitation imposed by 
causality can thus be conveniently displayed by 
plotting families of curves representing R=const on a 
graph of 6 versus energy. The slope of the curve of 6 
cannot be less than that of any member of the family 
which it intersects. The application of this principle to 
charged particle scattering demands only that we use 
charged particle wave functions'*® for computing the 
family of curves. 

These curves are indicated in Figs. 1, 2, and 3. In 
Figs. 1 and 2 for the two S-wave phase shifts are given 
the curves for r=0, —0.6 and —1. In Fig. 3 for the 
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Fic. 3. Energy dependence of ¢,. The solid and dashed curves 
indicate lines of constant logarithmic derivatives as in Fig. 1. The 
solid line running from 1.8 Mev to 2.9 Mev is the P-wave phase 
shift used in fitting the 2.32-Mev resonance. 


P-wave phase shift, the curves are for r=0, —0.25 and 
—0.4. Here r=R/a where a is the nuclear radius and 
has been taken as a=1.5A!*X10-" cm=3.615X10-" 
cm. The curves given are adequate to display departures 
from the condition 6>69. r=0 corresponds to the 
vanishing of the wave function at the nuclear surface 
and this curve is the ordinary hard sphere phase shift. 
In the Wigner-Eisenbud theory R is given by 


Xr 
— oo, (3) 
d k-E 


so that if R= «, then clearly E= Ej, the energy of one 
of the eigenstates of the nucleus. Thus at each resonance 
the graph of the phase shift crosses that for R=, 
which is not shown, but which lies somewhat above 


‘4 Bloch, Hull, Broyles, Bouricius, Freeman, and Breit, Revs. 
Modern Phys. 23, 147 (1951). 

15 Sharp, Gove, and Paul, Atomic Energy of Canada Limited, 
Report TPI-70 (unpublished). 





ANALYSIS OF PROTON SCATTERING 


the one for r=0. Cross sections are periodic in the 
phase shift with a period w and the families R= const. 
in the phase shift graphs repeat with this period. 


DISCUSSION OF THE PHASE SHIFTS 


Below 2.3 Mev the fits to the angular distributions 
are moderately good, with the reservations discussed 
below in more detail. Above this they are consistently 
poor. It may thus be assumed that the phase shifts 
represent the gross aspects of the lower energy scat- 
tering. Inasmuch as they are not materially different 
from the background phases used in earlier attempts to 
fit the 1.746-Mev resonance, the difficulty that this 
resonance has presented is somewhat surprising. Re- 
garding the region from 2.3 Mev to 3.0 Mev, the 
D-wave hard-sphere phases are rather small, and it 
seems likely that the observed discrepancies result from 
more complicated behavior of the P-waves. 

An unexpected result is that for all but one of the 
fits in the regions of energy away from resonances, 6!/?+ 
and 6°+ are identical, or equivalently differ by integral 
multiples of x. This would arise from hard-sphere scat- 
tering, or from scattering by a simple potential well. 
However two aspects of these results indicate that in 
general they are not physically significant. These are 
the only solutions obtained in regions where the fit is 
conspicuously bad, e.g., above 2.5 Mev, and where the 
formula is clearly inadequate to represent the scat- 
tering. Secondly, the two parameters are statistically 
completely correlated. This would ordinarily imply that 
the normal matrix was singular and would prevent any 
solution being found. In the present case the second 
derivative corrections mentioned above have made this 
solution possible. It is easily shown that it is always 
present as a formal solution to the equations, although 
in many cases it represents a saddle point and not a 
true minimum of the fit. In this regard it is interesting 
to note that without the second derivative corrections, 
the normal matrix is positive definite and does not 
admit saddle points. 

The nature of these results can be illuminated by 
introducing the transformation 


= ¢jn241/2 37 283/2+ 
A =sin’6!/?*+-+- 2 sin?6*/?+, 
B=siné!/** cos6!/*++-2 sind*/?+ cosé3/*+, 


62+ and 6*/*+ are readily eliminated from Eq. (1) by 
this transformation. In terms of A and B, the cross 
section is much less restricted than in terms of the 6’s 
if only real values are allowed. Real 6+ and 69+ 
limits A and B to an annular region in the A,B plane, 
the outer edge of which is the locus for 6!/?+=6%/+, 
Equation (1) is not valid for complex phase shifts, so 
that values of A and B outside this region are not ad- 
missible. A least-squares fit with the parameters A, B, 
and ¢; converges without the introduction of second 
derivative corrections. A number of such fits to typical 
distributions have shown that where a nondegenerate 
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solution was previously obtained, the identical result was 
found, while when only the degenerate solution was 
obtained, a better fit was found which lay outside the 
region of real 6’s. In the latter cases the degenerate solu- 
tion corresponded to a nearby point on the edge of the 
real region. It is thus to be concluded that essentially 
all of the nonresonant distributions correspond to points 
outside the region of real phase shifts. This implies a 
deficiency in the assumptions underlying Eq. (1). How- 
ever, the fact that many of the degenerate fits obtained 
are quite good ones shows that in these cases the defic- 
iency is a small one. 

Between 1.2 Mev and 1.4 Mev systematic differences 
between the experimental points and the best theo- 
retical curve are apparent. Degenerate phase shifts are 
also obtained in this region. At this low energy the 
presence of D-waves or higher ones is extremely unlikely 
so that these anomalies must reflect more complicated 
behavior of the P-waves. The most probable interpre- 
tation is that channel spin mixing, i.e., transitions from 
one channel spin to the other due to off-diagonal ele- 
ments of the S-matrix, is occurring in the 1/2- and 
3/2-states. While this has not been studied specifically 
for the nonresonant case, studies of P-wave resonances 
with a strong channel spin mixing have produced angu- 
lar distributions with shapes characteristic of those 
obtained in this region. Coupling of the spin of the 
incident proton to its orbital motion or to the angular 
momenta of the target nucleus will produce such 
effects. 

Nondegenerate solutions for the S-wave phases have 
appeared at the 1/2+ resonance at 1.557 Mev and in 
the region from 1.9 Mev to 2.4 Mev. At 1.557 Mev, 6'/** 
follows the course expected for this resonance. However, 
the energy dependences of both 6'*+ and 6*+ are 
distorted here. For 6+ a dispersion-like anomaly is 
present where a smooth variation is expected. The 
energy dependence of 6'/?+ implies a width of 13 kev 
whereas a fit made directly to the cross sections using 
the one-level formula gives 34 kev.! It may be remarked 
that a phase-shift analysis like the present one will not 
give good values for widths unless all of the parameters 
show reasonable behavior. These anomalies are attrib- 
utable to inadequacies of the formula such as appear in 
the nonresonant regions. From 2.0 Mev to 2.4 Mev the 
phases give a strong indication of a broad 1/2+ reso- 
nance which will be discussed fully in the following 
section. 

BROAD RESONANCE AT 2.3 MEV 


Between 1.9 Mev and 2.4 Mev the two solutions for 
6+ and 6+ are distinct. The phase shifts from one 
solution follows a course which would be expected for 
a 1/2+ resonance, while those from the other give an 
energy dependence for 6'/*+ that is incompatible with 
the causality condition so that it can be discarded. For 
the acceptable solution 6*/*+ and ¢; show a reasonably 
smooth behavior except at points where they are 
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Fic. 4. Experimental and calculated yield curves at the 2.32-Mev 
resonance at six different angles. 


perturbed by the narrow resonances at 2.344 Mev and 
2.468 Mev. 
In order to interpret the scattering near 2.3 Mev as 
a 1/2+ resonance, it is necessary to show that the 
scattering cross sections are reasonably well approxi- 
mated by smoothly varying potential phase shifts plus 
a resonance phase shift for the 1/2+ state. In smoothing 
the background phases we must expect the fits to be 
impaired since this operation demands departure from 
the best least-squares values. The result will be satis- 
factory if the impairment is slight. If the smoothed 
phases represent the average in a small energy interval, 
then the resulting cross sections will be the average 
cross sections over that interval. 
6'*+ is given by the formula’® 
WP 
tand!/*+ = ——_—_+ Ry (4) 


1a 


40 y 


where the first term on the right-hand side gives the 
resonance contributions and Ro gives the nonresonant 
background. We assume that Rp» is obtained from 6*/**, 
1.e., 
tand*/*+= Ro. (S) 
16 This equation applies properly to the nuclear phase shift, 
5/* —@p, in our notation, where ¢» is the hard-sphere phase shift. 
The correction of this inaccuracy will change the curves only 


slightly, and as a formal fit is not being attempted it has been 
omitted. 


The width T is given by 
r=2P7, 


where P is the penetrability and y the reduced width 
of the level. 

62+ has been calculated from Eqs. (4) and (5), 
using for 6*/*+ and ¢, the solid lines in Figs. 2 and 3 
which run from 1.8 Mev to 2.9 Mev. The resonant 
energy and width are Ey>= 2.32 Mev and '=0.55 Mev. 
The energy dependence of I through the penetrability 
has been included, and the above value is the width at 
E,. This choice of resonant energy and width has been 
made to force a fairly good fit to angular distributions 
at 1.99 Mev and 2.44 Mev. The solid line of Fig. 1 
running from 1.8 Mev to 2.8 Mev shows 6'/?* as cal- 
culated in this way. 

do/dor computed from these phase shifts for six 
different angles are shown in Fig. 4, for the energy 
range 1.8 Mev to 2.9 Mev. The solid lines represent the 
computed results and the various circles, triangles, etc., 
the experimental values. The cross section is fairly well 
represented for the largest angles up to 2.4 Mev. For 
angles less than 120°, the fit becomes poor at about 
2.2 Mev. Above 2.4 Mev neither the resonance fit nor 
the unrestricted one is adequate. Better agreement at 
the smaller angles can be obtained by increasing Ey and 
I’, but this makes the fit at higher energies even worse. 

The N'(p,y)O" reaction shows a_resonance-like 
behavior in this region that has been interpreted by 
Duncan and Perry!’ in terms of S-wave resonance at 
2.6 Mev with a width of 1.25 Mev. The resonant energy 
is somewhat higher and the width substantially greater 
than found from the scattering experiments. This dis- 
parity may be due to the presence of an appreciable 
nonresonant contribution to the (f,y) cross section 
whose effect would be to increase the apparent width 
of the resonance if not explicitly included. 
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The complex alpha particle-nuclear potential is determined with small uncertainty at the nuclear surface 
by experiments with alpha particles in the range of bombarding energies up to 50 Mev in conjunction with 
this optical-model analysis which assumes that the shape of the complex potential is exponential at the 
nuclear surface. The potential is given by the expression 


= Ai = Ai 
Vo+Wa={ —1100 exp| (AY) 45.23 exp| -(° ee )I} Mev, 


for values of r (in units of 10-5 cm) where the real part is 2—10 Mev. The elastic scattering data has 
been used to determine the potential. The calculated reaction cross sections are found to be in satisfactory 
agreement with excitation function data. 

The total reaction cross section op for alpha particles in the energy range 0-50 Mev on nuclei with charge 
Z=10, 20, 30, 50, 70, and 90 has been calculated using the potential Vz+iW. obtained from the analysis 
of elastic scattering data. The calculated values may be interpolated to obtain og for other values of Z. 


I. INTRODUCTION 


LASTIC scattering cross sections'~” and excitation 

functions!~'* have been measured from many 
elements with alpha particles of bombarding energies 
ranging up to 50 Mev. The analyses?.°-?? made so 
far have been only of the elastic scattering data. 
However, the optical-model calculations also predict 
the total reaction cross sections. These may be com- 
pared with the combined excitation functions for all 
possible reaction products as a further test on the 


* Work performed under the auspices of the U. S. Atomic 
Energy Commission. 

t Fulbright Fellow 1958-1959. 

1G. W. Farwell and H. E. Wegner, Phys. Rev. 95, 1212 (1954). 

2 Wall, Rees, and Ford, Phys. Rev. 97, 726 (1955). 

3 Wegner, Eisberg, and Igo, Phys. Rev. 99, 825 (1955). 

4R. Ellis and L. Schecter, Phys. Rev. 101, 636 (1956). 

5H. E. Gove, Phys. Rev. 99, 1353 (1955). 

6 Eisberg, Igo, and Wegner, Phys. Rev. 99, 1606 (1955). 

7E. Bleuler and D. J. Tendam, Phys. Rev. 99, 1605 (1955). 

8 Igo, Wegner, and Eisberg, Phys. Rev. 101, 1508 (1956). 

9 Seidlitz, Bleuler, and Tendam, Phys. Rev. 110, 682 (1958). 

10 Kerlee, Blair, and Farwell, Phys. Rev. 107, 1343 (1957). 

1 R, L. Walker, Phys. Rev. 76, 244 (1949). 

12], Halpern, Phys. Rev. 76, 248 (1949). 

18$. N. Ghoshal, Phys. Rev. 80, 939 (1950). 

4K. G. Porges, Phys. Rev. 101, 225 (1956). 

15N. T. Porile, Bull. Am. Phys. Soc. Ser. II, 3, 382 (1958). 

16 Bleuler, Stebbins, and Tendam, Phys. Rev. 90, 460 (1953). 

17 E. Kelly and E. Segré, Phys. Rev. 75, 999 (1949). 

18 Vandenbosch, Thomas, Vandenbosch, Glass, and Seaborg, 
Phys. Rev. 111, 1358 (1958). 

19 Glass, Carr, Cobble, and Seaborg, Phys. Rev. 104, 434 (1956). 

2” J. S. Blair, Phys. Rev. 95, 1218 (1954). 

21N. Oda and K. Harada, Progr. Theoret. Phys. (Kyoto) 15, 
545 (1956). 

2 C, B. O. Mohr and B. A. Robson, Proc. Phys. Soc. (London) 
A69, 365 (1956). 

% G. Igo and R. M. Thaler, Phys. Rev. 106, 126 (1957). 

*W. B. Cheston and A. E. Glassgold, Phys. Rev. 106, 1215 
(1957). 

26 J. S. Blair, Phys. Rev. 108, 827 (1957). 

26 G. Igo, Phys. Rev. Letters 1, 167 (1958). 

27C, E. Porter, Phys. Rev. (to be published) and Phys. Rev. 
99, 1400 (1955). 


ference on the Nuclear Model, 1959, edited by A. E. S. 


optical-model potential obtained from elastic scattering 
data. 

In practice, excitation functions are measured for 
only a few of the many possible reactions induced under 
alpha-particle bombardment. In many nuclei, however, 
these represent the bulk of the reaction cross section, 
and consequently the sum of the measured excitation 
functions can be compared with the reaction cross 
section calculations. A method’* for measuring the 
total reaction cross section has been applied to proton- 
induced reactions.?* No similar measurement for alpha 
particles has been reported so far. The results of this 
analysis suggest that excitation function data may also 
be valuable in optical-model analyses of reactions 
induced by protons, deuterons, etc. 


II. ELASTIC SCATTERING 


The elastic scattering of 18-Mev alpha particles from 
argon,® 40-Mev alpha particles from copper,’ and 48- 
Mev alpha particles from lead* has been extensively 
analyzed in terms of the optical model.”* In Figs. 1, 2, 
and 3, the experimental cross sections are plotted along 
with the best fits obtained from the optical-model 
analysis. The conclusion of this analysis was that the 
elastic scattering cross section is very sensitive to the 
surface of the real part of the nuclear potential. The 
results of earlier analyses are compatible with this 
statement. The real part V. of the potential obtained 


*2N. M. Hintz, Proceedings of the University of Pittsburgh 
Conference on Nuclear Structure, 1957, edited by S. Meshkov 
(University of Pittsburgh and Office of Ordnance Research, U. S. 
Army, 1957); R. M. Eisberg, Proceedings of the International Con- 
Green, 
C. E. Porter, and D. S. Saxon (The Research Council, Florida 
State University). 


1665 





GEORGE 


> 


ARGON i6 MEV 


fo*47 a’? + 1.36 
4*0.6 
(V,W)* -100, -15 


de -1 
on™ ster") 











Oem, (DEGREES) 


Fic. 1. Angular distribution of 18-Mev alpha particles elastically 
scattered from argon. The points are from the experiment of 
Seidlitz, Bleuler, and Tendam, multiplied by the factor 0.724. 
The solid curve is calculated using the parameters listed on the 
graph. 


in this analysis could be represented as 
— {1100 exp[ — (r—1.1744)/0.574]} Mev 


for values > —10 Mev with r in units of 10-" cm. 
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Fic. 2. Angular distribution of 40-Mev alpha particles elastically 
scattered from copper. The points are from the experiment of 
Igo, Wegner, and Eisberg. The solid curve is calculated using the 
parameters listed on the graph. 
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Fic. 3. Angular distribution of 48-Mev alpha particles elastically 

scattered from lead. The points are from the experiment of 


Schechter and Ellis. The solid curve is calculated using the 
parameters listed on the graph. 


In Figs. 4, 5, and 6, the real parts of the potential 


(V+iIW)/{1+exp[(r—10)/d]}, 


obtained by varying the four parameters V, W, Ro, 
and d, are plotted for elastic scattering from argon at 
18 Mev,’ copper at 40 Mev,’ and lead at 48 Mev,' 


r( fermis) 


Fic. 4. The real part of the nuclear potential which best fits the 
elastic scattering data for 18-Mev alpha particles on argon 
plotted as a function of r in units of 10“% cm. The potential 
— 1100 exp[ — (r—1.17A!)/0.574] is also plotted. 
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Fic. 5. The real part of the nuclear potential which best fits 
the elastic scattering data for 40-Mev alpha particles on copper 
plotted as a function of 7 in units of 10~ cm. The potential 
— 1100 exp[— (r—1.17A!)/0.574] is also plotted. 


respectively. Also plotted is V4 for argon, copper, and 
lead, respectively. Figures 7, 8, and 9 are real potentials 
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Fic. 6. The real part of the nuclear potential which best fits 
the elastic scattering data for 48-Mev alpha particles on lead 
plotted as a function of 7 in units of 10% cm. The potential 
— 1100 exp[— (r—1.17A!)/0.574] is also plotted. 


at large values of r which yield relatively small residuals 
in the least squares analysis which is used as a criterion 
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Fic. 7. Comparison of real potentials for the elastic scattering of 18-Mev alpha particles from argon. Curves Ajand B represent 
the best fits obtained (see Fig. 4). The remaining real potentials are lettered in order of increasing magnitude of the residuals obtained 
in the least-squares analysis used as a criterion for goodness of fit. The quantity V is in Mev; d and ro, in 10-8 cm. 
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Fic. 8. Comparison of real 
potentials for the elastic scattering 
of 40-Mev alpha particles from 
copper. Curves A, B, and C 
represent the best fits obtained 
(see Fig. 5). The remaining real 
potentials are lettered in order of 
increasing magnitude of the re- 
siduals obtained in the least- 
squares analysis used as a criterion 
for goodness of fit. The quantity 
V is in Mev; d and ro in 10™ cm. 
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for goodness of fit. The best fits (see Figs. 4, 5, and 6) 


lie close together. The remaining potentials deviate 


considerably from these best fits. 
The imaginary part of the nuclear potential has also 
now been obtained and it may be expressed as 


W a= —45.7 exp[ — (r—1.40A4)/0.578 ] Mev, 


over the radial range where V, is valid. In Figs. 10, 
11, and 12, the best imaginary parts of the potential 
(V+iW)/{1+exp[r—ro)/d]} are plotted for the three 
angular distributions. Also plotted is W. for argon, 
copper, and lead, respectively. The imaginary parts of 
the potentials do not fall together as well at the surface 
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of the potential as do the real parts. The potential 


— (19+ 137)/{1+exp [(r—7.22)/0.5]} Mev 


(see Figs. 11 and 5) shows the largest deviation in the 
imaginary part (as well as in the real part). The 
angular distribution obtained from this potential does 
not fit the copper data as well as those obtained from 
the other potentials (the least-squares residual is five 
times as large as the best of the other two). 

These results show that alpha-particle scattering 
defines the complex nuclear potential reasonably well 
at the surface, and are in accord with qualitative 
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Fic. 9. Comparison of real potentials for the elastic scattering of 48-Mev alpha particles from lead. Curves A and B represent the 
best fits obtained (see Fig. 6). The remaining real potentials are lettered in order of increasing magnitude of the residuals obtained in 
the least-squares analysis used as a criterion for goodness of fit. The quantity V is in Mev; d and 7, in 10° cm. 
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arguments made earlier by Ford and Wheeler.”® These 
authors assumed that the mean free path of alpha 
particles in nuclear matter would be small. Conse- 
quently, the alpha particle would sample the potential 
at large values of r, where the Woods-Saxon potential® 
may be written approximately as 


(V+iW) expl[—(r—r0)/d], 


in agreement with the results of this analysis. Blair*® 
has concluded, from an extensive analysis of alpha- 
particle data using the sharp-cutoff model,” that the 
scattering of alpha particles is primarily determined by 
the nuclear surface. Brussaard*! has also concluded, 
from an application of the WKB method to the 
Schrédinger equation, that the scattering should be 
insensitive to the inner part of the potential. 


III. EXCITATION FUNCTIONS 


Total reaction cross sections have been calculated 
using the potential derived from the elastic scattering 
analysis,” 

VatWa=—1100 exp[ (r—1.17.4!)/0.574 ] 
—47.5i exp[— (r—1.40A 4) /0.578]. 








r(fermis) 


Fic, 10. The imaginary part of the nuclear potential which best 
fits the elastic scattering data for 18-Mev aipha particles on argon 
plotted as a function of r in units of 10-8 cm. The potential 
—45.7 exp[— (r—1.40A!)/0.578] is also plotted. 


29K. W. Ford and J. A. Wheeler (to be published). 

%® R. D. Woods and D. S. Saxon, Phys. Rev. 95, 577 (1954). 

31 P. J. Brussaard, thesis, University of Leyden, 1958 (unpub- 
lished). 

% See Part V for details on the calculation of total reaction 
cross sections. 
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Fic. 11. The imaginary part of the nuclear potential which 
best fits the elastic scattering data for 40-Mev alpha particles on 
copper plotted as a function of 7 in units of 10~ cm. The potential 
—45.7 exp[— (r—1.40A!)/0.578] is also plotted. 


Excitation function measurements where absolute cross 
sections exist have been made on isotopes of boron,!! 
7 


Pb 46 MEV 
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Fic. 12. The imaginary part of the nuclear potentials which 
best fits the elastic scattering data for 48-Mev alpha particles on 
lead plotted as a function of r in units of 10~ cm. The potential 
—45.7 exp[— (r—1.40A4)/0.578] is also plotted. 





GEORGE 


4 ae | 
3 35 4-4 
T (Mev) 


Fic. 13. The quantity or calculated from the alpha-particle 
potential for magnesium. The points represent the trend of the 
(a,m) cross section on magnesium, multiplied by a factor to take 
into account the abundance of Mg*®. 


beryllium," magnesium,” aluminum,” nickel,'* copper," 
zinc,'® silver,'*"* bismuth," uranium,'* and plutonium." 
The reaction cross sections can be compared with 
measurements of excitation functions for alpha-induced 
reactions if proper account can be taken of the contri- 
bution of reactions not measured in the experiments. 
For instance, the (@,p) reaction on aluminum is favored 
by a positive Q-value, whereas only the (a,”) excitation 
function” up to T=4.5 Mev has been measured. 
Consequently no quantitative comparison can be made. 
In the light elements (A $69), the probability of proton 
emission F’, is reported large at T~20 Mev.*-* Conse- 
quently, when excitation functions are missing for 
reactions where charged particles are emitted by light 





T 
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Fic. 14. The quantity og calculated from the alpha-particle 
potential for zinc. The points are Porile’s data on the total 
reaction cross section for alpha-induced reactions on Zn™. 


BBL. Cohen and E. Newman, Phys. Rev. 99, 718 (1955). 
% Cohen, Newman, and Handley, Phys. Rev. 99, 723 (1955). 
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elements, it is impossible to make a quantitative 
comparison.*® Some of these excitation functions are 
missing in the nickel’* and copper’ measurements. 
Here one can only note that the calculated cross 
sections are larger than the partial cross sections which 
are measured. Since it is very difficult to extrapolate 
the total reaction cross section or to values of Z less 
than 10, no attempt was made to compare the calcu- 
lations with the excitation functions of boron and 
beryllium. 

In Fig. 13, og for magnesium is compared with the 
excitation function measured by Halpern” for the (a,7) 
reaction on magnesium in the energy range T=2.5 to 
4.4 Mev. In this experiment neutrons from the (a,m) 
reaction were measured to obtain an excitation function, 
and consequently the (a,) reaction on all three isotopes 
of magnesium (78.47% Mg’, 10.18% Mg**, 77.47% 
Mg**) could contribute to the measured neutron cross 
section. However the (a,p) reaction on Mg” will account 
for most of the total reaction cross section in Mg” in 
this energy range because of the large positive Q-values. 
The (a,m) and (a,p) reactions on Mg** will be small 
because of the negative Q-values (—1.8 and —2.9 Mev, 
respectively) for this range of T. The (a,v) and (a,p) 
reactions on Mg*® have Q-values of 2.2 and —0.7 Mev, 
respectively. It is therefore apparent that the bulk of 
the observed neutrons are coming from the Mg**(a,n)Si** 
reaction. The experimental cross sections multiplied 
by a factor to take in account the relative abundance 
(10.18%) of Mg® in natural magnesium are plotted 
also in Fig. 13. It should be noted that any contribution 
from the (a,z) reaction on Mg* and Mg’® will decrease 
the values of the points plotted in Fig. 15, and bring 
them into better agreement with the calculation. 

In Fig. 14, or for zinc is plotted. The sum of the 
measurements of Porile'® for the (a,y)+ (a@,2pn) + (a,an) 
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Fic. 15. The curve is or calculated from the alpha-particle 
potential for silver. The solid circles, @, are the sum of the (a,m) 
and (a,2n) cross sections. The open circles, O, are the sum of 
(an), (a,2n), and (a,mp) cross sections. 


35 See, for instance, J. M. Blatt and V. F. Weisskopf, Theoretical 
Nuclear Physics (John Wiley & Sons, Inc., New York, 1952), p. 
365. 
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+ (a,a2n)+ (a,apn) +> (ain) +d 0? (a,pin) excita- 
tion function on Zn™ plus the estimated contributions 
of the (a,y) reaction for 7218 Mev and the (a,2p), 
(a,3p), (aay), (a,2a), (aap), (a,a2p) reactions is 
plotted. Porile’® states that the errors indicated by the 
flags are mainly due to the uncertainty in estimating 
the magnitude of the latter contributions. 

In Fig. 15, the experimental data of Bleuler ef al." 
on the (a,7)+(a@,2n) excitation function and of Porges'! 
on the (a,#)+(a,22)+(a,np) excitation function on 
Ag and op for silver are plotted. The curve for op is 
in Satisfactory agreement with the experimental points. 

In Fig. 16, the (a,2n)+(a,3n) excitation function of 
Kelly and Segré" and cx for bismuth are plotted in the 
energy range T= 24 to 38 Mev. The discrepancy at 24 
Mev is probably due to the absence of the (a,) contri- 
bution to the total cross section. 

In Fig. 17, the (a,f)+) 0 :-1°(a,in) +> :0°8 (a, pin), 
the (a,f) +> :=1°(a,in)+ (a,p)+ (a,p2n), and the (a,f) 
+¥ :-°(a,pin)+(a,an) excitation functions on U™, 
U*>, and U**, respectively, of Vandenbosch ef al.'® and 
or for uranium in the energy range 7=18 to 46 Mev 
are plotted. The quantity (a,f) is the fission reaction. 
The fit is considered satisfactory over the entire energy 
range. 

In Fig. 18, the (a, f)+ (a,)+ (a,22) + (a,4)+ (a, pn) 
+ (a,p2n) and the (a, f) +>. =1°(a,in) + (a,pn) + (a,p2n) 
+ (a,p3n) excitation functions Pu** and Pu of Glass 
et al."® and op for plutonium in the energy range T= 18 
to 48 Mev are plotted. The agreement between or and 
the experimental points for Pu** is satisfactory. Part 
of the difference the total reaction cross 
section for Pu®** and Pu*® according to Glass ef al.’ is 
due to the missing (a,pm) reaction on Pu, which 
could not be measured. 
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Fic. 16. The curve is or calculated from the alpha-particle 
potential for bismuth. The points are the trend of the sum of 
the (a,2m) and (a,3m) cross sections on bismuth. 
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Fic. 17, The curve is or calculated from the alpha-particle po- 
tential for uranium. The points are the sum of measured reaction 


cross sections for alpha-induced reactions on U** (QO), U™® (@), 
and U2 (x), 


IV. REACTION CROSS SECTIONS FOR ENERGETIC 
ALPHA PARTICLES ON NUCLEI 

Shapiro*® and Feshbach and Weisskopf* have calcu- 

lated the total reaction cross sections or for protons, 











a 
30 48 


a (Mev) 

Fic. 18. The curve is or calculated from the alpha particle 
potential for plutonium. The points are the sum of the measured 
reaction cross sections for alpha-induced reactions on Pu** (@) 
and Pu*® (©). 


86 M. M. Shapiro, Phys. Rev. 90, 171 (1953). 
37 J. M. Blatt and V. F. Weisskopf, Theoretical Nuclear Physics 
(John Wiley & Sons, Inc., New York, 1952), p. 352 and 491. 
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deuterons, and alpha particles using the schematic 
theory of nuclear reactions.** These authors have com- 
pared the shape of og with excitation function measure- 
ments for alpha particles on Rh", Ag™ ®, aluminum,” 
magnesium,” and Bi®.!? They have found that the 
shape could be fit quite well with the radius parameter 
1.3A1X10-" cm, where A is the atomic number. Later 
measurements on other heavy nuclei where absolute 
reaction cross sections were measured disclosed that the 
radius parameter must be larger than 1.541X10~-" cm 
in order to fit the data.'®* Shapiro” and Blatt and 
Weisskopf*” made calculations only for two values of 
the radial parameter, 1.54! 10~-" cm and 1.34!X 10-" 
cm (except for Z<30).** It has therefore been difficult 
to obtain quantitative fits to the excitation function 
data. 
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Fic. 19, The total reaction cross section versus T/e 
a for Z=10, 20, 30, 50, 70, and 90. 


38H. Feshbach and V. F. Weisskopf, Phys. Rev. 76, 1550 
(1949), 
%* TD). J. Tendam and H. L. Bradt, Phys. Rev. 72, 1118 (1947). 
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Another reason for interest in better values of op is 
that or enters into the calculation of energy level 
densities in the statistical theory of nuclear reactions.” 
According to the statistical theory, the energy level 
density p(Ep) of the residual nucleus (after the emission 
of an alpha particle, for instance) at an excitation Ep is 


N(Eo—Er)dEr 
p(Er)dER=const——— ; 
(Eo- Er)o-(Eo— Er) 


where Ey is the kinetic energy available to the alpha 
particle and residual nucleus in the center-of-mass 
system, when the residual nucleus is left in its ground 
state, and N(E£)— Er)d(Ey— Er) =number of nucleons 
emitted with energy between (Eyo— Ep) and (Eo— Ep) 
+d(Ey— Er), where T= Eo— Ep is the kinetic energy 
available in the center-of-mass system, and (o,Ho— Er) is 
the cross section for the formation of the compound nu- 
cleus for the inverse process. The interpretation of energy 
spectra of alpha particles emitted in (a,a’), (p,a), (d,a), 
etc. reactions is often made in terms of the statistical 
theory of nuclear reactions. To make a quantitative 
interpretation it is essential to have og. The evidence 
presented in Part III shows that reliable values of or 
can be obtained using the optical-model potential re- 
sulting from elastic scattering analysis.”® 


V. THE CALCULATION OF TOTAL REACTION 
CROSS SECTIONS 
Total reaction cross sections have been calculated 
using the potential derived from the elastic scattering 
analysis,”® 


1—1.17A! 
V.tiW.=—1100 exp (- ae )| Mev 
0.574 


r—1.40A! 
—45.7i exp| ~ ( )| Mev, 
0.578 


for values of Va= —10 Mev. The charge distribution 
determined by Ford and Hill" from the analysis of 
experiments® sensitive to the charge distribution pa- 
rameters has been used. In the analysis of the data on 
elastic scattering of 40-Mev alpha particles, it was 
noted that the calculated cross section was not very 
sensitive to the shape of the charge distribution. The 
Coulomb potential due to the Hill-Ford charge distri- 
bution™ is 


Zeri 1 x e*si-e™* 1 
ERE 
T.in? 2 6 nv nx Z 


ae 
(++ , el 


3 nn? 


“ V. F. Weisskopf, Phys. Rev. 52, 295 (1937). 

41 —D. L. Hill and K. W. Ford, Phys. Rev. 94, 1617 (1954). 

“R. Hofstadter, Annual Review of Nuclear Science (Annual 
Reviews, Inc., Palo Alto, California, 1957), Vol. 7, p. 231. 
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TABLE I. Woods-Saxon potential parameters. 





V 
(Mev) 


—50 
—50 
—50 
— 50 


Ww 


A (Mev) 


20 
“ 


ro d To 
(1073 cm) (107% cm) (107% cm) n 


0.576 17A! 
0.576 17A3 
0.576 17A3 

1 

1 

1 





44 
5.5 
: 6.5 
119 0.576 7A4 8.0 
0.576 7A' 9.0 
0.576 7A! 10.0 


17A042.17 
17A842.17 


1. 
1, 
I. 
ye 
iif 
1. 


where 
s=1/Te, 


and where » is 10 for heavy elements and is proportional 
to A}. Large values of n (10) give charge distributions 
approximately constant for small values of r; small 
values, exponential charge distributions. The quantity 
r, is the distance out to the half-value point of the 
charge distribution. The value 1.174!X10-" cm was 
chosen larger than the value obtained from the electron 
scattering experiments” to take roughly into account 
the effect of the finite size of the alpha-particle charge 
distribution. 

The parameters V, W, ro, and d listed in Table I for 
the Woods-Saxon® potential 


y'Fe 
v+in) /|1+e(—*)] 
¢ 


have been chosen to reproduce Va+iWa for values of 
r where the real part of the potential 2 —10 Mev. The 
depth of the potential for small values of r is also fixed 
when the parameters in Table I are used although it 
has been shown” that alpha-particle scattering is not 
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z 
20. The quantity e versus Z; e has been calculated using the 
parameters listed in Table IT. 


Fic. 


sensitive to the depth. This procedure was necessary in 
order to use the existing calculational program. 

Table II lists the calculated values of the total 
reaction cross sections using the parameters listed in 
Table I. The quantity 7 is the kinetic energy in the 
center-of-mass system, and e is the maximum height of 
the potential barrier for /=0 alpha particles. Figure 19 
is a plot of or versus T/e. It is relatively easy to inter- 
polate between the calculated cross sections for inter- 
mediate values of Z using Fig. 19. The quantity e is 
plotted versus Z in Fig. 20 where e has been calculated 
using the parameters listed in Table II. 


TABLE IT. Total reaction cross sections, or. 





Z 
e(Mev) 3.33 


oR 
(millibarns) 


1176.9 
1231.2 
1219.6 
1038.0 
419.42 f 7.28 
23.74 0.750 4.54 
0.3461 0.502 2.13 
50 
13.76 


i if 
(Mev) (Mev) 
33.3 
25.0 
16.7 
8.33 
4.77 
2.50 
1.67 








27.3 
18.2 
9.09 


Z 
e(Mev) 


T oR r 
(Mev) (millibarns) T/e (Mev) 


30 
8.97 


T oR 
(Mev) (millibarns) 
37.7 1655.0 4.20 
28.2 1528.8 3.14 
18.8 1230.8 2.095 

9.42 200.14 1.05 
7.52 16.56 


0.838 
4.71 0.00419 0.523 


20 
6.34 
oR 
(millibarns) T/e 
1465.4 5.73 
1426.0 4.30 
1286.5 2.86 
659.3 1.43 
322.6 1.147 
3.852 0.715 
0.000762 


rp: é 


90 
22.06 


70 
18.15 

T oR 
(Mev) (millibarns) 


oR 
(millibarns) T/e T/e 





58.7 
39.1 
29.3 
19.55 
14.68 
11.70 


38.7 
29.0 
19.35 
14.50 
9.68 
rie eo 
4.84 


1809.7 
1521.9 
871.5 
230.9 
0.327 
0.001542 
0 


2.81 

2.11 

1.408 
1.053 
0.704 
0.562 
0.352 





59.0 
39.3 
29.5 
19.7 
17.7 


14.75 


2.67 
1.78 
1.335 
0.891 
0.804 
0.667 


2316.6 
1692.8 
920.4 
17.14 
1.3055 
0.00893 


2184.0 3.23 

1783.1 215 

1309.4 1.61 
307.64 1.075 
2.333 0.807 
0.00526 0.643 
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VI. SUMMARY 

The conclusion reached earlier?® and discussed in 
detail in this paper, that alpha-particle elastic scattering 
is sensitive to the potential surface, is strengthened by 
this analysis of excitation function data. The reaction 
cross sections versus bombarding energy obtained from 
VatiW. have been compared with alpha-induced 
excitation functions.’?"*~'’ When proper account is 
taken of missing reactions, i.e., alpha-induced reactions 
on a particular nuclide which have not been measured, 
the excitation functions measure the total reaction 
cross section as a function of bombardment energy. 
The agreement with the excitation function data 
strengthens the conclusion reached from the elastic- 
scattering analysis. 

Qualitative arguments can be made to account for 
the conclusion stated above. Ford and Porter’ make a 
“brick wall” argument to suggest a short mean free 
path for complex nuclear structures such as the alpha 
particle in nuclear material. Rasmussen® has empha- 
sized the importance of the exclusion principle in 
alpha-particle interactions, namely that at these bom- 
barding energies the alpha particle as an entity is 
probably excluded from the center of the nucleus. A 
possible way to treat the optical model taking into 
account the exclusion principle has been suggested 
recently by Frantz ef al." 
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Fic. 21. Curves a and 6 are the form factor for the charge 
distribution of uranium (curve 6 is a scaled-up section of curve a). 
Curves c and d are the Coulomb potential for Z=92 and the real 
part of the alpha particle-nuclear potential, respectively. Curve e 
is the real part of the potential for an /=0 alpha particle. The 
shaded area is discussed in the text. 


“3 J. O. Rasmussen, Revs. Modern Phys. 30, 424 (1958). 
“ Frantz, Mills, Newton, and Sessler, Phys. Rev. Letters 1, 
340 (1958). 
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Some rough conclusions can be drawn concerning the 
alpha decay of heavy nuclei. In Fig. 21 the real part of 
the total potential obtained from this analysis (curve e) 
is compared with the charge distribution (curves a 
and 6) for uranium (curve 8 is a scaled-up section of 
curve a). The neutron distribution is assumed to be 
equal to the charge distribution. The present analysis 
indicates that the elastically scattered alpha particles 
do not penetrate to depths appreciably greater*® than 
r=9.9 fermis in the uranium nucleus, since the scatter- 
ing is not sensitive to the real part of the potential at 
smaller values of r. In alpha decay, alpha-particle 
clusters formed at smaller values of r would dissolve 
before they could penetrate the Coulomb barrier. The 
probability of finding two neutrons and two protons 
beyond r=9.9 fermis (the hatched area in Fig. 21) is 
of the order of 0.02%. This must be considered a 
qualitative result for several reasons. First, no account 
has been taken of the effect of surface deformations. 
Rasmussen and Segall*’ point out that the alpha-particle 
wave functions must be nonuniform on the surface of 
a spheroidally deformed nucleus in order to account 
for measured alpha-particle decay intensities. In addi- 
tion it should be noted that the electron scattering 
results are not very sensitive to the exact shape of the 
tail of the charge distribution.” 
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Calculation of barrier penetrabilities, reduced widths, and hindrance factors for odd-mass alpha-particle 
emitters is made by using the diffuse exponential nuclear potential derived from optical-model analysis of 
alpha elastic-scattering data. The calculations are made on the same basis as for even-even alpha emitters, 


reported in a previous publication. 


N a previous publication,' hereafter referred to as I, 

the calculation of alpha-decay barrier-penetration 
factors by using an exponential nuclear potential was 
discussed in detail and numerical results for all measured 
alpha transitions of even-even nuclei were given. The 
nuclear potential used was the real part of a potential 
deduced by Igo to fit alpha elastic-scattering data 2 


1.17A'—r) 
V(r) = —1100 exp cs 
0.574 | 


Mev. 


Details of the IBM-650 computer calculation of the 
barrier-penetration factor P based on experimental 
alpha-decay energies are given in I. After calculation 
of P, the experimental alpha-decay rate information is 
used to calculate a ‘‘reduced alpha-emission width” 
6° from the expression 


A=6P/h, 


where A is the decay constant and / is Planck’s constant. 
The computer program permits inclusion of the cen- 
trifugal potential for any desired value of angular 


momentum. In contrast to the even-even nuclei of 
spin zero, where only one value of / is allowed for a 
given decay group, one usually expects a mixture of 
two or more / values in the alpha groups of odd nuclei 
with 7>}. In only a few cases is this admixture experi- 
mentally known, so that we have chosen, in the spirit 
of the “hindrance factor” concept, to calculate all 
barrier penetrabilities here with /=0. 

The experimental data used and the calculated P 
and 6? values are given in Table I. The data are taken 
mainly from the 1958 Table of Isotopes.* Table I also 
lists in the last column the hindrance factor, Ff, and 
the basis of calculation of # needs further discussion. 

The hindrance factor is the factor by which a given 
alpha group decays more slowly than would be calcu- 
lated by simple barrier-penetration theory, normalized 
to the ground-state transition rates of neighboring 


* Work performed under the auspices of the U. S. Atomic 


Energy Commission. 

' J. O. Rasmussen, Phys. Rev. 113, 1593 (1959). 

2G. Igo, Phys. Rev. Letters 1, 72 (1958). 

* Strominger, Hollander, and Seaborg, Revs. Modern Phys. 30, 
585 (1958). 


even-even nuclei. The hindrance factor is to alpha 
decay what the fof value is to beta decay. There is an 
arbitrariness in the process of determining the normal 
unhindered rate from even-even neighbors. The pre- 
vious extensive tabulations of hindrance factors (or 
“departure factors” as they were earlier called) have 
usually normalized to some average behavior of even- 
even nuclei over a region.’> Asaro has made calcula- 
tions of F for odd-A nuclei based solely on the average 
rate behavior of the two nearest even-even neighbors. 
It is this latter basis that was chosen for the F calcu- 
lations of Table I, i.e., 


F= (62-+6s?)/25.ua’, 


where 6,’ and 6,” are the reduced widths for ground- 
state transitions of the nearest neighboring even-even 
nuclei (given in Table I of reference 1). In those cases 
for which the data are unknown for one or both of the 
neasrest even-even neighbors, the basis for the un- 
hindered rate was taken from the one (or more) next- 
nearest neighbor(s). 

The three-significant-figure accuracy to which calcu- 
lated results are given in Table I is seldom justified on 
an absolute basis because of experimental uncertainties 
in energies or half-lives. Such accuracy is often necessary 
to afford significant comparison of relative hindrance 
factors of different alpha groups of the same alpha 
emitter, and such relative comparisons to 1% accuracy 
are often justified by accuracy of the data, since only 
the relative intensities and energy differences of com- 
pared states are of major importance to the ratio of 
hindrance factors. 

The hindrance factors given in this paper will usually 
differ only slightly from those calculated by earlier 
methods, and the difference will seldom be significant 
from an absolute standpoint. The three-figure accuracy 
of the computer calculation should, however, make 


4 Perlman, Ghiorso, and Seaborg, Phys. Rev. 77, 26 (1950); 
I. Perlman and F. Asaro, Ann. Rev. Nuclear Sci. 4, 157 (1954); 
Bohr, Fréman, and Mottelson, Kgl. Danske Videnskab. Selskab, 
Mat.-fys. Medd. 29, No. 10 (1955); C. J. Gallagher, Jr., and J. O. 
Rasmussen, J. Inorg. and Nuclear Chem. 3, 333 (1957); P. O. 
Fréman, Kgl. Danske Videnskab. Selskab, Mat.-fys. Skrifter 1, 
No. 3 (1957). 

57. Perlman and J. O. Rasmussen, //andbuch der Physik, edited 
by S. Fliigge (Springer-Verlag, Berlin, 1957), Vol. 42. 
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TABLE I. Alpha groups of odd-mass nuclei. 


Alpha-decay data* assumed for computation Calculated results® 


Partial a a@ group Barrier Reduced Hindrance 
Atomic Mass Neutron Fa +Exe half-life intensity penetration width factor 
No. No. No. (Mev) (sec) ° factor P & (Mev) F 





62 147 85 2.201 4.12(18) 1.378(—39) 5.047 (—1) 3.01(—2) 
63 147 84 2.921 2.075(11) 1.394(—31) 9.910(—2) 5.07(—1) 
64 149 85 3.021 1.15(11) 2.612(—31) 9.541(—2) 8.93(—1) 
65 149 84 3.972 1.48(5) 1.692(—24) 1.145(—2) 7.44(0) 
151 86 3.422 2.4(10) 1.947 (—28) 6.133 (—4) 1.39(2) 
83 199 116 5.501 1.578(7) 6.001 (—25) 3.027 (—4) 8.26(1) 
201 118 5.181 1.260(8) 1.288 (— 26) 1.767(—3) 1.41(1) 
203 120 4.881 6.310(11) 100 2.465 (— 28) 1.843(—5) 1.63 (3) 
211 128 6.651 1.300(2) 82.6 9.879 (—20) 1.843(—4) 3.872(2) 
6.304 17.4 4.296(—21) 8.930(—4) 7.993 (1) 
213 130 5.891 1.382(5) 100 7.601 (—23) 2.728 (—4) 
205 121 5.232 7.260(6) 100 8.493 (—27) 4.648 (—2) 
207 123 5.132 1.893 (8) 100 2.496(—27) 6.066(—3) 
209 125 4.909 3.156(9) 99.4 1.279(—28) 7.057 (—3) 
4.652 0.6 2.896(—30) 1.881(—3) 
127 7.472 5.200(—1) 99.0 2.540(—17) 2.149(—4) 
6.912 0.53 3.302 (—19) 8.848 (—5) 
6.5 0.50 2.125(—20) 1.297(—3) 
127 8.732 2.500(1) 7.0 8.220(—14) 9.763 (—11) 
7.882 2.5 4.437 (—16) 6.460(—9) 
7.171 90.5 2.654(— 18) 3.910(—5) 
8.382 4.200(—6) 100 1.143 (—14) 5.970(—2) 
7.412 1.830(—3) 100 1.892(—17) 8.276(—2) 1.327 (0) 
6.572 1.000(1) 100 2.215(—20) 1.294(—2) 8.840(0) 
5.782 7.200(4) 100 2.006(— 24) 1.984(—2) 6.57 (0) 
5.674 4.320(5) 100 6.411(—25) 1.035(—2) 1.372(0) 
5.894 6.590 (4) 100 7.926(—24) 5.487 (—3) 1.70(0) 
8.032 1.00(—4) 100 5.773 (—16) 4.965(—2) 2.332 (0) 
7.082 1.800(—2) 100 6.231(—19) 2.556(—1) 8.437(—1) 
6.302 5.400(1) 100 6.863 (— 22) 7.734(—2) 1.965 (0) 
6.173 1.620(4) 100 4.180(—23) 4.232(—3) 2.26(0) 
6.070 1.080(4) 1.565 (— 23) 1.696(—2) 8.119(—1) 
5.880 2.230(5) 335 2.220(—24) 1.940(—3) 7.702 (0) 
5.812 a 1.047 (—24) 7.915(—3) 1.887 (0) 
5.646 j 1.577(—25) 1.630(—3) 9.168 (0) 
7.773 1.000(—3) 4.392(—17) 6.526(—2) 4.937 (0) 
6.840 3.92(0) 3.337 (—20) 1.512(—2) 1.675(1) 
6.575 5 3.162(—21) 3.469 (—2) 7.300(0) 
6.450 2 9.866 (—22) 9.893(—2 2.847 (0) 
6.230 1.162(—22) 2.517(—1) 1.006(0) 
6.033 7.56(3) 1.660 (—23) 2.284(—2) 7.553 (0) 
7.333 2.00(—2) 7.244(—19) 1.978(—1) 1.629(0) 
6.363 2.88 (2) 1.612(—22) 5.186(—2 3.108 (0) 
6.153 6 1.953 (—23) 8.153(—2) 1.977 (0) 
5.373 2.64(7) 2.727 (—27) 3.982(—2) 3.852 (0) 
6.744 3.00(1) 2.065 (—21) 4.627(—2) 2.984(0) 
5.894 1.17 (6) 4.322(—25) 5.668 (—5) 7.505 (3) 
5.767 1.043 (—25) 2.114(—3) 6.717 (1) 
5.738 5: 7.276(—26) 1.784(—2) 7.958 (0) 
5.626 1.931(—26) 3.045 (—2) 4.664(0) 
5.559 8.559(—27) 2.576(—2) 5.513 (0) 
5.521 5.359 (—27) 9.142(—3) 1.513(1) 
5.452 : 2.261(—27) 3.251(—2) 4.368 (0) 
6.675 1.32(2) 4.244(—22) 5.116(—2) 2.699 (0) 
5.853 8.64(5) : 9.504(—26) 1.885(—2) 7.745 (0) 
5.817 6.282 (—26) 1.478(—2) 9.878 (0) 
5.756 a) 3.086 (—26) 1.021(—2) 1.430(1) 
5.748 i 2.808 (— 26) 3.071(—3) 4.754(1) 
5.707 p 1.728(—26) 1.535(—3) 9.54(1) 
5.662 3. 1.008 (— 26) 1.251(—2) 1.1671) 
5.634 i 7.182(—27) 2.771(—3) 5.269(1) 
5.605 : 5.042 (—27) 4.606(—3) 3.170(1) 
5.578 07 3.618(—27) 6.418(—4) 2.275(2) 
227 138 4.977 5.68 (10) 1.171(—30) 4.310(—2) 3.202 (0) 


® It is to be most strongly emphasized that this Table is not to be used as a reference for energy and half-life data. In many cases only the crudest partial 
alpha-life or energy values are to be found in the Table of Isotopes. We give in Table I numbers used in the computer, simply in order to facilitate correct 
ing the table for revised experimental measurements. 

» Alpha-particle energy plus orbital electron screening correction. 

© The number in parentheses in each case is the power of ten by which the preceding number is to be multiplied, 
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TABLE I.—Continued. 


Alpha-decay data* assumed for computation Calculated results® 


: : Partial a a@ group Barrier Reduced Hindrance 
Atomic Mass Neutron Eq +Ex half-life intensity penetration width factor 
No. No. No. (Mev) (sec) ° (%) factor P & (Mev) F 





90 225 135 6.605 5.40(2) 100 8.471(—23) 6.266(—2) 2.31(0) 
227 137 6.071 1.62(6) 23 3.848 (—25) 1.057(—3) 1.277 (2) 
6.042 2.8 2.780(—25) 1.770(—4) 7.627 (2) 
6.011 24 1.985(—25) 2.139(—3) 6.311(1) 
5.993 3.5 1.625(—25) 3.811(—4) 3.542(2) 
5.949 0.9 9.912(—26) 1.606(—4) 8.406 (2) 
5.900 3.0 5.679 (—26) 9.347 (—4) 1.444(2) 
2.842 (—26) 6.225(—4) 2.169(2) 
2.471(—26) 2.148 (—4) 6.288 (2) 
1.699 (— 26) 3.125(—4) 4.321(2) 
1.583 (— 26) 2.348 (—2) 5.750(0) 
9.507 (—27) 9.304(—3) 1.451(1) 
9.063 (— 27) 1.698 (—2) 7.95(0) 
8.139(—27) 8.694(—3) 1.553 (1) 
7.484(—27) 3.546(—3) 3.807 (1) 
5.541 (—27) 6.067 (—3) 2.225(1) 
2.316(11) 1.128(—30) 1.097 (—3) 1.144(2) 
3.467 (—31) 7.138(—3) 1.760(1) 
8.881 (—32) 9.754(—2) 1.288 (0) 
1.104(—23) 9.617(—2 1.30(0) 
2.507 (— 27) 2.264(—2) 5.47 (0) 
5.095(—31) 5.209 (—4) 2.29(2) 
3.329(—31) 1.834(—3) 6.51(1) 
2.628(—31) 2.424(—3) 4.93(1) 
1.681(—31) 3.631(—4) 3.29(2) 
1.024(—31) 5.703(—3) 2.09(1) 
7.914(—32) 9.389 (—4) 1.27(2) 
2.241 (—32) 1.658(—3) 7.20(1) 
3.496(—33) 8.350(—2) 1.43 (0) 
2.291 (—33) 1.621(—2) 7.37 (0) 
, 1.401(—33) 3.978(—2) 3.002 (0) 
7.8(1) 1.042(—22) 3.527(—1) 2.70(—1) 
1.8(4) 2.784(— 24) 5.720(—2) 1.91(0) 
6.63 (9) 4.071(—29) 1.062 (—2) 1.11(1) 
5.11(12) ou 4.645 (—33) 1.008(—1) 1.11(0) 
2.335(—33) 3.579(—2) 3.14(0) 
9.248 (—34) 9.704(—3) 1.16(1) 
1.937 (—35) 8.687 (—3) 1.29(1) 
7.062 (—35) 1.214(—4) 8.871(2) 
3.584(—35) 9.638 (—5) 1.116(3) 
1.381(—35) 8.338(—5) 1.292 (3) 
2.469 (— 36) 1.503 (—2) 7.162(0) 
1.942 (—36) 2.833 (—2) 3.800(0) 
1.365(—36) 1.125(—2) 9.573(0) 
. 5.991(—38) 1.239(—1) 8.695(—1) 
1.8(5) 2.411(—25) 6.603 (—2) 1.86(0) 
2.209 (8) 3.833 (—29) 3.385 (—1) 2.76(—1) 
1.009 (12) 5.992 (—32) 4.740(—2) 2.01(0) 
j 6.94(13) 3: 3.612 (—33) 3.545 (—4) 2.555(2) 
4.853 ais 1.481(—33) 9.761(—4) 9.278(1) 
4.824 53 9.274(—34) 2.360(—2) 3.837 (0) 
4.803 6.698 (—34) 1.788 (—2) 5.064(0) 
4.750 ; 2.754(—34) 2.549(—3) 3.552(1) 
4.710 oa 1.435(—34) 9.497 (—3) 9.535(0) 
4.681 8.643 (—35) 2.867 (—2) 3.159(0) 
4.626 5 3.367 (—35) 6.132(—3) 1.477(1) 
4.557 1.007 (—35) 8.204(—4) 1.104(2) 
6.337 1.20(6) 1.101(—25) 2.169(—2) 5.16(0) 
5.887 7.89(7) 7.130(— 28) 5.095(—2) 1.97(0) 
5.687 1.19(11) 6.446(—29) 7.847(—5 1.059(3) 
5.397 1.388 (—30) 37 2 6.061 (0) 
5.184 7.67 (11) 7.223 (—32) 3.75 2.476(0) 
5.171 5.967 (—32) 05 8.827(0) 
5.133 3.398 (—32) 7.890(0) 
5.101 2.105 (—32) 6.569 (—5) 1.414(3) 
5.036 7.838 (—33) 6.198(—5) 1.498 (3) 
4.954 2.192(—33) 8.525(—5) 1.089(3) 
4.930 9.47 (12) 1.621(—33) 1.400(—1) 7.14(—1) 
4.885 ‘ 7.901 (—34) 9.576(—2) 1.04(0) 
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6.030 
5.981 
5.938 
5.882 
5.846 
5.818 
5.789 
5.727 
6.800 
6.520 
6.673 
6.632 
6.592 
6.580 
6.537 
6.519 
6.469 
6.289 
6.249 
6.216 
6.931 
6.891 
7.071 
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TABLE I. 


\lpha-decay data* assumed for computation 


Partial a 
half-life 
(sec)° 


9.47(7) 
1.041 (9) 
1.455(10 


2.493 (11) 


3.156(8) 
1.105(9) 


4.418(11 


1.578 (7 


3.58(8 


3.156(11 


.23(12) 


7.92(3) 
.14(10) 


.52(6) 
3.54(6) 
.74(4) 
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Continued. 


a@ group 
intensity 


(%) 


100 
100 


0.42 
0.24 
84.3 
13.6 
1.4 
0.015 
0.004 
0.17 
0.16 


Barrier 
penetration 
factor P 
1.670(—27) 
7.766(—29) 
5.240(— 30) 
3.421(—30) 
2.380(—30) 
1.328(—30) 
6.317 (—31) 
2.543 (—31) 
1.361(—31) 
3.891(—31) 
2.514(—31) 
1.383(—31) 
7.547 (—32) 
3.377 (—32) 
3.118(—28) 
1.241(—27) 
1.144(—27) 
6.430(— 28) 
5.194(—28) 
1.826(—28) 
1.300(—28) 
4.149(—29) 
2.381(—29) 
1.163 (—29) 
1.104(—29) 
6.401(—30) 
5.893 (—31) 
1.684(—31) 
8.281(—32) 
5.337(—25) 
9.362 (—26) 
2.067 (—27) 
1.476(—26) 
1.576(—27) 
5.653 (—29) 
3.729(—30) 
4.398(—31) 
2.289(—32) 
1.362(—31) 
4.183 (—34) 
9.046(—24) 
8.100(—28) 
4.279(—28) 
1.943 (— 28) 
7.260(—29) 
3.991 (—29) 
2.345 (—29) 
1.163(—29) 
7.369(—30) 
5.152(—30) 
3.546(—30) 
.580(—30) 
.275(—25) 
7.243 (—27) 
3.982 (— 26) 
2.587 (— 26) 
1.689 (— 26) 
1.486(— 26) 
9.355(—27) 
7.697 (—27) 
4.458 (—27) 
5.895 (—28) 
3.715(—28) 
2.529(— 28) 
1.834(—25) 
1.322(—25) 
8.391(—25) 


Calculated results* 


Reduced 
width 


1.387 (—4) 
6.978(—2) 
2.017 (—2) 
4.365 (—3) 
1.162(—4) 
5.791(—5) 
5.023(—5) 
7.316(—5) 
7.226(—2) 
1.752(—2) 
4.426(—3) 
2.913(—2) 
2.090(—5) 
1.134(—4) 
3.630(—5) 
2.996(—4) 
1.420(—5) 
9.995(—5) 
4.564(—2) 
1.253(—2) 
3.569(—3) 
4.700(—4) 
6.078 (—4) 
1.651(—3) 
2.966(—2) 
6.267 (—3) 
1.021(—4) 
1.028(—3) 
1.498 (—2) 
1.790(—4) 
2.083 (—3) 
3.682 (—2) 
9.010(—4) 
1.198(—2) 
1.984(—2) 
1.608 (—2) 
3.342(—1) 
4.001 (—2) 
5.898 (—6) 
6.463 (— 6) 
5.175(—6) 
2.770(—6) 
2.079(—4) 
1.286(—4) 
6.485 (—4) 
2.866(—2) 
2.440(—4) 
3.119(—3) 
6.366(—4) 
4.131(—3) 
1.570(—2) 
3.753(—2) 
4.931(—3) 
7.845 (—4) 
1.004(—3) 
4.427(—4) 
2.152(—4) 
3.716(—4) 
1.124(—3) 
2.230(—3) 
1.310(—3) 
6.200(—3) 
6.123(—3) 
4.413(—2) 


Hindrance 
factor 
F 


4.84 (0) 
2.345 (0) 
5.598 (2) 
6.395 (2) 
1.267 (0) 
4.381(0) 
2.025(1) 
7.608 (2) 
1.526(3) 
1.57 (3) 
1.08 (3) 
1.09(0) 
4.51(0) 
1.78(1) 
2.701(0) 
3.220(3) 
5.935 (2) 
1.854(3) 
2.246(2) 
4.739(3) 
6.733 (2) 
1.475(0) 
5.371(0) 
1.886(1) 
1.432(2) 
1.107 (2) 
3.923(1) 
2.189(0) 
1.036(1) 
6.824(2) 
6.775(1) 
4.652(0) 
3.425(2) 
2.943 (1) 
1.665 (0) 
4.96(1) 
3.73(0) 
2.25(0) 
3.694(0) 
1.777(—1) 
1.44(0) 
8.833 (3) 
8.061 (3) 
1.007 (4) 
1.881 (4) 
2.506(2) 
4.051(2) 
8.033(1) 
1.818(0) 
2.135(2) 
1.670(1) 
8.184(1) 
1.08 (1) 
3.78 (0) 
1.98(0) 
1.50(1) 
9.440(1) 
7.376(1) 
1.672(2) 
3.441 (2) 
1.993 (2) 
6.589 (1) 
3.321(1) 
5.653 (1) 
8.31(0) 
8.251(0) 
1.145(0) 
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possible more careful comparisons of relative hindrance 
factors in future theoretical studies. 

For some purposes it may be desirable to know the 
effect of a nonzero angular momentum on the calcu- 
lated values of Table I, and we therefore offer the 
following approximate formula: 


P1/Po=exp[—2.027L(L+1)Z712A-°], 


This formula uses the calculated! centrifugal barrier 
factor P,/P, of 5.917 for /=4 on a gsRa™* group of 
energy 5.481 Mev to normalize the approximate rela- 
tionship given in Eq. (22.10) of reference 1. The Z and 
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A in the formula refer to atomic number and mass 
number of the daughter nucleus, respectively. 
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Band structure predictions assuming the formation of proton single-particle states above the Ca“ core in 
the ground state or one of its excited states are compared with the available data on the elastic and inelastic 
scattering of protons from Ca*. The band expected above the 3.35-Mev state in Ca” is confirmed by experi 
mental results, and some evidence is found for bands above the higher core states. 


INCE the introduction of the unified model of the 
nucleus by Bohr and Mottelson,! numerous ex- 
amples of the rotational and vibrational bands predicted 
by this model have been reported.’ Band structure may 
also appear for rather different reasons in excitation 
curves for the inelastic scattering of charged particles. 
Because of its charge, the projectile must overcome the 
Coulomb barrier when entering and leaving the target 
nucleus, and at sufficiently low energies, the small 
penetrabilities strongly inhibit particle emission because 
of energy lost to the excited residual nucleus. Maxima in 
the inelastic yield populating a particular state in the 
residual nucleus are expected when the compound 
system can form a state composed of the core in the 
excited state to be populated plus the projectile in a 
single-particle state. The excitation energy in the com- 
pound system will be the sum of the core excitation 
plus that for the single-particle state aside from a 
coupling between the two. When a sufficiently high 
density of states is available to the entrance channel 
to form a compound state with the required excitation 
energy and angular momentum, the excitation curve 
should exhibit a band structure corresponding to the 
formation of single-particle states on various excited 
states of the core. The detailed structure of states in 
* Supported in part by the Air Force Office of Scientific 
Research, ARDC, Contract AF 49(638)-427. 
1A, Bohr and B. Mottelson, Kgl. Danske Videnskab. Selskab, 
Mat-fys. Medd. 27, No. 16 (1953). 


2 Alder, Bohr, Huus, Mottelson, and Winther, Revs. Modern 
Phys. 28, 535 (1956). 


the entrance channel will impose a fine structure on 
members of the bands. For ease in identifying the band 
structure, the core states and single-particle states 
should be well separated. Favorable conditions for the 
observation of band structure obtain in experiments on 
elastic®4 and inelastic®.* scattering of protons from Ca”. 

Bands predicted by the recurrence of Sc* states 
above the Ca® core in one of its excited states are given 
in Table I. The first two columns give the excitation 
energies and assignments of states in Sc" observed by 
elastic scattering of protons. In subsequent columns, 
the excitation energies expected for a recurrence of 
these states above an excited Ca” core are presented, 
The quantities in parentheses are the corresponding 
bombarding energies assuming a proton separation 
energy’ of 1.63 Mev and the usual center-of-mass 
correction. Uniquely predicted assignments are also 
included in the parentheses. 

The curve in Fig. 1 is copied from the paper by Bent 


5 Davis, Prosser, Spencer, Young, and Johnson, Bull. Am. Phys. 
Soc. Ser. II, 2, 304 (1957). 

* Class, Davis, and Johnson, Phys. Rev. Letters 3, 41 (1959) ; 
Johnson, Kashy, Perry, and Class, Phys. Rev. (to be published). 

5R. D. Bent and T. H. Kruse, Phys. Rev. 109, 1240 (1958). 

6 J. H. Johnson and C. M. Class, Bull. Am. Phys. Soc. Ser. IT, 
4, 79 (1959). (Assignments for the 3.73- and 3.90-Mev states are 
3° and 1°, 2t, respectively. These were incorrectly reported in the 
publication. Private communication from C. M. Class.) 

7 Nuclear Level Schemes, A=40-A=92, compiled by Way, 
King, McGinnins, and van Lieshout, Atomic Energy Commission 
Report TID-5300 (U.S. Government Printing Office, Washington, 
D: C.,. 2955). 
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TaBLE I. Predicted excitation energies for inelastic proton 
strength in Sc“. States formed by the proton on the Ca® core in 
its ground state are given in columns one and two. Values in 
columns three through six represent a recurrence above the 
higher core states. Corresponding bombarding energies and 
uniquely predicted assignments are given in parentheses. 


\. States in Ca® 
Spin 
\. Energy 
(Mev) 


States 
in Sc# 
Energy 
(Mev) 


Spint.e\. 
Region of Sc“ excitation covered 
y experiments 
6.69 6.86! 
(5.19) (5.36) 


7.44 
(5.96) 


6.31 
(4.80, 47) 
6.99! 7.74 
(5.49) 


6.61! 
(5.10, 4*) 
7.69 8.44 
(6.21) 


7.31 


| 7.74 


8.00 
Region above experiments 


® See reference 7. 

+R. H. Helm, Phys. Rev. 104, 1466 (1956). 

© See reference 6. 

1 See reference 4. 

¢ See reference 3. 

' Corresponds to maxima in inelastic scattering cross section. 


and Kruse,° and it gives as a function of proton energy 
the yield of electron pairs produced by inelastic scatter- 
ing presumably to the first excited state of Ca®. Experi- 
mentally determined resonance bombarding energies 
are denoted by “E, Exp.” and the predicted values 
from Table I by “E, Th.” Where angular momentum 
assignments are known,® they are shown directly above 
the peaks. 

Neighborhoods of inelastic strength are observed at 
bombarding energies of 4.80, 5.10, 5.82, and 5.87 Mev 
which correspond to a band above the first excited state 
in Ca®. The fine structure may be due to modulation 
by the entrance channel. Predicted spin assignments 
agree with the few observed values available. 

Other band structure built on higher excited states 
in Ca is predicted in Table I and indicated in Fig. 1. 
Where either the core state or the single-particle state 
has zero or one-half unit of angular momentum, re- 
spectively, inelastic strength is observed at approxi- 
mately the predicted energy. Predictions of recurrences 
involving higher values for both angular momenta do 
not fit the observed results as well, possibly due to a 
coupling of the angular momenta. For example, the 
predicted value of 5.36 Mev falls somewhat below the 
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observed value at about 5.40 Mev instead of slightly 
above as is the case when one of the angular momenta 
is minimized. No prominent strength is observed at 
5.19 and 5.96 Mev, possibly due to a coupling of angular 
momenta or a suppression by penetrabilities. The 
asterisks in Fig. 1 denote the various quantities associ- 
ated with /=3 strength observed® at 5.43 Mev and 
predicted at 5.49 Mev. Interpretation of the recurrence 
of single-particle states above the higher core states is 
complicated by the fact that more than one excited 
state in the residual nucleus may be populated. Where 
more than one mode of decay is possible as appears® 
to be the case in the vicinity of 5.40-Mev bombarding 
energy, enhancement of yield of all decay modes may 
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Fic. 1. Correlation between observed®.* and predicted bombard- 
ing energies for neighborhoods of prominent inelastic strength. 
The values marked with an asterisk correspond to a superposition 
of the 3.26-Mev state in Sc on a Ca® core in its second excited 
state. Experimentally determined® angular momenta are shown 
above the peaks. Displacement arrows adjacent to the predicted 
values indicate the fit when the energy of the predicted values is 
lowered by 0.03 Mev, the average value of the discrepancy 
between predicted and observed values. No prominent inelastic 
strength appears to be correlated with the predicted values 5.19 
and 5.96 Mev. 


result from the formation of a state consisting of an 
excited core plus a single-particle state which can feed 
the several decay modes. 

The predicted band structure above the first excited 
state in Ca® is observed in Sc# and there is some 
evidence for bands above higher core states. While the 
zero-spin character of the first excited state in Ca* 
should minimize difficulties in a shell-model description, 
the band structure could be simply understood if this 
state is collectively excited. According to this interpre- 
tation, the next strong pair-producing resonance should 
occur at a bombarding energy of 6.15 Mev which 
would account for the absence of strength from 5.58 
to 6.00 Mev. 
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Energy Levels in a Bounded Isotropic Harmonic Oscillator Potential 
and Nuclear Shell Structure 


S. SENGUPTA AND S. GuosH 
Physics Department, Hooghly Mohsin College, Hooghly, West Bengal, India 
(Received March 14, 1958) 


Energy levels for a three-dimensional bounded harmonic oscillator are obtained for intermediate distances 
of the boundary. Energy levels are found to depend on the dimensionless parameter po= (wM /h)R?. po is 
the ratio of the strength of the oscillator levels (Aw) to that of the square well levels (4?/M R?). 

Adding a spin-orbit energy 30 times the Thomas value, nuclear energy levels are worked out for the two 
mass regions at A = 50 and 90. For po=6, good agreement with the experimental level sequence is obtained. 


T is well known that neither a harmonic oscillator 
nor a square well potential combined with suitable 
spin-orbit interaction energy gives a good shell structure 
and the experimentally observed ground-state level 
sequence. The empirical level scheme simultaneously 
proposed by Mayer! and Haxel, Jensen, and Suess? is 
obtained by introducing spin-orbit energy to a level 
sequence which is somewhat intermediate between that 
of a harmonic oscillator and a square well. Recently 
some attempts have been made’** to obtain such a level 
scheme with different types of potentials. The present 
authors have found that the energy levels in a bounded 
harmonic oscillator correspond very closely to such a 
scheme. 
We investigate a potential of the form, 


r<R 
r>R. 


V=—Vot dur’, 
V=0, 


(1) 


Such a potential as a possible alternative to represent 
the collective nuclear potential was proposed by Mayer 
and Jensen,° but the energy levels have not been worked 
out. Auluck and Kothari® previously considered the 
case of a bounded linear harmonic oscillator. Their 
energy expression, however, is valid only for the 
limiting cases of boundary at very long and close 
distances. 

Equation (1) represents a harmonic oscillator bounded 
by a potential wall of finite height. For simplicity in 
analytical treatment we shall assume the boundary to 
be of infinite height, i.e., the walls of the spherical 
enclosure are considered to be rigid. As the constant 
term —Vo does not affect the level order, we shall 
neglect it in our treatment. The wave function for this 
case can be written in the form, Y= RnrY im(0,6), where 
VY, are the normalized spherical harmonics and R,, is 

1M. G. Mayer, Phys. Rev. 75, 1969 (1949). 

? Haxel, Jensen, and Suess, Phys. Rev. 75, 1766 (1949). 

3A. E. S. Green and K. Lee, Phys. Rev. 99, 772 (1955); 99, 
1410 (1949). 

4 Ross, Mark, and Lawson, Phys. Rev. 102, 1613 (1956). 

5M. G. Mayer and J. H. D. Jensen, Elementary Theory of 
Nuclear Shell Structure (John Wiley & Sons, Inc., New York, 
1955). 

6F. C. Auluck and D. S. Kothari, Proc. Cambridge Phil. Soc. 
41, 175 (1945). 


given by 
Rny=e- Pp"? 1Fy(—Xni, +3; p), 

where 
wM R? Ent 
pee ee 


h 2hw 4 


al+3 


w is the classical circular frequency of the oscillator 
and E,; is energy. 17; is the well-known confluent 
hypergeometric function. Rigid wall implies that 
Rii(po)=0, where po=wM R*/h, R being the nuclear 
radius. The zeros of confluent hypergeometric functions 
have been investigated by several authors.’ Tricomi 
has given an approximate expression for the zeros 
(result quoted in Bateman*) which after suitable 
modification gives the following zeros with respect to A 
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Fic. 1. Comparison of energy levels for infinite well and bounded 
harmonic oscillator. Energy is expressed in units of #?/2M R®. 
1s, 1p, and 1d are from numerical calculation of the zeros of :/1. 
The rest are obtained from Eq. (4) with po=6. Distinct improve- 
ment in the direction of shell formation is clearly noticed. 











7F. Tricomi, Ann. Mat. Pura Appl. 26, 141 (1947); G. E. 
Tsvetkoff, Compt. rend. acad. sci. U.S.S.R. 32, 10 (1941). 

8H. Bateman, /ligher Transcendental Functions (McGraw-Hill 
Book Company, Inc., New York, 1953), Vol. 1. 
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Fic. 2. Comparison of nuclear energy levels for infinite well 
and bounded harmonic oscillator in the mass regions 50 and 90. 
An asterisk indicates the level in which a shell is completed. 


for fixed values of po and /: 


Jnt 


+3 (21+3)(2/—-1) po 
—+ —— + 


Anl= 7 2 
Mj 12 


, 


4po 4 


where jn: is the mth zero of the Bessel function of order 
1+-4. This expression can be shown to be valid for not 
very large values of po (po<3n+/+1). With this 
restriction, the energy levels are given by 


Eat 


h? (2/+-3)(21—1) py" 


po+ ‘ (4) 
2MR° 6 jn” 3 
The first term in (4) corresponds to the energy levels 
for an infinite well. In Fig. 1 the level sequence obtained 
from (4) is compared with that for an infinite well for 
py=6. This value of po, though arbitrarily chosen, is of 
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the right order of magnitude. Other authors,’ in 
connection with calculations with the unbounded 
harmonic oscillator wave functions, used po™5 (Kurath,’ 
with a range of nuclear force ~2X10~" cm) and po™4 
(Nagai and Yamaji’). In order to get nuclear energy 
levels, we shall have to consider the spin-orbit inter- 
action energy. Using a term 30 times the Thomas value 


h? 
2M?c? 


the total spin-orbit energy for a bounded harmonic 
oscillator can be calculated by perturbation methods 
and it is given by 


30h? (2143) (2/—1) 
FE, — se (L-8)| jn? aR 


“f, : por+ in| 
M?c?R? 6Jnr 


h? 


X———, (9) 
2M R° 
With the form of the potential as in (1), we need not 

consider Coulomb repulsion for protons as an addi- 

tional perturbation. The Coulomb potential in a uni- 
form charge distribution has r-dependence exactly 
similar to (1), and it is included within the nuclear 
potential. 

The energy levels calculated with (4) and (5) for 


po=6 and R=1.3X10—-"A} 


are shown in Fig. 2 for the two mass regions 50 and 90, 
Results are seen to be very encouraging. All the cross- 
overs of levels correspond very closely to the observed 
level sequence. Both the shells at 28 and 50 are much 
improved. The only defect seems to be with 1/j1/2 
which appears to be somewhat depressed. With in- 
creasing A, however, this level moves up and actually 
crosses both the 3s; and 2d; levels.* 


®D. Kurath, Phys. Rev. 91, 1430 (1953). 

! H. Nagai and Y. Yamaji, Bull. Kyfasha Inst. Technol. No. 3, 
15 (1957). 

* Note added in proof.—Dr. T. D. Newton has kindly pointed 
out a recent work by him [Can. J. Phys. 37, 944 (1950) where 
he has considered the energy levels in a cutoff oscillator potential. 
His conclusion that there is ‘remarkably small change of energy 
due to cutting off the oscillator potential,’ is essentially due to the 
fact that his cutoff boundary is placed at a point where the wave 
function for the unbounded oscillator is already very small. With 
his values of the parameters (r*)/R,?=0.13 for n=1=0; R, is the 
cutoff radius. In our notation his calculations are those for 
po= 11.6. Energy levels and wave functions converge to the other 
limit of a square well [our Eq. (4) ] for po > 0. 
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Binding Effect Corrections in the Energy-Loss Distribution Functions for 
Charged Particles Passing through Thin Absorbers* 


WALTER ROSENZWEIG 
Radiological Research Laboratory, Columbia University, New York, New York 
(Received April 24, 1959) 


In the theoretical treatment of the statistical fluctuation in energy-loss of charged particles passing through 
thin absorbers, terms which are of the order of the electron binding energy are ordinarily neglected. The 
corrections required, when this is no longer justified, are discussed in particular reference to the development 
by Symon, which has as its limiting cases the Gaussian distribution derived by Bohr, and Landau’s distri- 
bution. Formulas are presented which give the correction for each of the three weighted parameters ap- 
pearing in Symon’s derivation. The corrected parameters can then be used in conjunction with Symon’s 
curves to obtain the energy-loss distribution appropriate for a given set of conditions. An illustration is 
presented for the cases of 5-Mev protons and 5-Mev alpha particles. 


HE statistical fluctuations in energy-loss of fast 

ionizing particles passing through a given thick- 
ness of absorbing material have been the subject of 
theoretical calculations by many authors for some time. 
In particular, the works of Bohr,'? Williams,* and 
Landau‘ showed that in two limiting cases the function 
P(e,x), where P(e,x)de is the probability that a particle 
of given initial energy will lose an amount of energy 
between ¢ and e+de while traversing a thickness x of 
absorber, can be represented by a normal error curve 
(Gaussian) when x is sufficiently large, and another 
universal function (Landau curve) when x is sufficiently 
small. A connection between the two extreme cases has 
been effected by Symon*®® who demonstrated that all 
intermediate cases can be suitably represented by a 
single parameter set of curves of varying skewness, 
providing a smooth transition between the Gaussian 
and Landau curves. This apparently simple result 
arises through the introduction of a set of weighted 
parameters which completely characterize, to a good 
degree of approximation, the energy-loss distribution 
function, P(e). These are the weighted skewness, 
Aw, Which indicates the choice of curve representing 
P(e,x), the weighted rms fluctuation, o,, which is a 
scale factor for the width of the curve, and the weighted 
parameter j which locates the peak of the curve by 
relating the average energy loss, é, with the most 
probable energy loss ¢,. Each of the three parameters 
is related, respectively, to the third, second, and first 
generalized moments, as introduced by Symon, of the 
collision spectrum X(é’), where X(E’)dE’dx is the 
probability for the ionizing particle to undergo a 
collision, while traversing a distance dx of the absorber, 


* Based on work performed under Contract AT-30-1-GEN-70 
for the U. S. Atomic Energy Commission. 

1N. Bohr, Phil. Mag. 30, 581 (1915). 

2N. Bohr, Kgl. Danske Videnskab. Selskab, Mat-fys. Medd. 
18, No. 8 (1948). 

3E. J. Williams, Proc. Roy. Soc. (London) 125, 420 (1929). 

4L. Landau, J. Phys. U.S.S.R. 8, 201 (1944). 

5K. R. Symon, thesis, Harvard University, 1948 (unpublished). 

6 Symon’s results are summarized in B. Rossi, High-Energy 
Particles (Prentice Hall, Inc., New York, 1952). 


in which it loses an amount of energy lying between E’ 
and E’+dE’. 


The defining equations are given{as 
Aw=a1X3(s)/LX2(s) ]}, 
ow=[xX.(s) }}, 


aig x 
j=|ln——#’—-[X1(0)—X1(s) ] 
g g 

E,,’ 1 
=|n——/*—-(&—e,), (3) 
g g 


where € is an energy varying linearly with the thickness 
of the absorber, = (2u./8?)x, uwe=rest energy of the 
electron, 8=velocity relative to that of light of the 
ionizing particle, x=absorber thickness expressed in 
collision units; i.e., the average number of spheres of 
classical electron radius which a straight line in length 
equal to the absorber thickness pierces, multiplied by 
the square of the charge number of the ionizing particle, 
Em’ =maximum energy which the ionizing particle can 
impart to an atomic electron in a single collision, and 
the X, are the generalized moments of the collision 
spectrum defined as 


X,(s)= f e!'E™X(E)dE', n>1, (4) 
0 


in which s is a parameter appearing in Symon’s develop- 
ment which takes on such values as to make the peak 
of P(e,x) occur at the correct value for the most 
probable energy loss, €>. 

In view of the fact that Y(/’) is not well known for 
values of ’<n, where y is an energy much larger than 
the maximum binding energy of the atomic electrons, 
the assumption has been made that in evaluating the 
generalized moments, terms of the order 9 can be 
neglected (except for the case of X,). This assumption 
is justifiable when one is dealing with high-energy 
ionizing particles and £ not too small. But it has already 
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been pointed out by Bethe’ that in the Gaussian case 
(s—+0 in Symon’s development), the rms fluctuation 
calculated on the basis of neglecting binding effects, i.e., 
neglecting terms on the order of 9, is already too small 
by about 20% for the case of 5-Mev alpha particles in 
air. It is the purpose of this paper to examine the 
changes produced in Symon’s parameters by taking 
more fully into account the neglected terms, and 
consequently also in the limiting cases of Bohr’s and 
Landau’s, the former having already been done by 
Bethe. 


CORRECTIONS WHEN sy<1 


Symon uses a function x(k’) in place of X(£’) 
defined by 


x(E’)=0, E'> En! 


Zn, 4 E’ 
-—_(1-# ), n& E'S Em 

BE” Ew! 
which, as he shows by citing the literature, is valid for 
all not highly relativistic particles whose primary 
means of energy loss is by collisions with atomic 
electrons, and which is sufficient if terms of the order 
of can be neglected. For the first moment he uses, in 
addition, Bethe’s result, 


] Que , 
f E'X(E’)dE’ =— |n(n/0), (6) 
0 6° 


where 
(1—6*) I? 
e=———— exp(6”), 
Qu. 


and J=mean ionization potential of the atomic elec- 
trons ([13.5Z). The required corrections can now be 
obtained by determining how the generalized moments 
calculated on the basis of a more exact formulation for 
X(E’) differ from those calculated on the basis of the 
function x(Z’). In more quantitative terms, let 


6:(s)=[X1(0)—Xi(s) J—D, (0) — Xi (s) ], 
62(s)= X2(s)—X2(s), 


(7a) 
(7b) 


63(s) = X3(s)—X3(s); (/c) 


then the corrections to Aw, ow, and j can be easily 
discussed in terms of these deltas if they are not too 
large. Now it can be observed that the evaluation of the 
X,(s) when terms of the order 7 are neglected actually 
amounts to the carrying out of the required integrations 
for [X,(0)—X,(s) ], X2(s), etc., instead of from 7 to E,,’. 
from 0 to E,,’ and allowing x(Z’) to keep its functional 
dependence on £’, as defined for £’>n, even for E’ <n. 
Bearing this in mind, one gets the following results for 
the deltas: 


7M. S. Livingston and H. A. Bethe, Revs. Modern Phys. 9, 
283 (1937). 
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En!’ 


5,(s) = a-emye[x(e) 


Que 1 E! 
_ ( 1-?? ) jee, 
B? E” ES 


and similarly for 52(s) and 63(s). If we define the func- 
tion D(E’) such that 


v Sas Zits 1 E' 
D(E)=X(E)-——(1-# ), (8) 
B? EB” | ed 


is the excess of the true collision spectrum over the 
Rutherford collision spectrum, then it is quite rea- 


sonable to assume that D(E’) 0 for E’>n. Hence, 
in terms of D(E’), 


Em!’ 
5:(s)= f (1—e-*®')E'D(E')\dE’, (9a) 
0 


(9b) 


En’ 
52(s) = f e-*8’E!D(E)dE’, 
0 


7? 


I m 
53(s) -f e~* ESD (E’)dE’. (9c) 
0 


Now, by making use of the approximation sy<1, one 
can expand each of the deltas in a rapidly converging 
series to obtain 


Ss 
51(s) = 552(0) ——* aa ( 10a) 


5o(s) =62(0) —sd3(0)+---, 
53(s) =63(0) —s84(0)+---, 


(10b) 
(10c) 


which leaves us with the task of evaluating the 6,(0). 
Actually the convergences are so rapid that only the 
first term in each expansion need be calculated. This 
follows from the fact that when sy<1, e~*”’>e-”'/1, 
and since D(k’)—0 for E’— n, e~*”’=1 over that 
range of £’ for which the integrand is large. 

Let 6j, 60,,, and 6A, represent the changes introduced 
in Symon’s parameters when they are evaluated by 
means of X(E’) and assume that these changes are 
small. Then 

- 
“en (11a) 


5j= (exactly) 


OO w 1 52(s) 


Ow 9 Xo(s)’ 


Aw 53(s) 3 62(s) 


Aw X3(s) 2Xa(s). 


(11b) 


(11c) 


a first order approximation, 67=— (xs/£)62(0). 
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52(0) can be taken directly from Bethe’s result, which, 
for a nonrelativistic ionizing particle whose velocity is 
greater than the orbital velocity of any atomic electron 
in the absorber, amounts to 


Que Ya Lobe | ey 
[x > K,—— In 
on at ZLAt n ue? I, 


where > 4,=summation over the atoms in the molecule 
of the absorber, /,=average excitation energy of the 
nth shell, K,&4/3 for inner shells, and the other terms 
as defined previously. For nonrelativistic heavy par- 
ticles of rest energy u>>u., this can be simplified by 
noting that F,,’=2u,6*. It must also be remarked that 
Symon found the variable s to be related to — by the 
equation 

= 1/aé, 


(13) 


in which a itself varies with & but only within the limits 
of 2.48 to 4 for all permissible &, a result which is not 
very likely to be changed by the corrections made here. 
On the other hand, this relationship translates the limit 
of validity of the present first order approximation 
(sn1) into the terms &>/ km, where Jxm is the largest 
binding energy of an electron in the absorbing material. 
Substituting Eqs. (12) and (13) in 67=—(xs/£)6.(0), 
one obtains, for nonrelativistic heavy particles, 


1 1 Ew 
bin ee eT Ut. ie. 
aé Dat Z At n Fs 


To appreciate the significance of this correction one 
has to compare 6j with the values of 7 presented 
graphically as a function of 1/c=&/E,’ by Symon. 
For £/E,’<0.2, 7 approaches a constant positive value 
of 0.37 so that 67 becomes more significant for de- 
creasing § (E>/xm). In the vicinity of =F’, 7 goes 
to zero and reverses sign. By this time 67 is already 
quite small and Symon’s curve is not altered appre- 
ciably. For increasing ¢, and §/E,,’>1, j approaches 
asymptotically the function j=—In(&/En’), so that 
the fractional change 6j/j becomes rapidly less and 
less important. 

The situation is quite different for d0./o.. Let us 
make use of the parameter 5 in Symon’s computations, 
where 0 is defined by 


Ow=[4Xo(s) }}=bE, 


(14) 


(15) 


so that Xo(s)=b?/x. Then, again to a first order 
approximation, by inserting Eqs. (15) and (12) into 
50 w/7w=43[62(0)/X2(s) ], one obtains for the case of 
nonrelativistic heavy particles. 


boy 1 En! 
— —— FF Xf 8. la —. 


=— (16) 
Ow be > at Z At n Ye 


A plot of 6 vs 1/c=£/E,,’ is also given by Symon, which 
shows that 6 reaches a constant value of 1.48 for 
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§/E»' <0.2. Similar to the previous case, an increasing 
fractional change in o, is indicated for decreasing &. 
As € increases, 6 decreases and is about halved (~0.7) 
by the time ¢/F,’=2. Beyond this, 6 decreases more 
slowly with increasing £ approaching the function 
b=(E,,/£)! which gives rise, contrary to the case of 
67/j, to a constant nonvanishing fractional change in 
ow. This is the correction in the width of the Gaussian 
curve as pointed out by Bethe.” 

In order to determine 6A.,/Aw, it is necessary, even 
to first order approximation, to evaluate 63;(0). This 
could be done by following a procedure used by Bethe 
in arriving at the second moment of the collision 
spectrum, which will not be done here. On the other 
hand, to obtain an order of magnitude for 6;(0), one 
can construct a function for D(£’) consisting of delta 
functions located at the various values of 7, and being 
of such heights as to yield, upon integration, the value 
of 6,(0) as in Eq. (12). Such a function is given by the 
following: 


oa 
D(E)=— —— "In 
us  &£’ E! 


Ez. K,Z,5(E’—I,) En’ 
——— | =| (17) 
B? Dat Z At n 

the bar over the D serving to distinguish this function 
from the real D(E’) of which it is assumed to be a 
reasonable approximation. When this form is used to 
evaluate 6;(0), one obtains for heavy nonrelativistic 
particles the result, 


Aw 1 Pa 
OO, of. sft oe xm eee 
Aw be Dat Z At n I, 


ree Sere 2 
(op 
ag } 


The formula has been written in this way so that one 
can distinguish the separate contributions from each 
of the two terms in Eq. (11c), and, more importantly, 
to observe that the summation to be performed is the 
same as that involved in the evaluation of 60y/ow 
except that each term is multiplied by the quantity 
in the large brackets. For the behavior of \» vs £/Em’ 
we again turn to Symon’s results to find \,,= 1.48 for 
t/Em’<0.1, decreasing in the region 0.1<£/E,»’<1, 
and approaching the value $(&//n’)! for &/Em’>1. For 
the region £/E,,/>1 [recalling that b— (E,’/£)*], we 
find that the bracketed quantity becomes 


Fe 
fatal +s, 
E.! 


DXw/Au 


so that 

— —360,/e.. 

Since 6¢,,/¢ approaches a constant value in this region, 
so does 6A,,/Aw. It is of interest to observe here that as 
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long as the use of D(E’) gave the right order of magni- 
tude, its inexactness is of no importance in this region 
since the term due to it is neglected. For ¢/E’<1, the 
bracketed term becomes 


2 I. 


2.18 & 
when ¢/E,,’ — $0.1. 
DISCUSSION OF THE CASE sy-1 


In this case, it can be seen by Eqs. (9) and (10) that 
more and more terms in the expansions of the deltas 
are required for any degree of accuracy. However, one 
need not exert himself in that direction because insofar 
as Eq. (13) is still valid, as sy— 1, E—>1km (km 
= maximum ionization potential for an electron in the 
absorber) and not only do the approximations of the 
previous section break down, but the very approxi- 
mations made by Symon in arriving at a one-parameter 
set of curves for P(e,x) begin to become untenable. 
Consequently, one needs to occupy himself instead with 
the problem of determining P(¢,x) for the situation in 
which > /xm. with the realization that in the limit 
x0, 


Em’ 
P(ex)—> x(9+5(0| 1-x f x(erae’ | 
0 


NUMERICAL EXAMPLE 


For the sake of an illustration, let us consider the 
corrections required for 5-Mev alpha particles and 
5-Mev protons, respectively, with methane (CH,) as 
absorber. It will be assumed that any appreciable 
correction is due only to the binding of the electrons in 
carbon. In the calculations, the following values have 
been used: Jx=470 ev; J,=54 ev; for the alpha 
particle E,,’= 2.72 kev; and for the proton £,,’= 10.87 
kev. Figure 1 shows 6), 6¢/o~, and dAw/Aw as a function 
of & for the two particles. In order to get an idea of the 
magnitude of energy losses involved, one makes use of 
the Bethe formula which gives the average energy-loss, 


é= 2¢ In(E,,’/1); 
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and using 7=40 ev for methane, one obtains for the 
alpha particle 

é,= 8.44€ kev, 
and for the proton 

é,= 11.2é kev. 


It should be noted that the corrections were calculated 
here by use of Eqs. (14), (16), and (18) and when 
b¢w/o» exceeds about 0.2, it is necessary to evaluate 
the changes more accurately. This is done by noting the 
direct changes produced in the parameters o~ and Aw 
when they are calculated according to Eqs. (1), (2), 
and (3), as compared to the results obtained by using 
the function x(£’). In these calculations the XY,(£’) 


90.24 











CORRECTIONS 








§ (kev) 


Fic. 1. Corrections in the parameters 7, ow, and \» as a function 
of § for 5-Mev alpha particles and 5-Mev protons passing through 
methane. 


can be obtained from the X,(Z’) and 6,(E’) according 
to Eqs. (7), and 6,(s)=6,(0) is still a good approxi- 
mation as long as £>/ xm. 

Finally, let us summarize, in qualitative terms, the 
changes produced by the above corrections. The dis- 
tribution function P(e,x) when calculated on the basis 
of Symon’s parameters, when binding effects can no 
longer be neglected, undergoes the following changes: 
(1) the most probable energy-loss lies further from the 
mean energy loss; (2) the width of the curve is in- 
creased; and (3) the skewness is decreased. 
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An investigation of the decay schemes of Tc® and Tc*” has been made by using high-resolution conversion 
electron spectrographs, gamma-ray scintillation detectors, and coincidence techniques. 

In addition to the 38.9-kev isomeric transition in Tc®, eight transitions of the following energies have 
been assigned to Mo”: 204.2, 583.9, 763, 767.9, 784, 788.0, 822.5, and 837.3 kev. A decay scheme is proposed. 

The isomer of Tc? has been shown to decay by a single M4 transition of 96.5+40.1 kev. The experimental 
K:L:M relative conversion-electron intensities for this transition are 1:0.48:0.13. 


INTRODUCTION 


HE energy levels of Mo* populated by the decay 

of the 60-day isomer of Tc® have previously 

been studied by Medicus, Preiswerk, and Scherrer! 

and also by Levi, Papineau, and Saunier.? The work 

reported in this paper is based principally on the results 

of high-resolution conversion-electron spectroscopic 

studies, supplemented by gamma-ray scintillation spec- 

troscopy and coincidence work. In addition to the transi- 

tions reported by the previous authors, several new 
transitions have been observed. 

Cork, Brice, Schmid, and Helmer*® have reported 
that the isomer of Tc’ decays through a 90.2-kev E3 
transition and a 99.2-kev M4 transition, analogous to 
Tc®™, The results of the work reported herein do not 
confirm the work of Cork et al., but show that the isomer 
decays by a single M4 transition. 


INSTRUMENTS AND EXPERIMENTAL PROCEDURE 


High-resolution studies of long-lived accelerator- 
produced isotopes are very difficult because of the small 
amounts of activity that can be produced. In this work 
we have been able to make reasonably large amounts of 
the 60-day isomer of Tc* and the 90-day isomer of 
Tc” by bombarding natural molybdenum with the 
intense 7.4-Mev deuteron beam of the Livermore A-48 
accelerator.‘ 

Chemical processing of the target was delayed 3 
months to allow the shorter-lived activities to decay. 
The target was dissolved in a solution of HC] and HNO; 
and evaporated nearly to dryness. The precipitate that 
formed was then digested in a small volume of HNO; 
so that the bulk of the molybdenum would precipitate 
as molybdic acid. After the solution was cooled to 
reduce the solubility of the molybdic acid, the super- 
natant liquid containing most of the technetium activity 
was removed. The molybdic acid precipitate was dis- 
solved and reprecipitated three times in order to obtain 

t Work done under the auspices of the U. S. Atomic Energy 
Commission. 

1 Medicus, Preiswerk, and Scherrer, Helv. Phys. Acta. 23, 299 
(1950). 

2 Levi, Papineau, and Saunier, Compt. rend. 245, 1918 (1957). 

3 Cork, Brice, Schmid, and Helmer, Phys. Rev. 100, 188 (1955). 

4 Birdsall, Clark, Jaspon, Livdahl, Smith, and Van Atta, Bull. 
Am. Phys. Soc. 2, 187 (1957). 


the remaining technetium occluded in the precipitate. 
The technetium was then separated from the remaining 
soluble molybdenum by use of an anion-exchange 
separation.® The technetium obtained in this manner 
was mass-free. Most of the activity was used to prepare 
a source for the conversion-electron studies. This source 
was prepared by cathodic electrodeposition of the 
technetium activity onto a 0,010-inch platinum wire. 
The remaining activity was used to prepare sources for 
the gamma-ray analysis. 

The gamma-ray spectra were obtained using 1X 1}- 
inch and 33-inch cylindrical NalI(TI) scintillation 
crystals in conjunction with a 100-channel pulse-height 
analyzer. The gamma-gamma-ray coincidence measure- 
ments were performed with a coincidence circuit having 
a resolving time of 2 microseconds. In each coincidence 
experiment the chance coincidence spectrum was sub- 
tracted from the observed coincidence spectrum. 

The high-resolution conversion-electron studies were 
performed on two permanent-magnet spectrographs 
with magnetic fields of 99 and 350 gauss.® These spec- 
trographs are very useful for studying long-lived 
activities, since the photographic emulsion used as a 
detector can integrate the line intensities over a very 
long period of time. For the very weakly converted high- 
energy transitions an one month was 
required, 


exposure of 


Tc GAMMA-RAY SPECTRA 


Spectrum 4 of Fig. 1 illustrates the singles gamma- 
ray spectra taken with the 1X1}-inch Nal crystal. In 
addition to the K x-rays, gamma rays of 204, 580, 820, 
and 1040 kev were observed. The 820-kev peak was 
the only one that appeared to be complex. All the 
gamma rays decayed with a half-life of 6142 days. 
These gamma rays were therefore assigned to the decay 
of Tc”, which has been previously reported to have 
a half-life of about 60 days.’ Medicus ef al.' have 
reported gamma rays of 201, 510, 810, and 1017 kev, 
but did not observe that the 810-kev gamma ray was 


§ Huffman, Oswalt, and Williams, J. Inorg. Nuclear Chem. 3, 
49 (1956). 
6 W. G. Smith and J. M. Hollander, Phys. Rev. 101, 746 (1956). 
7 For reference see Strominger, Hollander, and Seaborg, Revs. 
Modern Phys. 30, 585 (1958). 
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Fic. 1. Singles and coincidence gamma-ray spectra of Tc%™. 
Curve A—singles spectrum. Curve B—gamma-ray spectrum in 
coincidence with 204-kev gamma ray. Curve C—complex photo- 
peak not in coincidence with the 204-kevy gamma ray. 


complex. The K x-rays decayed with a half-life longer 
than 60 days, indicating the presence of the 90-day 
isomer of Tc*”. 

Table I summarizes the gamma rays of Tc” that 
were observed and their corresponding relative in- 
tensities. The intensities given represent an average 
of the intensities obtained by using 1X1}-inch and 
3X 3-inch cylindrical NaI (TI) scintillation crystals. The 
photopeak efficiency curves of Kalkstein and Hollander® 
were used for the 1X 1}-inch crystal and the photopeak 
efficiency curves given by Heath® were used for the 
3X 3-inch crystal. 

The relative intensity of K x-rays due to Tc was 


TABLE I. Gamma rays of Tc”. 


Energy (kev) Relative intensity 
K x-rays 1.6 +0.4 
204+ 2 1.8 +0.2 
580+6 1.0 
820+ 10 1.45+0.15 
1040+-10 0.10+0.02 


8M. I. Kalkstein and J. M. Hollander, University of California 
Radiation Laboratory Report UCRL-2764, October, 1954 
(unpublished). 

*R. L. Heath, Atomic Energy Commission Report IDO-16408, 
July, 1957 (unpublished). 
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distinguished from the K x-rays of Tc®’ by resolving 
the decay curve of the K x-rays taken over a period 
of one year. 

The results of the gamma-gamma coincidence meas- 
urements can be summarized as follows: 

(1) The 204-kev gamma ray is in coincidence with 
the high-energy side of the 820-kev complex photopeak, 
spectrum B of Fig. 1. It was found by gating on the 
204-kev peak and comparing the coincidence spectrum 
to the singles spectrum that 534+5% of the 820-kev 
peak was in coincidence with the 204-kev gamma ray. 
This gamma ray, then, has an intensity of 0.77+0.08 
relative to the 580-kev gamma ray. 

(2) The energy of the gamma ray in coincidence 
with the 204-kev gamma ray is 840420 kev and 
appears to be a single gamma ray when the peak width 
is compared to the 662-kev gamma ray of Cs'*’ and the 
1064-kev gamma ray of Bi’. After subtracting out the 
contribution of the 1040-kev gamma ray from the 
singles spectrum (using the 1064-kev gamma ray of 
Bi®’ as a standard) and then subtracting the coin- 
cidence spectrum from the resultant singles spectrum, 


TABLE IT. Conversion-electron lines of Tc**” 
(99-gauss spectrograph). 


Shell of 


Relative electron 
conversion i 


intensities 


Electron energy 


Transition energy 
(kev) ev 


17.88+0.1 K* 38.9+0.1 

36.19+0.4 L 38.9 

38.44+0.4 M* 38.9 

184.2 +0.2 K 204.2+0.2 100 
201.3 +0.2 L 204.2 1342 
203.7 +0.2 M 204.2 4+1 


® Internal conversion occurs in technetium; lines not so designated occur 
in molybdenum. 


one obtains a complex peak at 800+ 20 kev, spectrum 
C, Fig. 1. This complex peak has a half-width about 
20 kev greater than that of the 840-kev gamma ray. 

(3) The 580-kev gamma ray is completely in coin- 
cidence with the 204-kev gamma ray. 

(4) The 580-kev and the complex 820-kev gamma 
rays are not in coincidence. 

(5) The 1040-kev gamma ray is not in coincidence 
with any gamma ray. 


CONVERSION-ELECTRON SPECTRA OF Tc” 


Tables II and III summarize the conversion-electron 
work. Because of the small differences between different 
L- and M-subshell binding energies, the subshell of 
conversion for a particular transition could not be 
determined uniquely. The final transition energies are 
determined from the K-conversion electron lines, by 
use of the binding energies given by Hill, Church, and 
Mihelich.” The relative electron intensities were deter- 
mined by taking a densitometer tracing of the emulsion, 
relating photographic densities to relative intensities by 


10 Hill, Church, and Mihelich, Rev. Sci. Instr. 23, 523 (1952). 
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the method of Slitis'' and also by visual comparison 
of the conversion-electron lines to standard exposed 
emulsions. The inténsities so obtained were corrected 
for the different geometries and photographic blacken- 
ing efficiencies. For the first three electron lines in 
Table II, the relative intensities were difficult to 
estimate, since the lines were rather diffused and in a 
high background. 


POSITRONS 


Since the 510-kev annihilation-radiation photopeak 
was masked by the intense 580-kev photopeak, the 
relative number of positrons to a particular gamma ray 
could not be determined directly by comparing the 
relative gamma-ray intensities. However, the ratio of 
positrons to the 580-kev gamma ray was obtained in the 
following manner. The 510-510 kev annihilation radia- 
tion coincidence counting rate (A) in the technetium 
sample was determined. According to the previous 
coincidence work, there will be no complication due to 
other gamma-ray coincidences. From energy considera- 
tions it can be shown that the 580-kev gamma ray and 


TABLE III. High-energy conversion-electron lines of Tc” 
(350-gauss spectrograph). 


Shell of 
conversion 


Transition energy 


Relative electron 
(kev i 


Electron energy 
(kev) intensities 


583.9+0.6 100 
583.9 1342 
763° +2 ~1 
767.9+0.8 5+1 
784.042 ~2 
788.0+0.8 14+2 
788.0 ~2 
822.5+0.8 6+1 
837.3+0.8 40+4 
837.3 4+1 





563.9+0.6 
580.8+0.6 
743 +2 
747.9+0.8 
764 +2 
768.0+0.8 
785 +2 
802.5+0.8 
817.340.8 
834.54+1.6 


SDS PS 


higher-energy gamma rays cannot be in coincidence 
with positrons. Levi and Papineau have determined 
from (p,m) thresholds that the available energy for 
electron capture for the isomer is 1.73 Mev." Since the 
580-kev gamma ray is in coincidence with the 204-kev 
gamma ray, and it is to be shown (later in this paper) 
that the first excited state is at 204 kev, the 580-kev 
gamma ray must depopulate a level at about 784 kev. 
From the Qg.c. the highest energy level that can be 
populated is 710 kev. Therefore the 580-kev gamma 
ray and the higher-energy gamma rays observed cannot 
be in coincidence with positrons. With the same 
geometry, the 510-510 kev coincidence rate (B) was 
measured for a Na” sample. Both samples were sur- 
rounded by an appropriate thickness of aluminum to 
stop all positrons. Then the singles counting rates of 
both samples were determined, which gave the sum of 
the 510- and 580-kev gamma rays counting rate (C) in 
the technetium sample and the 510-kev annihilation- 
1 H, Slatis, Arkiv Fysik 8, 441 (1954). 
12. C, Levi and L. Papineau, Compt. rend. 244, 1358 (1957). 
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radiation counting rate (D) in the Na” sample. The 
number of positrons relative to the 580-kev gamma ray 


is then given by 
N(B*)/N (580) =44AD/(BC—AD). 


The ratio of positrons to the 580-kev gamma ray was 
found to be 0.010+0.003. The number of positrons 
relative to the 204-kev gamma ray is then 0.0056 
+0.0018. This ratio differs appreciably from the value 
of 1.1 10-* relative to the 204-kev gamma ray reported 
by Levi, Papineau, and Saunier.? Using the value of 
1.6 for the K x-ray intensity relative to the 580-kev 
gamma ray, a fluorescence yield of 0.73," and a L/K 
capture ratio of 0.11, one obtains an estimate of the 
positron-to-electron-capture ratio of 0.4+0.16%. This 
value agrees with the value of 0.4% reported by 
Medicus et al. 


ENERGY LEVELS OF Mo 


The 38.9-kev transition is highly converted, and 
from the energy spacings of the K and ZL lines it is clear 
that the internal conversion occurs in technetium and 
not in molybdenum. These facts strongly suggest that 
this transition corresponds to an isomeric transition in 
technetium. The isomeric transition is probably in 
Tc, as was reported by Medicus and Preiswerk.'® 
Medicus and Preiswerk reported the transition energy 
as 39.0+0.7 kev, in good agreement with the energy 
obtained in this work, and they determined the half- 
life of the transition to be 60+10 days. 

The first excited state of Mo* has been shown to be 
at 204 kev by several investigators,':'® and therefore the 
204.2-kev transition is assigned as depopulating the 
first excited state. This is consistent with all the coin- 
cidence data and the fact that the 204.2-kev transition 
is the most intense in the decay. 

The 204.2-kev transition was found to be in coin- 
cidence with the 580-kev gamma ray, the energy of 
which was more accurately determined as 583.9 kev 
from the conversion-electron spectroscopy. This result 
suggests that an energy level exists at 583.9+4-204.2 
= 788.1 kev. This postulate is substantiated by the 
observation in the conversion-electron spectra of a 
788.0-kev transition, the crossover transition to the 
ground state. 

The 204.2-kev transition is also in coincidence with 
the high-energy side of the 820-kev complex gamma- 
ray peak. The energy of this gamma ray in coincidence 
with the 204.2-kev transition was found to be 840+ 20 
kev, and it appears to be a single gamma ray. This 
transition is most likely the 837.3-kev transition ob- 
served in the conversion-electron work. Since the 837.3- 


181, Bergstrém, in Beta- and Gamma-Ray Spectroscopy, edited 
by K. Siegbahn (Interscience Publishers, Inc., New York, 1955), 
p. 630. 

14M. E. Rose and J. L. Jackson, Phys. Rev. 76, 1540 (1949). 

18H. A. Medicus and P. Preiswerk, Phys. Rev. 80, 1101 (1950). 

16F. K. McGowan and P. H. Stelson, Phys. Rev. 109, 901 
(1958). 
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kev transition is in coincidence with the 204.2-kev 
transition, there is good evidence that an energy level 
exists at 1042 kev. This postulated energy level is sub- 
stantiated by the observation of a 1040+ 10-kev gamma 
ray, which is not in coincidence with any gamma ray. 

The gamma-ray photopeak not in coincidence with 
the 204.2-kev transition was complex, with a width at 
half maximum about 20 kev greater than that of the 
840-kev gamma ray. This photopeak is most likely 
due to the 767.9-, 788.0-, and 822.5-kev transitions. 

The coincidence results confirm the postulate that 
the 788.0-kev transition observed in the conversion 
electron work is the crossover transition to the ground 
state. 

The 767.9-kev transition is very likely the transition 
that has been observed in the 20-hour electron capture 
of the ground state of Tc®® (762 kev)! and the beta decay 
of Nb* (76742 kev,'” 768+1.5 kev,'* and 770+ 2 kev’) 
This transition has been shown to go directly to the 
ground state, in agreement with this work. 

Since the 822.5-kev transition is not in coincidence 
with any gamma ray, it is assigned as going directly to 
the ground state. 

Very little can be said conclusively at this time con- 
cerning the coincidence relations of the 763- and 784-kev 
transitions with the 204.2-kev transition, because of 
their relatively low intensities. If these transitions have 
comparable or smaller conversion coefficients relative 
to the other high-energy transitions, it would be un- 
likely that they are in coincidence with the 204.2-kev 
transition. 


GAMMA TRANSITION MULTIPOLARITIES 

The 204.2-kev transition has been shown by recent 
Coulomb-excitation work!® to be an M1-E2 mixture, 
with a mixing ratio £2/M1 of 0.34+40.17. The K/L 
conversion electron ratio of 7.7+1.2 obtained in this 
work is consistent with such an M1-E2 admixture. The 
theoretical K/L ratio for an M1 transition is 8.8; for 
E2 the ratio is 7.1. The theoretical conversion coeffi- 
cients of Sliv and Band” are used throughout this paper. 

The multipolarity assignments from conversion coeffi- 
cients for the other gamma rays are very difficult to 
make because, with the exception of the 584- and 837- 
kev transitions, the relative gamma-ray intensities 
could not be obtained. Also at this particular atomic 
number for the energies under consideration, the con- 
version coefficients for M1 and £2 multipolarities are 
very similar. At 584 kev the two conversion coefficients 
differ by about 15%, while at 837 kev they differ by 
only about 5%. Also the K/L ratios obtained from the 

‘7K, E. Johansson, Arkiv Fysik 10, 247 (1956). 

16 Cork, LeBlanc, Martin, Nester, and Brice, Phys. Rev. 90, 
579 (1953). 

 Drabkin, Orlov, and Rusinov, Izvest. Akad. Nauk S.S.S.R. 
Ser. Fiz. 19, 324 (1955); Columbia Tech. Translation p. 294. 

*” Tables by L. A. Sliv and I. M. Band, Leningrad Physico 
Technical Institute Reports [ translation: Report 57ICCK1, 1956, 


and Report 58ICCL1, 1958, issued by Physics Department, Uni 
versity of Illinois, Urbana, Illinois (unpublished) }. 
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emulsions in this work are considered to have an un- 
certainty too large to allow one to unambiguously dis- 
tinguish between M1 and £2 multipolarity. 

In this work the absolute conversion coefficient for 
the 584-kev transition was not determined. However, 
from the conversion-coefficient work of Medicus et al.' 
the 584-kev transition was shown to be either M1 or £2. 
The experimental ratio of the K-conversion coefficient 
for the 584-kev transition to that of the 837-kev transi- 
tion, from this work, is 1.9+0.3. If the 584-kev transi- 
tion were of M1 multipolarity and the 837-kev transi- 
tion M1 or £2, the theoretical ratio would be 2.2. If 
the 584-kev transition were £2, the ratio would be 2.6. 
Therefore the 584-kev transition is more likely to be 
M1 and the 838-kev transition M1 or £2. 

The 768-kev transition has been shown by Drabkin, 
Orlov, and Rusinov to be of £2 multipolarity." 

The 788-kev transition cannot involve a parity 
change, since it is the cross-over transition connecting 
two states of the same parity. Both the 584- and the 
204.2-kev transitions have been shown to involve no 
parity change. 

Limits on the multipolarities of the 788- and 822-kev 
transitions can be obtained from the relative conversion- 
electron intensities. The gamma-ray and conversion- 
electron intensities were normalized to both the 584- 
and the 837-kev transitions by using the theoretical M1 
conversion coefficient and the average of the theoretical 
M1 and £2 conversion coefficients, respectively. The 
gamma-ray intensities for the transitions were then 
calculated from the relative conversion-electron intensi- 
ties by assuming various multipolarities and using the 
theoretical conversion coefficients. The sum of the 
calculated gamma-ray intensities was then compared 
to the experimental sum of 0.68+0.17 for the intensity 
of the gamma rays, relative to the 584-kev transition, 
which were not in coincidence with the 204.2-kev 
transition. In both cases, with the 768-kev transition 
of £2 multipolarity, the data are consistent with multi- 
polarity assignments of M1 or £2 for the 788-kev transi- 
tion and #1, M1, or £2 for the 822-kev transition. No 
limits can be set on the multipolarities of the 763- and 
784-kev transitions because of their low intensities. 


ENERGY-LEVEL SCHEME OF Mo” 


Figure 2 represents the energy-level scheme, with 
spin assignments and transition intensities in terms of 
percent of total decay included. The relative intensities 
for the 768-, 788-, and 822-kev transitions were obtained 
from the relative conversion-electron intensities by 
normalizing the data to the conversion-electron and 
gamma-ray intensities of the 584-kev transition by 
using the theoretical M1 conversion coefficient. The in- 
tensity of the 822-kev transition was determined by 
assuming M1 or £2 multipolarity. The justification of 
this assignment in preférence to £1 is given later in this 
section. The approximate maximum limits on the in- 
tensities of the 763- and 784-kev transitions were deter- 
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mined by assuming the maximum photon intensity, 
that is, the intensity if both the transitions were F1. 
From the observed intensities of the transitions 
populating the first excited state and the intensity of 
the 204.2-kev transition, an upper limit of the electron 
capture to the first excited state can be set as 10%, and 
the data are consistent with no capture to this state. 
Because of the larger uncertainty of the K x-ray in- 
tensity, we can only set the less restrictive upper limit 
of the electron capture to the ground state as 20%. 
These limits are consistent with those of Levi ef al.? who 
have reported less than 10% electron capture to the 
ground state and less than 10% to the first excited state. 
Table IV lists the experimental log ft values for elec- 
tron capture to the various energy levels. Levi et al.? 
have reported observing approximately equally intense 
positron groups of 680+30 and 460+30 kev populating 
the ground state and the first excited state of Mo, re- 
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Fic. 2. Decay scheme of Tc**”. Transition intensities 
are given as percent of total decay. 


spectively. Assuming equally intense positron groups, 
we calculate the logft values for positron emission of 
9.0 and 8.2 for the ground state and the 204.2-kev level, 
respectively. 

The ground state of Mo has a measured spin of 
3,712 in agreement with the shell-model prediction of 
3+ due to a (ds)* neutron configuration. 

The 204.2-kev transition is primarily of M1 multi- 
polarity, which indicates that the first excited state 
could have a spin of $+, 3+, or 3+. However, from 
angular distribution and polarization-direction correla- 
tion, McGowan and Stelson'® have shown that the spin 
of the first excited state must be $+. 

From this work alone little can be said conclusively 
regarding the spin and parity of the energy level at 
768 kev, but from other work on the decay of the 
ground state of Nb® the spin has been shown to be 


21 —E, C. Woodward, Phys. Rev. 93, 954 (1954). 
2 J. Owen and I. M. Ward, Phys. Rev. 102, 591 (1956). 
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TABLE IV. Experimental log ft values for electron capture. 


Energy level Log ft 
0 >7.98 
204 >8.17 

763 58.5 

784 >8.1 
788 7.02 
822 7.98 
1042 6.92 


9/2+." In this work, this energy level is probably not 
populated directly from the decay of the }—isomeric 
state of Tc® but rather from the decay of the 9/2+ 
ground state of Tc®®, which is in turn populated by the 
isomeric transition. Medicus ef al.! have reported that 
90% of the ground-state decay of Tc® goes to the 768- 
kev level. Using this percentage, taking the intensity 
of the 768-kev transition from this work, as calculated 
by using the theoretical £2 conversion coefficient, and 
assuming no population from the isomeric state, we 
calculate that ~4% of the decay of Tc*™ proceeds 
through the isomeric transition. The value reported by 
Medicus et al. is 3%. Therefore very little, if any, direct 
population of this state comes from the }— isomeric 
state of Tc®, as would be expected. 

The energy levels at 788 and 1042 kev decay primarily 
to the $+ first excited state with an appreciable branch- 
ing to the $+ ground state. It should be noted that the 
log ft values and the modes of de-excitation are very 
similar for these two states. For both these states, spin 
values of + or $+ would be consistent with the modes 
of depopulation, the assignment of M1 multipolarity 
for the 584-kev transition, M1 or £2 multipolarity for 
the 788- and 837-kev transitions, and the log ft values 
for these states. 

The 822-kev transition has been shown to be of either 
F1, M1, or £2 multipolarity. The occurrence of an odd- 
parity state at an energy such as this and in this region 
just three neutrons from a closed shell is unlikely. The 
log ft value of 7.6 calculated by assuming that the 822- 
kev transition is of £1 multipolarity is inconsistent 
with most log ft values for allowed or second-forbidden 
transitions. However, the log ft value of 7.98 calculated 
by assuming M1 or £2 multipolarity is consistent with 
an assignment of AJ=-+1 or 2, yes, for the electron 
capture to this state. On this basis, the energy level at 
822 kev is assigned tentatively as 3+ or 3+. 

The 763- and 784-kev transitions have been tenta- 
tively assigned as going directly to the ground state. 
This assignment is not completely certain due to the 
low intensity of these transitions. Again, on the assump- 
tion that these transitions are both of £1 multipolarity, 
the calculated log ft values of ~8.5 and ~8.2 for the 
763- and 784-kev transitions, respectively, are not 
consistent with allowed or second-forbidden transitions. 
On the other hand, the log ft values of ~8.9 and ~8.6, 
respectively (assuming M1 or £2 multipolarity), are 
consistent with AJ= +2, yes. It should be pointed out 
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Fic. 3. Decay 
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reported by Cork 
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at this time that we have not ruled out the possibility 
that these transitions are populated in the decay of the 
ground state of Tc®. Medicus e al.) in their work, 
probably could not resolve the transitions of 763 and 
768 kev. 


DISCUSSION 


The decay of the isomeric state and the ground state 
of Tc® are very interesting, since in each case a different 
set of energy levels of Mo*® is populated owing to the 
large difference in the spins of the two initial states. In 
this work it is shown that the decay from the isomeric 
state is rather complex, but according to the work of 
Medicus et al.,' the decay of the ground state is rather 
simple. Most of the electron capture from the ground 
state goes to the energy level at 768 kev, and there is a 
small branching to levels at 930 and 1070 kev. In the 
decay of the ground state of Nb® only the level at 
768 kev in Mo* has been observed to be populated. 

The $+ first excited state of Mo” at 204.2 kev is 
rather difficult to interpret as a state with a single un- 
paired neutron. The log/t value of >8.2 for electron 
capture of the p; isomeric state of Tc®® to this level is 
rather high for a simple first-forbidden transition. Also, 
this excited state has a measured half-life of 7.7 10-” 
sec.4.4 From Moszkowski’s single-particle estimates” 
the half-life of this transition would be about 2.3 10-” 
sec if it were due to a single-neutron transition; that is, 
this transition is retarded by a factor of ~330. 

The energy difference of 204.2 kev between the 
[ (dy)* lp, ground state and the $+ first excited state is 
much smaller than the normal d;—d; splitting and 
therefore the first excited state cannot be due to a 
neutron configuration such as [ (ds)*(d3)! ]),. 

The abnormally high log ft value and the long life- 
time of this state may be understood if the state is due 
to three unpaired neutrons coupled in the configuration 
[ (dy)* ],,. This type of coupling of three d, particles to 
yield a resultant spin of } has been observed previously 
for the ground states of Na* and Ne?'. Kurath?® and 
Talmi*’ have shown theoretically by using interaction 
potentials intermediate to the “long-range” and 6-type 


% J. Quidat, Compt. rend. 246, 2119 (1958). 

* 1). Strominger, to be published in Phys. Rev. (1959). 

28S. A. Moszkowski, in Beta- and Gamma-Ray Spectroscopy, 
edited by K. Siegbahn (Interscience Publishers, Inc., New York, 
1955), Chap. XIII. 

26K. Kurath, Phys. Rev. 80, 98 (1950). 

277. Talmi, Phys. Rev. 82, 101 (1951). 
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interaction that the [ (d;)*];, configuration in this region 
is of lower energy than the [(ds)*];; configuration. If 
the first excited state of Mo” were actually due to a 
coupling of three d; neutrons to yield a resultant spin 
of $+, one might expect retardation of both the electron 
capture to this state and the 204.2-kev gamma transi- 
tion depopulating this state, since both these transitions 
would no longer be simple single-particle transitions. 
DECAY OF THE ISOMER OF Tc” 

Cork, Brice, Schmid, and Helmer* have reported the 
decay scheme for the 90-day isomer of Tc’ shown in 
Fig. 3 from a study of the electron capture of Ru”. 
These assignments were based on the observation of two 
highly converted transitions in a conversion-electron 
spectrum. The K-conversion electron intensities of these 
two transitions were reported to be about equal. No 
half-life determination was made by Cork et al. on these 
conversion lines. 


TaBLE V. Comparison of gamma-decay transition probabilities. 





Experimental 
Moszkowski* 
Goldhaber and Sunyar? 


2.9 10-% 
5.1X10- 
34X10" 


* See reference 25. 

» See reference 28. 

In this study the A, L, and M conversion-electron 
lines of a very highly converted transition in technetium 
of 96.5+0.1 kev were observed. No other conversion- 
electron lines were observed in this energy region. To 
assign this transition to Tc*’™, the half-life of the con- 
version-electron lines was determined over a period 
of one year. The half-life observed was 87+3 days, in 
good agreement with the reported half-life of Tc*’™ of 
about 90 days.’ 

The experimental K:L:M relative conversion-elec- 
tron intensities are 1:(0.48+0.05): (0.13+0.02). The 
K/L ratio of 2.1+0.2 agrees very well with the theo- 
retical ratio of 2.0 for an M4 transition. 

The gamma-decay transition probability was also 
calculated by using the theoretical K-conversion coeffi- 
cient, the theoretical K/L ratio, the experimental 
K/M ratio, and the experimental half-life of 873 days. 
This decay constant is compared to the value calculated 
from the estimates of Moszkowski® using the appro- 
priate statistical factor, and also to the empirical equa- 
tion of Goldhaber and Sunyar** in Table V. 
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Total Neutron Cross Section of Xe’’® as a Function of Energy* 
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The total neutron cross section of Xe'*6 as a function of energy has been remeasured at Oak Ridge National 
Laboratory under more favorable conditions than obtained in earlier measurements. A sample thickness of 
2.5X 10'8 atoms of Xe! gas per cm? was procured from the gases generated in a homogeneous reactor. A me- 
chanical time-of-flight chopper was used to select neutrons in the energy range from 0.01 ev to several 
thousand ev. The number of Xe'> atoms in the sample was determined by means of mass spectrometer 
measurements on the long-lived daughter, Cs'*. The data of the low-energy resonance were fitted to the 
single-level Breit-Wigner formula, taking into account Doppler corrections, equally well with the following 
two sets of parameters: statistical weight factor g= 3; resonance energy e9-=0.08472+0.00027 ev; neutron 
width at energy ¢o, I',°=0.03477+0.00021 ev; capture width, ',=0.083303+0.00062 ev; for g=3, 
€o9=0.08415+0.00028 ev; I’,°=0.02057+-0.00012 ev; I, =0.09493+0.00071 ev. The errors quoted are the 
standard deviations derived from the statistics of the measurements. Systematic errors are discussed in the 
body of the paper. No evidence for resonances at energies greater than 0.085 ev was observed. The results 
described are interpreted in terms of recent considerations on the statistics of the properties of nuclear energy 


levels. 


INTRODUCTION 


HE fission product xenon-135 has the largest 

known neutron cross section. When the spin 
weight factor, g, is considered, the capture cross section 
is close to the theoretical maximum, grA?=3.8X 10° 
barns for a statistical weight factor of }. This is due to a 
resonance level at 0.085 ev with an abnormally large 
reduced neutron width which is approximately 95 X 10% 
(ev)! at 1 ev. The neutron cross section of Xe!* is of 
special interest because the compound nucleus Xe'*® 
contains a magic number of neutrons (82), and because 
of its pertinence to reactor technology. 

The first measurements of the energy dependence of 
the cross section were completed in 1948 by Bernstein 
et al.! A focusing-type single-crystal spectrometer was 
used as a neutron monochromator. Transmission meas- 
urements could be made on samples of 1 mm? in area. 
The xenon was obtained from iodine separated from 
irradiated uranium, and the samples were assayed by 
the technique of activity measurements. The maximum 
sample strength used in these measurements was 
5.4X10"7 atoms/cm? of Xe'*®, This amount of Xe!* 
corresponds to an activity for the whole sample of ap- 
proximately 3 curies of Xe activity. 

About 1954, circumstances at Oak Ridge National 
Laboratory seemed to offer the opportunity of repeating 


* A preliminary report of the work described here has been given 
by S. Bernstein and E. C. Smith, Proceedings of the International 
Conference on the Peaceful Uses of Atomic Energy, Geneva, August, 
1955 (United Nations, New York, 1956). Paper 591, Vol. 4, p. 153. 

t Now at Lockheed Aircraft Corporation, Marietta, Georgia. 

§ Now at Argonne National Laboratory, Lemont, Illinois. 

|| Now at American Machine and Foundry Company, Green- 
wich, Connecticut. 

t Dr. S. Bernstein of Oak Ridge National Laboratory acted as 
“editor” and contributor in the preparation of the manuscript 
of this paper for publication. He did not participate in the experi- 
mental work. This procedure seemed desirable because those who 
took the neutron measurements were no longer at Oak Ridge 
National Laboratory at the time of writing. 

1 Bernstein, Shapiro, Stanford, Stephenson, Dial, Freed, Parker, 
Brosi, Hebert, and DeWitt, Phys. Rev. 102, 823 (1956). 


these measurements under more favorable conditions. A 
mechanical time-of-flight neutron spectrometer had 
been built.? A copious supply of fission product gases 
was available from an existing homogeneous reactor. 
The cross section of Xe'*® was remeasured using com- 
pletely different techniques of sample preparation and 
assay. The spectrometer provided a wider range of 
energies, with a practical lower limit of 0.01 ev and an 
upper limit determined principally by the sample 
strength. In this instance a sample strength of 2.5 10!8 
atoms/cm? was achieved with a sample of approxi- 
mately 500 curies. 

The first level was fitted with the Breit-Wigner 
parameters given below. A search for resonances in the 
higher energy region with a resolution of 0.5 microsec/ 
meter yielded negative results for the Xe'*° isotope. 


THE TIME-OF-FLIGHT SPECTROMETER 


The spectrometer used for this experiment was similar 
in principle to other fast chopper time-of-flight spec- 
trometers*~* and has been described elsewhere.? Bursts 
of neutrons were produced in a collimated beam from a 
nuclear reactor by a slotted rotor which spun about a 
horizontal axis parallel to the beam. Neutrons of 
different energies were sorted by the differences in their 
time of arrival at a detector 11.39 meters away. The 
energy of the neutrons is related to the time of flight by 
the relation 


E= (72.3/r)’, (1) 


where 7 is the time of flight in microseconds per meter, 
and £ is the energy in electron volts. 

An 84-channel time-sorter was used, the timing being 

2G. S. Pawlicki, Oak Ridge National Laboratory Report 
ORNL-1526 (unpublished). 

3 W. Selove, Rev. Sci. Instr. 23, 350 (1952). 

4 Seid], Hughes, Palevsky, Levin, Kato, and Sjéstrand, Phys. 
Rev. 95, 476 (1954). 

5 F. G. P. Seidl, Brookhaven National Laboratory Report BNL 
278, July 1954 (unpublished). 
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established by a calibrated oscillator triggered by 
photomultiplier pulses derived from an optical system 
attached to the rotor assembly. The long-term drift of 
the oscillator was of the order of one percent, but 
variations in frequency over an interval of several hours 
were considerably smaller. After pulses derived from the 
neutron detector were sorted into a channel representing 
the appropriate time interval, they were recorded by a 
scale of 8 and mechanical register. Channel widths could 
be varied from 3.5 usec to 160 ysec. The’ full width at 
half maximum of the neutron burst could be varied from 
3.8 to 70 usec by changing the rotor speed. 

The resolution width of the spectrometer depends® on 
the flight path, burst width, channel width, and un- 
certainty in flight path due to finite detector length, and 
detector “jitter” (variations in detection time within the 
counter). In this instance the detector was a parallel 
plate ionization chamber 30 cm long, and detector 
“jitter” is a second-order effect. The resolution in energy 
varied from 4.8% at 0.01 ev to 7.0% at 30 ev. 

The distribution of intensities in the spectrum meas- 
ured with this spectrometer depends not only on the 
reactor spectrum and detector efficiency, but also on a 
chopper cutoff effect. The geometry of the rotor slit- 
detector system was arranged to prevent an overlap of 
slow neutrons of one burst with the fast neutrons of the 
succeeding burst. Thus, the dynamic transmission of the 
shutter varied so that counting rates decreased mono- 
tonically with time of flight and vanished at the time of 
the next burst. Rotor speeds were selected as a compro- 
mise between the resolution desired and the effect on 
counting rate of the chopper cutoff function. 


counts /20 min 





Fic. 1. Typical counting rate curves of the time-of-flight 
spectrometer for several channel widths and rotor speeds. The data 
for Curve 1 were taken with 2.45 wsec/meter channel width and 
2.02 ysec/meter burst width, resolution width 3.3 ysec/meter 
Curves 2 and 3 were obtained with 8.42 wsec/meter channel width 
and 5.97 wsec/meter burst, resolution width 10.2 wsec/meter. 


®D. J. Hughes, Pile Neutron Research (Addison-Wesley Pub 
lishing Company, Inc., Reading, 1953), p. 167. 


SMITH 


Sia ena ING 


‘Ti Te 


The total time covered by the 84 channel widths was 
divided into 84 contiguous parts. This total time could 
be varied so that the energy region of interest could be 
covered. Channel widths were selected so that the time 
of flight to the last channel corresponded to the lowest 
energy at which measurements were desired. Typical 
count rate curves for several channel widths and rotor 
speeds are shown in Fig. 1. Other things being equal, 
the count rate per channel is proportional to the channel 
width. As a consequence, when wide channel widths 
were used to reach low energies, the counting rate in the 
peak of the thermal neutron spectrum was so high that 
an appreciable loss of counts occurred due to register 
dead time. In the instance of the data for curve 2 of 
Fig. 1, a portion of the beam was blocked with shielding 
material to reduce the counting rate. When the shield 
blocks were removed the counting rates in curve 3 were 
obtained. The shift in the position of the peak between 
curves 1 and 2 is due to chopper cutoff effects. The very 
rapid decrease of intensity with time of flight beyond the 
peak of the Maxwellian distribution is to be noted. 

The background counts were primarily caused by the 
leakage of fast neutrons through the shutter when it was 
“closed.” The ratio of the background counts with and 
without cadmium in the beam was unity. Therefore, 
background could be measured in the region below 0.4 
ev without stopping the rotor by observing the count 
rate with cadmium in the beam. Backgrounds for the 
cases represented in Fig. 1 were 800, 126, and 500 counts 
per channel per 20 minutes for curves 1, 2, and 3, 
respectively. 

An advantage of the fast chopper time-of-flight 
spectrometer is the small beam area near the slots of the 
rotor. For transmission measurements, samples could be 
accurately positioned in the collimator 4.5 inches from 
the rotor where the beam area was less than 0.25 cm’. 
Transmission measurements on gases were made with a 
sample holder having an area in the beam of 0.315 cm? 
and a volume of approximately 0.9 cm*. The sample 
holder could be positioned remotely within a shield 
enclosure external to the reactor shield. 


SAMPLE PREPARATION 


The sample of xenon was prepared’ by bleeding off 
some of the gases generated in the homogeneous reactor. 
The reactor was operated at a power of 200 kilowatts for 
24 hours to permit buildup of the Xe'*° from the parent, 
iodine. The gases generated in the reactor were then 
collected on a 5-liter silica gel trap. After approximately 
5 hours collection time, the trap was warmed, the 
krypton was eluted to a holding trap, and the xenon was 
collected in a trap in a mobile shield. The specimen was 
then transferred to a hot-cell facility for further purifica- 
tion and for loading into the spectrometer sample holder. 

7 Parker, Martin, Creek, and Lantz, Proceedings of the Inter- 


national Conference on the Peaceful Uses of Atomic Energy (United 
Nations, New York, 1956), Vol. 14, p. 96. 
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At this facility the specimen was collected in a one-liter 
flask after several elutions from successively smaller 
silica gel traps and after passage over a hot barium getter 
to remove impurity gases. The sample was then com- 
pressed by filling the flask with mercury and condensed 
into the sample holder, which was at liquid nitrogen 
temperature. The capillary tubes connecting the sample 
holder with the process apparatus were crimped, cut, 
and soldered by remote operation. The sample was then 
warmed and tested for leaks by passing air over the 
holder and onto a monitored charcoal trap. The sound- 
ness of the seals having been established, the sample 
holder was loaded into a lead shield and transported to 
the chopper. Fifteen minutes were required to transfer 
the sample to the holder and to seal it. The mid-time of 
this period was used as an origin for computing sample 
decay. 
SAMPLE ASSAY 

In order to convert measurements of transmission to 
cross section it was necessary to establish the number of 
atoms per cm? (atomic density-thickness) in the sample 
at any time. This was done by means of mass spectrome- 


TABLE I. Mass spectrometer assay. 


Spiking ratio Abundance ratio Number of 
R determinations 
0.000 3.59+0.0095 30 
0.250 4.46+0.0052 30 
0.500 5.35+0.013 29 
0.667 5.84+0.021 27 
1.000 6.92+0.016 26 
1.500 8.57+0.015 30 
2.000 10.31+0.020 20 


ter measurements performed by R. Baldock and J. R. 
Sites of the ORNL Mass Spectrometer group. The 
number of Xe'*> atoms in the sample holder at the time 
it was loaded was determined by the method of isotopic 
dilution. After the completion of the cross-section meas- 
urements, essentially all the xenon was permitted to 
decay to its daughter, cesium, which consisted princi- 
pally of the isotopes of masses 133 and 135. The capil- 
laries were cut near the body of the holder and the 
cesium was removed by rinsing with water solution 
until no further beta activity could be detected. The 
solution was then diluted to 10 milliliters. The relative 
abundance of the isotopes 133 and 135 was measured in 
samples obtained by spiking 100 microliter aliquots of 
the solution containing radiogenic cesium with aliquots 
from a standard solution of normal CsC] containing 66.5 
micrograms of Cs'* per milliliter. 

The concentration of Cs'*® was obtained from these 
measurements in the following manner: Let «= number 
of Cs'*° atoms per milliliter in the solution of radiogenic 
Cs; y=number of Cs'* atoms per milliliter in the 
solution of radiogenic Cs; and z= number of Cs! atoms 
per milliliter in the standard solution. Let R= ratio of 





R, ISOTOPIC ABUNDANCE RATIO (133/435) 
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Fic. 2. The ratio of the numbers of Xe! atoms to Xe!*> atoms 
as a function of “spiking ratio,” the parts of standard solution to 
parts of radiogenic solution. 


133 to 135 atoms in the specimen measured on a mass 
spectrometer. If we mix uw microliters of the unknown 
with v microliters of the standard solution we have 
ny=ux atoms of Cs! and no=uy+vz atoms of Cs", 
whence 


R= n2/ n= (uy+02)/ux=at+bp, (2) 


in which p=v/u, the ratio of parts of standard solution 
to parts of solution containing the radiogenic Cs. 

Thus R is a linear function of p, and the parameters of 
the line may be determined by the method of least 
squares. The spectrometer measurements are shown in 
Table I and plotted in Fig. 2. 

The measured isotopic abundance ratio, R, is given in 
the second column of Table I for each value of p, the 
ratio of parts of the solution added to the aliquot of the 
unknown solution. The errors quoted are the standard 
deviations of the means of the number of observations 
of the abundance ratio which were made on a single 
sample for each value of the spiking ratio. The slope and 
intercept of the least squares line are a=3.623+0.017 
and b= 3.327+0.026. 

There were 66.5 ug/ml in the standard, corresponding 
to 3.01310" atoms per ml. Thus the concentration of 
Cs'®° in the specimen was 9.06X10'® atoms/ml, or a 
total of 9.0610"? atoms of Cs! in the 10 milliliters 
containing all the sample. This corresponds to an ac- 
tivity of 514 curies of Xe'*® activity at the time the 
holder was loaded. 

The volume of the sample holder was calculated from 
dimensions carefully measured during fabrication. It 
was 0.900 cm*. Therefore, the atomic density of Xe!* in 
the holder was 1.007 10'* atoms/cm’. The length of 
neutron path through the sample was 2.86 cm, so the 
density-thickness of the sample when it was loaded was 
2.88 10!8 atoms/cm’. 
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Fic. 3. Typical transmission measurements of Xe!*> sample as a 
function of neutron time-of-flight. Curves I, II, and III were ob- 
tained for values of (¢—t))=0, 1300 minutes, 2400 minutes. The 
resolution for I was 3.3 wsec/meter, and for II and III, 10.2 
psec/meter. 


CROSS-SECTION MEASUREMENTS 
Transmissions were measured by comparison of the 
counting rate when the sample was positioned in the 
beam with the counting rate observed when the sample 
and its holder were removed from the collimator. A 
typical cycle for transmission measurements was as 
follows: sample in 20 minutes, open beam 20 minutes, 
sample in 20 minutes, background 20 minutes. The 
effect of the decay of the sample during the 20 minutes 
required for a count with the sample in the beam was 
negligible. Typical transmission measurements are 
shown in Fig. 3. Curves II and III were taken with a 
resolution of 10.2 usec per meter. The energy region 
above 0.12 ev was measured, also, with a resolution of 
3.3 ywsec per meter. The data using this resolution are 
shown in curve I of Fig. 3. The lowest energy at which 
measurements were taken was 0.0105 ev. 
The transmission of the sample is given by 


T= Te ne (3) 
where 
n= noe! to (4) 


and 7»)= transmission of sample holder; ‘= mid-time of 
sample-in count; /o=origin of time measurements, 
chosen here to be the mid-time of the first sample-in 
count; #=number of Xe atoms per cm? at time /; 
ny= number of Xe" atoms per cm’ at time /o; A=decay 
constant of Xe!*. 

A weighted mean of 7 independent determinations of 
the half-life of Xe! is 9.177+0.023 hours.*® Therefore, 
A= (1.2588=+-0.0032) X 10- min“. 

’F. Brown and L. Yaffee, Can. J. Chem. 31, 242 (1953); 
Newton, Sullivan, Johnson, and Nattorf, Radiochemical Studies: 
The Fission Products (McGraw-Hill Publishing Company, Inc., 
New York, 1951), National Nuclear Energy Series, Plutonium 
Project Record, Vol. 9, Book 2, Part V, p. 1028; E. J. Hoagland 
and N. Sugarman, ibid., p. 1031; Turkevitch, Adams, and 
Freedman, ibid., p. 1034; Thulen, Bergstrom, and Hedgron, Phys. 
Rev. 76, 871 (1949); J. E. Faulkner, Hanford Works (unpub- 
lished); P. E. F. Thurlow, Oak Ridge National Laboratory (un- 
published). 
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If the natural logarithm of Eq. (3) is taken, then 
—InT=—InTot+nooe (5) 


Thus (—In7) is a linear function of the variable 
exp[ —A(¢—¢o) ] with slope moo and intercept (—In7»). 
Typical data are plotted in this manner in Fig. 4 for 
several neutron energies. The data at each energy were 
fitted to Eq. (5) by the method of least squares. The 
slope, intercept, and their standard deviations were 
obtained on the Oracle, the ORNL fast digital com- 
puter. 

Cross sections were calculated by dividing the slopes 
by 2 which is given by 


ny= Ne~*/- = 2,50X 10'8 atoms/cm’. (6) 


In Eq. (6), (¢;—to)=time interval between the mid- 
time of sample loading and the first sample-in count 
(113 minutes). V = 2.88X 10'* atoms per cm’ is equal to 
the number of Cs'*° atoms per cm? of the neutron beam, 
as determined from the assay. The number of Xe" 
atoms at mid-time of loading is taken to be equal to V. 
The measured cross sections are shown in Fig. 5 as a 
function of neutron energy. The resonance character of 
the cross section is more apparent in the plot of o/E 
against E which appears in Fig. 6. 


ANALYSIS OF THE RESONANCE 


The measured cross sections were fitted to a single- 
level Breit-Wigner formula: 


4X0’ 

Cet ee 

4(e—e9)?+T? 

where o;= total cross section; A=neutron wavelength/ 
2r relative to the target nucleus; e=energy of the neu- 
tron relative to the nucleus; e9=relative energy of the 
level ;T,=T’,.°(€/eo)*=neutron width ;T=I,+T'.= total 
level width, full width at half-maximum; ',=capture 
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Fic. 4. Logarithm of the sample transmission as a function of the 
relative number of Xe!’ atoms for several values of neutron 
energy. The total cross section of Xe" is proportional to the 
slopes of the fitted straight lines. 
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width; g= (2/+1)/2(2/+1); 7, J=spins of target and 
compound nucleus, respectively. 

In (7), terms involving potential scattering have been 
neglected, since they affect the cross section by at most 
0.3%. Also an error of 0.75% is introduced into (7) by 
our use of relative energies rather than center of mass 
energies. Since the spin of Xe'*® is probably 3, there are 
two possible values for the statistical weight factor g, 3 
or 3. Doppler effects due to the thermal motion of the 
Xe nuclei were taken into account in the follewing 
manner. Let m=neutron mass, M=mass of target 
atom, 7= target temperature, E= neutron bombarding 
energy in the laboratory system. Then the distribution 
function for ¢ may be approximated by® 


M \3 
w(e)de=I( ) 
amkT 


Ve M 
x— exp| — (—) (By le (8) 
E mkT 


4 





The Doppler-corrected Breit-Wigner formula becomes 
eo(E)= f w(e,E)o.(e)de, (9) 


with w given by Eq. (8) and o; by Eq. (7). Those values 
for the parameters I’,.”, ',, and €9 were determined which 
minimized the weighted sum of squares of deviations of 
the measured cross section from those computed by 
Eq. (9). Weights for the data were based on standard 
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Fic. 5. The total cross section of Xe!5 as a function of energy. 
The circles represent the data. The curves are the Doppler- 
corrected Breit-Wigner formula fitted to the data by least squares 
for two possible values of the statistical factor, g. 


® We are indebted to Dr. R. K. Osborn of Oak Ridge National 
Laboratory for the derivation of Eq. (8). It differs from some 
others in that all three components of the thermal motion were 
taken into consideration, 
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Fic. 6. o\/E as a function of neutron energy, E. The data are 
shown as circles. The curves are derived from the Doppler-cor- 
rected Breit-Wigner formula fitted to the data for two possible 
values of the statistical factor, g. 


deviation of the slopes of the lines used to determine 
relative cross sections in Eq. (5). A standard procedure 
was used.” 

The data were equally well fitted by the following two 
sets of parameters: 


g=%; 
éo= 0.08472+0.00027 eV, 
I’ ,.°=0.03477+0.00021 ev, (10) 


I’, =0.08303+0.00062 ev, 
I”=0.11780+0.00065 ev; 


or 
g=s, 
€y= 0.08415+0.00028 ev, 
l’,°=0.02057+0.00012 ev, (11) 


Pr ,=0.09493+0.00071 ev, 
1™”=0.11550+0.00072 ev. 


The standard deviations given were calculated from the 
deviations of the data points from the calculated values. 
IM=T at e=eo. If center-of-mass coordinates instead of 
relative coordinates had been used in (7), then eo, To, 
and I’,.° should be reduced by approximately 1%. How- 
ever, values of cross sections computed at corresponding 
relative and center-of-mass energies would remain 
unchanged. 

Curves calculated from these values are shown in 
Fig. 5 and Fig. 6, together with the experimental data. 
It is to be noted that the curves differ in shape only at 


1 F, Garwood, Biometrika 33, 46-48 (1941, 1942), 
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the peak is 


extreme energies. The cross section at 
computed from Eq. (7) to be, for g=3, 


a= 3.40X 1078 cm’, 

o4=2.40X10~ cm’; 
and for g=3, 

o,=3.42X 10-8 cm’, 

oa= 2.81108 cm’. 
The 2200-m/sec values are, for g= 3, 

o,=3.07X 10-8 cm?, 

oe= 2.50X 10~'* cm’; 
and for g=3, 

o1=3.08X 1078 cm?, 


Oa=2.16X 1078 cm”. 


ANALYSIS OF ERRORS 


The errors quoted for the Breit-Wigner parameters 
are the standard deviations obtained from the least- 
squares fit alone. There are in addition a number of 
other sources of error which were not included in the 
statistical analysis. Uncertainties in the energy scale 
would be reflected in the value of € and the total width 
I’. Although the oscillator controlling the timing was 
calibrated a short time before the measurements were 
made, it was subject to drifts which make the energy 
scale, or the values of ¢9 and I’, uncertain to perhaps 
1.5%. Uncertainty in the absolute magnitude of the 
cross section is reflected most sensitively in the value of 
I’,.°. The principal random sources of error of these are 
the following: mass spectrometer assay, 0.8%; volume 
to which sample was diluted, 0.5%; volume of sample 
holder, 1%; neutron path length in sample, 0.1%; Xe! 
half-life, 0.25%; effective time of sample loading, 0.5%. 
Considering all the random sources of error, the esti- 
mate of the errors in the Breit-Wigner parameters are 
about 1.5% for e) and I’, and about 1.5% for T’,.°. Cross 
sections computed from Eq. (10) and Eq. (11) for values 
of « very near or very far from ¢€ will then be about 
+2.0% in error. 

In addition to random errors, there are possible 
systematic errors. One error is due to the possibility that 
not all of the Cs'*° was removed from the sample holder. 
It is estimated that about 1% of the sample used for the 
neutron measurements may not have appeared in the 
assay samples. To take this possibility into account, the 
cross sections given by the use of the constants (10) and 
(11) should be reduced about 1%. A second error is the 
possibility that CsCl used in the standard solution 
retained some water of hydration in spite of the fact 
that it was repeatedly baked and weighed. It may have 
contained small amounts of impurity solids, also. If 
water or other impurities were present, then the values 
of the cross section given by the use of the constants 
(10) and (11) should be increased. 
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MEASUREMENTS AT HIGHER ENERGIES 


Measurements were made at energies greater than 
1.5 ev with a resolution of 0.5 microsecond/meter. No 
additional resonances in Xe!® were detected. The sig- 
nificance of these measurements can be evaluated by the 
following considerations. The area under a resonance 
for a thin sample is 


A gE= (Qe? 72.3)A = Qn’ gkoT Nn, (16) 


in which A g is the area of the transmission dip in ev, A ; 
is the area in microseconds per meter, and the remaining 
symbols are defined above. Using the appropriate nu- 
merical factors, Eq. (16) may be rearranged to give an 
expression for the reduced width, y,°, in terms of the 
area of the transmission dip: 


7 2=T,°/€o!=8X 105A , 602, (17) 


in which an average value of } has been used for g. 
Under the conditions of the measurements, an A, of 
about 0.025 microsecond per meter could certainly be 
detected. Thus, all resonances having a reduced width 


¥n > 2XK10-*e0? (18) 


would have been detected. For example, at 10 ev, 30 ev, 
and 100 ev, all resonances with reduced widths greater 
than 0.02, 0.2, and 2.0 (ev)!, respectively, would have 
been detected. In the energy interval zero to 1 ev, all 
levels of reduced neutron width greater than 2X 10-4 
(ev)! should certainly have been observed, except per- 
haps those which might occur within about one resolu- 
tion width or 31 of the 0.08-ev resonance. 


DISCUSSION 


On the basis of current statistical theories of the 
properties of nuclear energy levels, some qualitative 
statements can be made about the experimental obser- 
vations over the range of the measurements, zero to 
about 6000 ev. For the purpose of these qualitative 
considerations, the assumption of a random distribution 
of energy levels should be adequate. 

The neutron cross section of Xe'* is high at low 
energies because of the concurrence of two factors: the 
existence of a resonance at low energy; a high value for 
the ratio (T,°/T). There are a number of resonances at 
low energies in this mass range, but both the capture and 
reduced neutron widths of these are not unusually large. 
The average number of energy levels in the energy 
interval dE is (dE/D), in which D is the average dis- 
tance between levels. As shown below, for the compound 
nucleus Xe!**, D is probably of the order of magnitude 
of 600 ev. The probability of finding a level between 0 
and 1 ev (where the neutron wavelength has its greatest 
values) is roughly (1/600). The occurrence of the level 
in this interval at 0.08 ev is, therefore, quite fortuitous. 

Porter and Thomas!" give a distribution function for 
reduced neutron widths in terms of (y,°/7,°), the ratio 


1 C. E, Porter and R. G. Thomas, Phys. Rev. 104, 483 (1956). 
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of the reduced neutron width at resonance energy to the 
average reduced neutron width for all the levels of a 
given nuclide. According to this distribution, 10% of the 
yn’s are greater than 2.717,°, and 10% less than 
(1/64)7,°. The cloudy-crystal-ball model of the nucleus 
predicts” a value of (7»°/D) of about 2X10~ (ev)! for 
nuclei of mass number about 135. Using this relation- 
ship, the probabilities from the Porter-Thomas distribu- 
tion given above and the measured value 0.1 ev for y,°, 
the qualitative statement can be made that D has a 
value between 180 ev and 10° ev with a probability of 
80%. The semiempirical theory of Newton" gives an 
expression for D in terms of the mass number and 
excitation energy of the compound nucleus after neutron 
capture. Using the measured values’ for the masses of 
Xe!* and Xe!*®, and the mass of the neutron, the binding 
energy of a neutron in Xe'® is 7.89 Mev. Using this 
value in Newton’s expression along with his corrections 
and tables, a value of about 600 ev is predicted for the 
observed level spacing. This value is perhaps good to 
within a factor of 3. 

The significant feature of the measurements at higher 
energies is that no resonances other than the large one at 
0.08 ev were observed between zero and 6000 ev. Since 
Dis roughly 600 ev, and 6000/600= 10, at least “‘several’’ 
levels should exist in this interval. Only one was found. 
Any others were probably not observed because the 
thickness of sample used (corresponding to 514 curies), 
and the resolution of the apparatus, were not adequate. 

A rough estimate of the average number of levels, m, 
in the interval 0 to E, is given by 


E - dE 
m= f If Piadar | 
ID 


2V. F. Weisskopf, Revs. Modern Phys. 29, 174 (1957). 
18 T, D. Newton, Can. J. Phys. 34, 804 (1956). 
144W. H. Johnson and A. O. Nier, Phys. Rev. 105, 1014 (1957). 


(19) 
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The /,” P(x)dx gives the fraction of reduced neutron 
widths greater than the lower limit of observable widths 
determined by conditions of our experiment. The symbol 
b=2X10~-*(E*/T,,°) is the lower limit as determined by 
Eq. (18). P(x) is the Porter-Thomas distribution func- 
tion. We have calculated m for three assumed values, 
r,.°=0.03, 0.1, and 0.3 (ev)!. The value 0.1 is approxi- 
mately the average of the two possible values measured 
for the low-energy resonance. For these values of I’, we 
find m=0.02, 0.03, and 0.06, respectively. Thus for 
r,.°=0.1, the probability, according to the Poisson 
distribution law, of observing only 1 level is 0.029. The 
probability of observing 2 or more levels is 5X10~. 
Thus, on the average, if the cross section of 30 nuclides 
with a D of about 600 ev, were measured under our 
experimental conditions, a single resonance should be 
found between 0 and 6000 ev in only one of these 
nuclides. Xe'*> happens to be the one. The chances of 
observing 2 or more levels under our conditions are so 
small that it is not at all surprising that they were 
actually not observed. In order to observe additional 
levels at higher energies, a much thicker sample and an 
instrument of much higher resolution are needed. Of 
course, all of the probabilities mentioned here are sub- 


ject to variation by a factor of about 3 because of the 


uncertainty in D. 
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The absolute differential cross section for the elastic scattering of (20.3540.25)-Mev protons has been 
measured for enriched Zn™, Zn®, and Zn® foils. In the angular range of 30°-160° about 50 measurements 
were made for each foil (spaced from 1° to 5°) to an estimated accuracy of about 5% standard deviation. 
Scattered protons were detected by nuclear emulsions wrapped around a 4-in. diameter scattering chamber, 
all angles being exposed simultaneously. Detector energy resolution is 2.5%, angular resolution is 1° standard 
deviation, and relative angular shifts are determined to 0.1°. Correction has been made for the finite sizes 
of beam and detector and for multiple scattering in the target and in the detector stopper. The +0.25-Mev 
energy spread includes maximum and minimum energies due to beam drift, beam spread, and target foil 
thickness. Three minima are found for each isotope: Zn™ at 63°, 104°, 142°; Zn® at 62°, 102.5°, 142°; and 
Zn® at 61°, 101°, 142°. The absolute cross sections are approximately the same except at the third minima, 
where for Zn™, Zn®, and Zn® they are respectively, 1.46, 1.07, and 0.61 mb/sterad. 


INTRODUCTION 


IFFERENTIAL elastic scattering of protons has 

been measured by investigators in the 10-40 Mev 
energy region! and analyzed using the optical model 
potential.? Most experiments have been done with 
isotope mixtures according to natural abundances so 
that nuclei of various radii are present in the target. 
It is thus of interest to see if scattering from different 
isotopes will yield additional information concerning 
nuclear parameters. Zinc is an appropriate choice be- 
cause the spread in mass number is large for such a 
light element. In addition, theoretical fits have been 
quite successful for zinc. Preliminary reports on this 
work have been given previously.’ 


EXPERIMENT 


The external beam from the UCLA synchrocyclotron 
was directed into the 4-in. diameter scattering chamber, 


t Assisted jointly by the Office of Naval Research and the 
U. S. Atomic Energy Commission. Part of a dissertation sub- 
mitted by R. W. Boom for the Ph.D. degree at the University of 
California at Los Angeles. 

* Now guest physicist at the Institute for Nuclear Physics of 
the Friedrich-Wilhelms University, Bonn, Germany. 

' Several in the 10-40 Mev range: J. W. Burkig and B. T. 
Wright, Phys. Rev. 82, 451 (1951); B. T. Wright, University of 
California Radiation Laboratory Report UCRL-2422, November, 
1953 (unpublished); B. L. Cohen and R. V. Neidigh, Phys. Rev. 
93, 282 (1954); B. B. Kinsey and T. Stone, Phys. Rev. 103, 975 
(1956); J. Leahy, University of California Radiation Laboratory 
Report UCRL-3273, February, 1956 (unpublished) ; I. E. Dayton 
and G. Schrank, Phys. Rev. 101, 1358 (1956); M. M. Hintz, 
Phys. Rev. 106, 1201 (1957); N. S. Wall and W. F. Waldorf, 
Phys. Rev. 107, 1602 (1957); Gibson, Prowse, and Rotblat, Proc. 
Roy. Soc. (London) 243, 237 (1957); R. H. Chow and B. T. 
Wright, Can. J. Phys. 35, 184 (1957); M. M. Hintz, Bull. Am. 
Phys. Soc. Ser. IT, 2, 14 (1957); M. K. Brussel and J. H. Williams, 
Proceedings of the University of Pittsburgh Conference on Nuclear 
Structure, 1957, edited by S. Meshkov (University of Pittsburgh 
and Office of Ordnance Research, U. S. Army, 1957), p. 34; John 
H. Williams and M. K. Brussel, Phys. Rev. 110, 136 (1958). 

* Saxon, Melkanoff, Nodvik, and Woods, Phys. Rev. 95, 577 
(1954); 100, 1805 (1955); 106, 793 (1957). Glassgold, Cheston, 
Stein, Schuldt, and Erickson, Phys. Rev. 106, 1207 (1957); 107, 
1372 (1957); 109, 1201 (1958). 

*R. W. Boom and H. A. Howe, Bull. Am. Phys. Soc. Ser. II, 
2, 305 (1957); R. W. Boom and J. R. Richardson, Bull. Am. Phys. 
Soc. 3, 419 (1958). 


at the center of which was positioned a foil holder at 
45° to the incoming beam for simultaneous recording 
at all angles. The scattered protons left the chamber 
through thin windows at the median plane, passed 
through thin aluminum absorbers staggered around the 
circumference, were further slowed down by the thick 
front surface of the film holders, and stopped in the 
nuclear emulsion. Since the nuclear emulsion was laid 
along a circumference, protons were perpendicularly in- 
cident ; thus elastic scattering cross sections are propor- 
tional to film-surface elastic-scattering track densities. 
Scattering angles are proportional to linear distances 
along the film and are determined from several 1° wide 
blank spaces on each exposed trace; these “no proton” 
blank spaces are behind thick vertical brass strips 
soldered on the external absorber holder. 

The divergence of the beam (as defined by the 3-in. 
Xj/s-in. slit) in angle and energy is measured behind a 
vacuum extension pipe with a Faraday cup and movable 
slit. The angular divergence of the beam (as defined 
by a 3-in.Xjg-in. slit) for an approximate Gaussian 
current distribution was 0.2° rms deviation and the 
variation of the mean energy in the horizontal direction 
was +100 kev high in the center and dropping on both 
sides. No vertical energy variation was found. 

Careful alignment and centering of the scattering 


eecce - ( rreBirene] 


ee 
ra a 


3 


0.087 CM 
1* 2 0.090 CM 





8 


TRACKS/FIELD 
@ 
° 


/ 


a 
ST) ed 4 
‘ 656 656 654 652 6; 


X AXIS, CM KO) 








a | 1 
646 644 642 


. 1. Counting rate behind an index wire. The expected 
Gaussian distribution is shown by the circles. 
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ELASTIC SCATTERING 


OF FROTONS BY Zn 


TABLE I. The isotopic composition of the foils. 


Atomic % 


Zn Zn*? Zn 


1 93.12+0.06 
2 10.0 +0.7 
3 2.7 +0.2 


Zn*6 


6.2940.05 0.159-40.003 
784430 12 +02 
28 40.2 04 +01 





0.432-4.0.009 
88 +10 
93.9 +0.5 


chamber with respect to the beam and accurate meas- 
urement of the angles of the various index strips allowed 
absolute angles to be measured with an accuracy of 
+0.2°; angles relative to the index strip shadows are 
better than +0.1°. 

The nuclear emulsion, Ilford E1, is 300y thick 
mounted on 0.008-in. celluloid backing. The film holders 
are rigid cylindrical sections, spaced to clear the side 
slits and locked in place on a vertical bar. The front 
surface is 0.050-inch Al, most of the absorber required 
to stop the scattered protons in the film. Behind this 
front surface is a 0.004-inch sheet of black polyvinyl- 
chloride which serves as a light seal and is inert to the 
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Fic. 2. Representative 
number-range histograms 
showing the separation of 
elastic and inelastic scatter- 
ing peaks. 
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‘emulsion surface. Two films were used (for front and 
back quadrants) with considerable overlap near 90°. 

The three zinc foils, supplied by the Isotope Division, 
Oak Ridge National Laboratory, are approximately 10 
mg/cm? thick, and as measured by scanning with soft 
Cu x-rays are known to +1.6%. As foils, these samples 
are of poor quality; the above error is a conservative 
estimate based on pinhole density and size in relation 
to beam size. The composition of the foils is shown 
in Table I. 

PROCEDURE 


Three exposures (one for each foil) were made on 
each film. Before and after the exposures beam energy, 
alignment, and current integration are checked. The 
Faraday cup feed-back electrometer circuit was that 
described by Caldwell and Royden,‘ and was calibrated 
by their current-time method. The effective proton 
energy for the elastic scattering, including beam drift, 
spread, and zinc foil thickness was 20.35+0.25 Mev, as 
found from the ranges in aluminum reported by Bichsel, 


~ 4D. O. Caldwell and H. N. Royden, Rev. Sci. Instr. 27, 91 
(1956). 


<0.10 


0.3 +0.1 


Na 
<0.04 


Zn? Mg Cu Cd 


Ag 


<0.02 <0.15 


<0.04 


1.6 +1.0 <0.04 


<0.04 <0.02 


Mozley, and Aron. The developed emulsions were 
mounted on a special stage with 1111 cm* movement 
and tracks were counted and analyzed under 900X 
magnification. First the index strips are located on 
each trace by counting behind the strips; Fig. 1 is a 
typical example. Next three depth-number histograms, 
evenly spaced, are taken between each set of index 
strips. Finally, top surface track density is counted 
between the histogram points. To be counted, a track 
must enter the film at a sufficiently small angle and 
must end properly. Two of the histograms are shown 
in Fig. 2, each with a different inelastic scattering frac- 
tion. Interpolated inelastic scattering corrections can be 
made from these graphs for the intervening top surface 
points. The inelastic scattering fraction can, in most 
cases, be determined to better than 1% due to the 
23% energy resolution of the detector and the fact 
that the three enriched foils are for zinc isotopes whose 
first excited levels are at about 1 Mev. Figure 3 is an 
exposure made to check the range-energy calibration of 
the emulsion and clearly shows the ease in separating 
elastic from inelastic scattering. 

The detector area is defined by a sequence of adjacent 
squares, each being determined by a 10X10 whipple 
disk mounted in one of the microscope oculars. These 
areas are measured to +1.2%. 

In addition to the inelastic correction or interpolated 
inelastic correction described above, three other correc- 
tions are applied to the data. These account for multiple 
scattering in the aluminum abosrber in front of the film, 
multiple scattering in the zinc target foils, and for finite 
geometry. The final relation between the number of 
elastic protons counted Yo and the scattering cross 
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Fic. 3. Energy calibration of the film. Proton peaks for elastic 
scattering and for inelastic scattering from the 0.99-Mev level 
of Zn. 


6 Bichsel, Mozley, and Aron, Phys. Rev. 105, 1788 (1957). 
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Fic. 4. Corrections at the first minimum for Zn®. The top 
dotted curve is from the surface track counting. The next dashed 
curve shows the results of the subtraction of the inelastic scattering 
contribution. The solid curve shows the final results after the 
other corrections are made. The bottom dashed curve shows the 
inelastic scattering contribution. 


section is 

LN 2wo2h 1+ A, 
V9» =—o(0)O ( — “partactay) 
A R? 1 

Here o(@) is the differential elastic scattering cross 
section in cm?/sterad, 00 is the scattering angle defined 
from the center of the foil to the center of the detector, 
t the zinc thickness in g/cm?, Vo Avogadro’s number, A 
the average atomic weight for the target, Q the total 
number of protons, 2wo detector width, 2/4 detector 
height, R film radius, and the A’s are the various correc- 
tions. The solid angle term 2w»2//R? is not used directly, 
but instead is found from the angular calibration of the 


TABLE II, Summary of experimental errors. 


Standard 
deviation* 


Source Term 


Number of tracks counted (includes 
inelastic subtraction) 

Correction procedure 

Uncertainty in average atomic 
number 

Foil thickness 

Ratio detector height/width 

Detector width in radians 

Total beam current 


~0.04 
<0.01 


dYo/ Vo 
dA/A 


~0.002 
0.016 
< 0.008 
0.016 
0.005 


dA/A 
dt/t 
df/f 
2d6,./0, 
dQ/Q 


Combined standard deviation ~0.05 


* Most of the points have a statistical deviation of 5%, a few as low as 
3%, and a few around 8% at the second and third minima where the 
inelastic scattering correction was high. Separate standard deviations are 
given for each point in Table III. 
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film in the various regions between index wires. This 
method eliminates the necessity to measure R and 
automatically accounts for noncircular deviations in the 
chamber and film holder. 

Ai, the correction for multiple scattering in the alumi- 
num absorber in front of the film, is —}(X?)n~'d*n/dX?, 
where m is the track density, XY is the linear distance 
along film, and (X*) is the rms deviation at the film 
surface. By using the Rossi-Greisen*® formula for cor- 
relating multiple-scattering angle with lateral displace- 
ment in conjunction with the film energy calibration 
of Fig. 3, the lateral displacement of protons on the 
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Fic. 5. The final elastic scattering cross sections 
for protons of 20.35-Mev energy. 


film surface is calculated to be approximately Gaussian 
with ((X*))*=0.014 cm. This value is roughly confirmed 
by counting behind the edge of an index strip stopper 
as shown in Fig. 1. The slit height in front of the films 
is § in.; because of the small value of (X) these slit 
edges in no way affect the counting rates at the center 
of the exposed trace. By putting most of the aluminum 
absorber in contact with the film, A; is decreased and, 
in this case, never exceeded +0.0015. 

Multiple scattering in the zinc target foils is corrected 
by an expression due to Chase and Cox’ and here used 


° B. Rossi and K. Greisen, Revs. Modern Phys. 13, 249 (1941). 
7C. T. Chase and R. T. Cox, Phys. Rev. 58, 246 (1940). 





ELASTIC SCATTERING OF PROTONS: BY: Zn 


TABLE III. Elastic cross sections for 20.35-Mev ‘portens. 


da/dQ Standard 
mb/sterad deviation 


da/dQ 
mb/sterad 


Standard 6 


6 
Lab deviation Lab 





Foil enriched in Zn®4 
222 13 101.3 
183 10 102.6 
171 9 103.9 
134 6 104.8 
120 105.7 
57.0 107.1 
20.6 108.5 
19.2 112.5 
13.0 114.8 
8.06 116.3 
5.95 119.4 
4.26 120.4 
3.60 122.1 
4.12 123.9 
5.23 125.4 
6.78 128.4 
11.8 129.9 
18.0 134.1 
18.7 137.3 
20.7 138.9 
19.7 140.8 
18.9 142.0 
16.8 144.0 
15.7 146.4 
12.1 148.0 
6.91 150.9 
4.65 155.5 
3.60 0.21 158.1 
2.40 0.16 160.0 

Foil enriched in Zn® 
255 ‘ 103.6 
104.7 
172 105.8 
139 108.4 
112.5 
114.8 
118.7 
120.0 
122.5 
123.8 
125.3 
128.2 
129.8 
133.7 


0.12 
0.09 
0.08 
0.09 
0.09 
0.11 
0.11 
0.16 
0.19 
0.21 
0.23 
0.22 
0.21 
0.23 
0.21 
0.21 
0.20 
0.14 
0.15 
0.13 
0.13 
0.12 
0.14 
0.15 
0.11 
0.16 
0.18 
0.12 
0.21 
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0.21 


Re weAoAoGSs 


0.09 
0.09 
0.10 
0.11 


.24 
43 
70 
.98 
.68 
.68 


0.22 
0.24 
0.23 


13.7 
11.8 
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4.02 
0.21 
0.21 
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in the form 


a'(8o) a’ (O0) 1 1 
Fn a) a 
a(0) (0) JLsing sin(@o+ ¢) 


where the + in the denominator is used for reflection 
angles (85°-160° film), and the — for transmission 
angles (30°-95° film) ; 0’ is do(@o)/d0, o”’ is d?a(6o)/dé, 
¢ is the angle between the plane of the zinc foil and the 
beam, and (6?) is the rms deviation for multiple scatter- 
ing for half of the foil thickness. Ag is less than +0.004, 
except near the first, second, and third minima where 
it becomes +0.04, +0.01, and +-0.01, respectively. 
The final and largest corrections account for the 
finite size of beam, the Gaussian nature of the beam 
current, the angular divergence of the beam, and the 
finite detector size. As adapted from Dayton and 





da/dQ Standard 
mb/ ‘sterad dev iation 


4 a /dQ Standard a 
Lab . /sterad deviation Lab 





Foil enriched in Zn*®* 


137.1 
138.6 
141.6 
142.6 
146.0 
147.6 
150.4 
154.9 
157.2 
159.5 


— 
_ 


78.4 
81.4 
83.5 
88.2 


20.6 
21.6 
15.9 
11.9 
10.7 
6.67 
4.20 
2.93 
1.82 
1.29 
1.17 
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Foil enriched in Zn®* 
279 : 91.8 
192 95.7 
148 96.3 
123 98.9 
103 : 100.8 
48.8 . 102.4 
12.5 104.1 
10.7 105.2 
8.63 108.5 
111.9 
114.1 
116.4 
118.6 
120.3 
121.4 
123.2 
125.4 
127.7 
129.9 
131.0 
133.9 
136.6 
140.0 
141.1 
142.8 
144.5 
145.6 
150.1 
154.6 
159.0 


Schrank,* they are 


1 
As=—[(9e?— 3)d?—w?— ?— 8], 
2Re? 


1 a’ (8) 
E ot at (- ~~) +6cse' 
~ OR? go (00) 
1 (80) 


As=—— ——[w*+S*d"], 
2Ro’ a(8 0) 


where a, oa’, o’, gv, and 6 are defined above, Ro is the 
chamber radius, d? is the rms deviation of the beam on 
the zinc foil, 2w is the projected detector width at the 
chamber circumference, 2/ is the detector height, 26 
is the beam height at the zinc foil, Co is cos@o, So is 
sindo, C is cos(@o— ¢), and S is sin(@o>—¢). A; is small 


8. E. Dayton and G. Schrank, Phys. Rev. 101, 1358 (1956). 
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Fic. 6. Details of the measurements near the first minimum 
in the elastic scattering cross section. 


and varies from —0.0058 to —0.0070 with & the 
dominant term. Ay, the first derivative term, varies 
from —0.03 at 30° and 160° to +0.01 at the minima 
and depends mostly on 2), the beam height. A;, the 
largest correction, is the second derivative term and 
reaches a maximum of +0.08 at the first minima. The 
only large dimension, 6, does not appear in the ex- 
pression for As so that a large slit height can be used to 
get the most beam without sacrificing accuracy. 

The product of all corrections is 0.90 at the first 
minima, 1.02 at 30° and 160°, and follows the shape of 
a(6)) between. While these corrections are high, it 
should be noted that the corrections do not affect the 
angular positions of the diffraction patterns. The correc- 
tions raise maxima and decrease minima as indicated 
in Fig. 4, where a summary of the largest corrections 
found, Zn® at the first minimum, are graphed. Table II 
shows a summary of the sources of the experimental 
error. 

Values for cross sections and derivatives in the above 
corrections are averages over an angular interval of 
1° rms deviation (the current intensity is approximately 
Gaussian on the zinc foils). Except at the first minima, 
these functions vary so slowly with angle that the 
resulting cross sections are independent of this angular 
resolution. This is not true at the first minima and the 
true cross sections here may be 1 or 2% lower than 
those found. 


RESULTS 


Cross sections are given in Table III and Fig. 5. 
Cross sections for the three foils are seen to be most 
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different at the third minimum; for Zn; Zn®, and Zn®™ 
they are, respectively, 1.46, 1.07, and 0.61 mb/sterad, 
where surprisingly all minima appear at the same angle, 
142°. Even though the third minima are quite broad, 
sufficient points were taken to detect the expected 
shifts. Figures 6 and 7 show that at the first minimum 
the angular shift per isotope is about 1° and at the 
second minimum the shift is about 1.5° per isotope; in 
both cases, the Zn®* minimum is at the smallest angle 
and Zn® at the largest angle. The angular positions of 
the first two minima, as well as the angular shift per 
isotope, are quite well predicted by the 1/KR empirical 
relations found by Dayton and Schrank.* 

Preliminary analysis by the UCLA theoretical group® 


indicates that the results can be fitted with the addition 
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Fic. 7. Details of the measurements near the second minimum 
in the elastic scattering cross section. 


of a spin-orbit term. It is expected, however, that 
different parameters (in addition to the different atomic 
weights A) will have to be used for the different isotopes, 
particularly to fit the third minima. 
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Cross sections and angular distributions for the (d,p) reaction to a known /=1 single-particle state of the 
captured neutron have been measured for seven target nuclides between Ti and Ni. Targets with effective 
thicknesses of several hundred kev were used with deuteron bombarding energies of 3.8 and 4.5 Mev. The 
angular distributions were found to be similar, with approximately 2:1 forward peaking and a broad maxi- 
mum at about 60°. In addition, a sharp but relatively weak peak was observed at about 25° for the lighter 
of the target nuclides. This is the angle at which the theory of Butler would predict a maximum for neutron 
capture with /, =1. Analysis of the proton spectra and angular distributions indicates that compound nucleus 
formation contributes less than 25% to the reaction yield at these deuteron energies. 


INTRODUCTION 


NGULAR distributions from (d,p) reactions have 

served as a very useful tool in nuclear spectros- 
copy since, as was first pointed out by Butler,! a unique 
angular distribution corresponds to a particular value 
of the orbital angular momentum of the captured 
neutron. Thus the angular momenta of many states 
have been determined by measurements of (d,p) 
angular distributions. One of the limitations of the 
Butler theory is, however, that it fails to take the 
Coulomb field into account. Thus it is to be expected, as 
both theory and experiment have shown, that at in- 
cident deuteron energies below the Coulomb barrier 
angular distributions would not follow the shape 
predicted by Butler.’ 

In recent experiments* we have studied the proton 
spectra from the (d,p) reaction in a series of nuclei 
between Ti and Ni and have found a well-separated 
level or group of levels which was identified with the 
2p; single-particle or shell-model state of the captured 
neutron. In the present experiment we have attempted 
to study the angular distribution of this group of 
protons at deuteron energies of 3.8 and 4.5 Mev, which 
are below the Coulomb barrier (~5-6 Mev) for these 
nuclei. 


RESULTS 


The deuteron beam of the Argonne Van de Graaff 
accelerator was used to bombard thin-foil targets with 
thicknesses ranging from 2 to 7 mg/cm’, corresponding 
to energy losses of several hundred kev at the deuteron 
energies used. A scattering chamber with collimating 
holes of 0.25-inch diameter at 7.5° intervals on a circle 
of 6-inch diameter was used. The detectors were 
CsI(TI) scintillation crystals mounted on photomulti- 
pliers. Typical proton spectra, recorded in a 256- 
channel pulse-height analyzer, are shown in Fig. 1. 


* Work performed under the auspices of the U. S. Atomic 
Energy Commission. 

1S. T. Butler, Proc. Roy. Soc. (London) A208, 559 (1951). 

*See for example W. Tobocman and M. H. Kalos, Phys. Rev. 
97, 132 (1955). 

3 Schiffer, Lee, and Zeidman, Phys. Rev. 115, 427 (1959). 


The appropriate group of protons was selected from 
the spectrum and the number of counts in this group 
was computed at each angle. Data points were obtained 
between 15° and 135° at 15° intervals, and one point 
was obtained at 157.5° for each target. On some targets 
a measurement was also made at 22.5°. The angular 
distributions at the two bombarding energies are shown 
in Fig. 2. The statistical errors in the points were always 
negligible compared to the instrumental errors and the 
errors in determining the exact shape of the proton 
group. Absolute cross sections are given in Table I. 
DISCUSSION 

The following observations can be made regarding 
the data: 

1. The angular distributions for the lighter target 
nuclides show a fairly sharp maximum around 22°-30°. 
This might well correspond to the peak at 22° in the 
angular distribution predicted by the Butler theory for 


Ep * 4.5 Mev 
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Fic. 1. Typical proton spectra as observed in a 256-channel 
pulse-height analyzer. The proton energy scale is approximate. 
Statistical errors are always less than twice the width of the line. 
The shaded areas indicate the portion of each spectrum that was 
used for the angular distributions. 
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Fic. 2. Angular distributions of the (d,p) reaction with the 
deuteron energies and target nuclides indicated on the figure. The 
statistical uncertainties in the points are less than the diameter of 
the points; other errors are generally less than 10%. 


1=1 stripping. The cross section under this peak is less 
than 10% of the total cross section. 

2. All the angular distributions show a tendency for 
more yield forward of 90° than backward. This effect 
seems to decrease with heavier target nuclei and lower 
bombarding energies. The asymmetries are illustrated 
in Fig. 3(a). 

3. All the angular distributions seem to show a rather 
broad maximum at about 60°-70°. The height of this 
maximum relative to the total cross section does not 
seem to vary appreciably for the various target nuclei 
and the two bombarding energies used. This is demon- 
strated in Fig. 3(b). 

From the persistent lack of symmetry about 90° in 
the angular distributions it follows that the (d,p) 
reaction at these energies and for these target nuclei 
still proceeds, at least in part, by a direct reaction 
mechanism. The thickness of the targets and the 
number of target nuclei and bombarding energies 
studied insures that the features described above are 
not due to interferences between resonance levels in a 
compound nucleus. However, as had been observed in 
another experiment,‘ it is still likely that an appreciable 
fraction of the cross section can be accounted for by the 
reaction proceeding through a well defined compound 
nucleus. Since in the present experiment this part of 


TABLE I. Total cross sections. 


Total cross sections* (10-2? cm?) 


Target nucleus Ea=3.8 Mev Ea=4.5 Mev 


Tie 17 25 
ys 11 17 
Mn‘s> 5 10 
Fe* 10 17 
Ni** 5 8 


* The uncertainty in these cross sections for the 23/2 proton group is 
estimated to be approximately 30%. 

>From reference 4 this group is expected to have a smaller intrinsic 
intensity than the others since the fs;: shell in Mn®** was determined to be 
partially filled. 
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the cross section would have to be symmetrical about 
90°, it would be difficult to say what fraction of the 
cross section was due to compound nucleus formation. 
The maximum fraction of the cross section which could 
be symmetric about 90° varies from about 25% for Ti 
at 4.5 Mev to 75% for Ni at 3.8 Mev. These percentages 
then represent an upper limit for the fraction of the 
(d,p) reaction which could proceed through a compound 
nucleus, as calculated from our data. 

Information regarding the amount of compound 
nucleus formation contributing to the (d,p) reaction 
can also be obtained from the proton spectra alone. It 
is evident from Fig. 1 that the gross structures in the 
proton spectra are very similar to those observed at 
higher bombarding energies in reference 3. Such gross 
structure is possible only if the reaction proceeds by a 
direct stripping process where the probability of popu- 
lating a given state of the final nucleus is determined by 
the probability of a neutron being captured by the 
target nucleus. This is certainly not true for the part of 
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Fic. 3. (a) The ratio of 
forward to backward yields 
in the angular distributions 
as a function of atomic 
number. The 75°, 60°, 45°, 
and 30° yields were aver- 
aged for the forward yields; 
the 105°, 120°, 135°, and 
150° yields were averaged 
for the backward yields. 
Circles and crosses are used 
to designate the 3.8- and 
4.5-Mev data, respectively. 
(b) The ratio of the cross 
section at 60° to the total 
cross section as a function 
of atomic number. 
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the (d,p) reaction which proceeds by way of a compound 
nucleus. Since gross-structure effects definitely are 
present, one can estimate that the compound nucleus 
process contributes less than 25% to the proton spectra. 
This independent estimate is then consistent with the 
one derived from the angular distributions. 

These estimates are also in agreement with the 
amount of compound nucleus contribution that was 
found in the study of the Ca(d,p)Ca“ reaction in 
reference 4. There it was concluded from a study of 
resonances in the excitation function that the compound 
nucleus contribution was at least 20% of the total for 
the same region of bombarding energies as in the present 
work. 

Tobocman’s? calculations of (d,p) angular distribu- 
tions include the effects of the Coulomb field as well as 
the interactions of the deuteron and the proton with the 
nucleus. It remains to be seen whether these calculations 
can fit the observed cross sections and angular distri- 
butions. 
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The cross section for the reaction Li§(n,a)H* has been measured as a function of neutron energy in the 
range 1.2<E,¢8.0 Mev. An essentially back-to-back method was used, with a thin cylindrical Li*I (Eu) 
scintillation crystal placed concentric with and adjacent to a thin-walled ionization chamber containing a 
deposit of fissile material (U8 or Np’). The magnitude of o,,4 as measured in this experiment depends on the 
absolute value of ofiss(U?5*), while the shape of the cross section vs energy curve depends on the known energy 
dependence of ofis,(U*** or Np*87). Statistical and other point-to-point uncertainties in the data range from 
+5 to +9%, while the uncertainty in absolute value of the cross section is +7%. The cross section obtained 
from these measurements decreases monotonically from a value of 0.28 barn at E,=1.2 Mev to 0.051 barn 


at E,=8.0 Mev. 


INTRODUCTION 


HE Li®(n,a)H*® reaction has been studied at 

several laboratories in recent years. The cross 
section for this reaction has been measured by various 
investigators in the kev and Mev region'-* with 
generally good agreement. The principal features of the 
cross-section behavior, as given by previous experi- 
ments, are as follows: a thermal cross section of 945 
barns, a monotonic decrease from thermal to the kev 
region, a resonance at 0.26 Mev of maximum cross 
section ~3 barns, followed by a decrease in cross 
section with increasing energy to a value of 17.6 mb 
at 18.3 Mev. 

Accurate knowledge of the (7,a) cross section in Li® 
is, of course, of value in reaching an understanding of 
the level structure of Li’ as well as a general under- 
standing of the properties of light nuclei. Apart from 
this intrinsic interest, the Li®(7,a)H* reaction is well 
suited for purposes of neutron monitoring and neutron 
spectroscopy. The inclusion of Li® in a proportional 
counter, nuclear emulsion, or scintillation crystal 
permits neutron counting by detection of the alpha 
and triton reaction products. Further, a measurement 
of the total energy of the alpha and triton should 
provide a measure of the energy of the incoming 
neutron, £,, since the total energy released is just 
E,+Q, with Q=4.78 Mev. The incorporation of a 
known amount of Li® in a charged-particle detector, 
along with knowledge of the Li®(n,a)H* cross 
section, should then provide a convenient absolute 
= Operated by Union Carbide Corporation for the U. S. Atomic 
Energy Commission. 

1F. L. Ribe, Phys. Rev. 103, 741 (1956). 

2J. M. Blair and R. E. Holland, reported in Neutron Cross 
Sections, compiled by D. J. Hughes and J. A. Harvey, Brookhaven 
National Laboratory Report BNL-325 (Superintendent of 
Documents, U. S. Government Printing Office, Washington, 
D. C., 1955). 

3G. M. Frye, Jr., Phys. Rev. 93, 1086 (1954). 

4 J. B. Weddell and J. H. Roberts, Phys. Rev. 95, 117 (1954). 

5 Gabbard, Davis, and Bonner, Phys. Rev. 114, 201 (1959). 

6S. J. Bame, Jr., and R. L. Cubitt, Phys. Rev. 114, 1580 (1959). 

&Gorlov, Gokhberg, Morozov, and Otroshchenko, Doklady 
Akad. Nauk S.S.S.R. 111, 791 (1956) [translation: Soviet Phys. 
Doklady 1, 705 (1956) ]. 

8B. D. Kern and W. E. Kreger, Phys. Rev. 112, 926 (1958). 


monitor for monoenergetic, or possibly polyenergetic 
neutrons. As an example, the use of Li®I(Eu) scintil- 
lation crystals for fast-neutron counting and spectros- 
copy has been reported elsewhere.’ In view of the 
possible uses of Li® counters in fast-neutron experiments, 
it was felt that more detailed information on the (n,a) 
cross section in the Mev region would be of value. For 
this reason, a new measurement of the Li®(7,a@)H?* cross 
section, in the energy interval from 1.2 to 8 Mev, has 
been performed and is described here. The cross-section 
points determined in this work were measured relative 
to the previously determined fission cross sections of 
U8 and Np””. 


METHOD AND APPARATUS 


Monoenergetic neutrons were obtained from the 
T(p,n)He’ and D(d,n)He’ reactions, the ORNL 5.5-Mv 
Van de Graaff generator being used to accelerate the 
incident charged particle. Those neutrons emitted in a 
small cone about 0 degrees with respect to the incident 
beam were used in the cross-section determination. 
The neutron energy was varied by changing the energy 
of the charged particle. Neutron energies were deter- 
mined from published tables,’ taking into account the 
loss of charged-particle energy upon penetration of 
target foils and target gas. 

The Li®(2,a)H* events were detected in a Li®I(Eu) 
scintillation crystal. Pulse-height analysis of the counts 
from the crystal permitted a determination of those 
(n,a) events arising from primary target neutrons. The 
fast-neutron flux was determined by counting fission 
events occurring in a thin-walled ionization chamber 
containing a deposit of either U** or Np*’. Three 
separate experiments were performed in the course of 
the measurements; different Li®I (Eu) 
crystals, and fission detectors were employed in the 


geometries, 


various runs. A description of the principal features in 
each case follows. 


®R. B. Murray, Nuclear Instr. 2, 237 (1958) 
10 J. L. Fowler and J. E. Brolley, Jr., Revs. Modern Phys. 28, 
103 (1956). 
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Fic. 1. Typical pulse-height spectrum from monoenergetic 
fast neutrons with Li®I(Eu) crystal, at liquid nitrogen tempera- 
ture, in apparatus used for experiment 1. Neutrons were generated 
by T(p,n)He' reaction in solid target of ZrT on platinum. 


Experiment Number 1 


A Li®I(Eu) crystal (3-mm_ thickX35-mm_ diameter) 
was contained in a low-temperature apparatus which 
maintained the crystal at liquid nitrogen temperature. 
The purpose of cooling the crystal was to take advantage 
of the improvement in fast-neutron resolution obtained 


by this method.’ The crystal and photomultiplier tube 
(RCA 6342), contained in a thin-walled evacuated 
chamber, were physically separated by a distance of 2 
in. so that the photomultiplier was operated near room 
temperature. A thin, polished aluminum sleeve served 
as a light guide between the crystal and photocathode 
A typical pulse-height spectrum, resulting 


surface. 
from 2.52-Mev neutrons, is shown in Fig. 1. 


_— HIGH VOLTAGE CONNECTOR 











AND FH. W. 


SCHMITT 


In the cross section measurement, the apparatus 
was arranged so that the crystal was located at an 
accurately known distance from the neutron source 
(4.34 in.). A solid target of ZrT evaporated onto a 
platinum backing was used; neutrons were obtained 
from the T(p,m)He?’ reaction. The pulse-height spectrum 
was recorded on a multichannel analyzer; at the same 
time, the neutron flux was carefully monitored by 
several auxiliary neutron counters and a beam-current 
integrator. The scintillation counter assembly was then 
replaced by the fission chamber in such a manner that 
the fission deposit was exactly the same distance from 
the target as the Li®I(Eu) crystal. In this case the 1-in. 
diameter fission deposit of U** was electroplated in 
the form of U;O0x on a 0.010-in. thick platinum backing. 
The U** mass was determined to be 580 yg, as discussed 
below. The number of fission events per monitor count, 
or per beam-current integrator count, was then deter- 
mined to establish the absolute neutron flux which had 
been incident on the Li®I(Eu) crystal. 


Experiment Number 2 


A 10-mm thick by 38-mm diameter Li®I(Eu) crystal 
was mounted directly behind, and concentric with, the 
fission deposit (U** as in experiment number 1) in 
the ionization chamber. A schematic drawing is given in 
Fig. 2. The crystal was operated at room temperature. 
The apparatus was placed at a known distance from 
the neutron source which was, in this case, a 3-cm-long 
gas cell containing tritium gas at a pressure of about 1 
atmosphere. Because of the vastly different counting 
rates in the fission counter and crystal, it was necessary 
to take the scintillation counter data at a low beam 
current (~0.1 wa) and the fission data at a high current 
(~10 wa). The correlation between the fission data and 
the Li®I(Eu) crystal data was again determined by 
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careful monitoring of the neutron flux with the auxiliary 
counters and beam-current integrator. It was, of course, 
necessary to make a geometrical correction to take 
account of the separation in distance between the 
fission deposit and the crystal. 


Experiment Number 3 


Exactly the same geometry was used as in the 
preceding experiment. In this case, however, a smaller 
scintillation crystal was used (2.5-mm thick by 19-mm 
diameter) and the fission deposit was made of Np””. 
The use of Np*” as a fission monitor rather than U*** 
has a twofold advantage in that (a) the fission threshold 
occurs at a lower energy, and (b) the greater fission 
cross section of Np*’ permits higher fission count 
rates. The Np deposit was vacuum evaporated as 
NpF, on a 0.010-in. thick platinum backing in a 1-in. di- 
ameter layer of thickness ~ 330 yg/cm*. The concen- 
tration of fissionable impurities in the Np deposit was less 
than 1 part in ~ 10°. As a result of these changes in the 
crystal and fission deposit, the counting rates in the 
crystal and ionization chamber were more compatible 
than in experiment number 2, permitting the simul- 
taneous recording of data from the two detectors. 
Neutrons were generated in the gas target, containing 
either tritium or deuterium, by the T(p,z)He*® and 
D(d,n)He’ reactions. A typical pulse-height spectrum 
from D(d,n)He* neutrons is given in Fig. 3. In an 
auxiliary experiment, the fission counting rate from 
the Np**” deposit was determined relative to that of the 
previously used U** deposit by simultaneous counting 
of fission events in a back-to-back geometry. For this 
experiment the right half of the fission chamber as 
shown in Fig. 2 was replaced by the twin of the left 
half, permitting back-to-back counting. The relative 
fission count rates were determined at 5 neutron energies 
between 4.3 and 8.0 Mev. This comparison permitted 
analysis of all data in terms of the absolute value of 
the U*** fission cross section only, as will be discussed 
below. 


ANALYSIS OF DATA 


The calculation of the Li®(,@)H* cross section from 
the experimental data of the first two experiments was 
performed according to the following equation: 


Ona(En) (na) counts mass of U8 
Criss(E,) fission counts mass of Li® 


at. wt. Li® (U3) 


xX (1) 


= ~_ » 4 = _ 
at. wt. U8 = (Li) 


where ona(E,) represents the Li®(”,a)H* cross section 
at neutron energy Fy, criss(E,) represents the known 
fission cross section of U8 at the same energy, and 
(U8) /¢(Li*) represents the ratio of the neutron flux 
(averaged over the detector) at the position of the 
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Fic. 3. Typical pulse-height spectrum from monoenergetic 
neutrons generated by D(d,n) He’ reaction in gas target. Li®I (Eu) 
crystal was contained in apparatus of Fig. 2. 


fission deposit to that at the scintillation crystal. In 
order to facilitate the discussion of data analysis and 
the estimated uncertainties, we will consider each 
member of Eq. (1) in turn. 

It is seen from Eq. (1) that the experimentally 
determined quantity is the ratio on@(En)/otiss( En). 
The numerical value of this ratio for each neutron 
energy is listed in column 2 of Table I. In order to 
determine o,(£,), the ratio is multiplied by the 
known value of oriss(,) for U8 taken from the 1958 
compilation of Allen and Henkel." The value of criss of 
U6 at each neutron energy, as obtained from reference 
11, is listed in column 3 of Table I. 

The number of Li®(”,a)H* events in every case was 
determined from the observed pulse-height spectrum 
as the total number of counts in the fast-neutron peak. 
A correction was made, where necessary, to take account 
of pulse pileup in the electronics. The maximum 
correction was of order 1%, and an uncertainty of 4 
the correction was assigned. In determining the number 
of (n,w) counts, it is necessary to include counts con- 
tained in the extrapolated tail on the low-energy side 
of the fast-neutron peak (e.g., Fig. 3). The extrapolation 
was performed by subtracting a symmetrical slow- 
neutron peak (whose shape was determined in auxiliary 
runs) from the observed pulse-height spectrum. The 
fraction of the total (,a) counts contained in the tail 
was usually of the order of a few percent, reaching 
~20% in the worst case. An uncertainty of 3 of the 
extrapolated area was assigned in each case. A further 
correction to the data must be applied to account for 
loss of alphas and tritons from the faces of a thin crystal. 
For a 2.5-mm thick crystal the correction amounted to a 
few percent, reaching a maximum of 6% at 8 Mev. An 
uncertainty of 4 the correction was assigned. 

1 W. D. Allen and R. L. Henkel, Progress in Nuclear Energy 


(Pergamon Press, London, 1958), Ser. 1, Vol. 2. Also R. L. 
Henkel, Los Alamos Report LA-2114, 1957 (unpublished). 
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TaBLE I. Measured values of Li®(n,a)H* cross section using 
U** fission monitor. The neutron source was T(p,n)He*. Errors 
listed in column 4 are point-to-point uncertainties of type (1) 
as discussed in text. Values of ofis,(U**) from Allen and Henkel.* 


Experi 


tise (U8) 
ment No. 


En (Mev) @5.a/¢@ties(U™*) (barns) ¢,.a(barns) 


0.260+0.018 
0.240+0.017 
0.200+0.013 
0.182+0.012 
0.144+0.010 
0.138+0.010 


0.241+0.013 
0.198+0.010 
0.156+0.008 
0.125+0.007 
0.103+-0.005 


0.465 
0.545 
0.545 
0.537 
0.535 
0.535 


0.535 
0.545 
0.535 
0.540 
0.540 


0.559 
0.440 
0.367 
0.339 
0.269 
0.258 


0.450 
0.363 
0.292 
0.231 
0.190 


NK ~ Ue sD 
—AsmR~N 


FPwWwWNNmM wWWwhnnd— 
SMe nS Beenie: 
wowsd.— 

BW DOD DO me oe ee ee ee 


* See reference 11. 


Fission events were recorded as counts occurring 
above the integral pulse-height discriminator level in a 
pulse amplifier. The discriminator bias was set to 
reject counts from natural alpha particles and any other 
backgrounds. Since the fission deposit was of finite 
thickness, those fission fragments emitted almost 
tangential to the deposit gave rise to pulses occurring 
below the discriminator setting. The correction for this 
effect was determined from an integral bias curve, and 
amounted to 2-3% with an estimated uncertainty 
of 1%. Finally, since the fission fragments are detected 
in the backward hemisphere (Fig. 2), a small fraction 
of the fission events are not registered, due to the 
forward motion of the center-of-mass of the fragments. 
The magnitude of the correction arising from this effect 
is neutron-energy dependent and is influenced by the 
anisotropic emission of the fragments.” The net result 
is a correction of order 1%, with an estimated un- 
certainty of +1%. 

The mass of the U** deposit was determined by 27- 
counting of U** alpha particles in the ionization 
chamber of Fig. 2. The number of alpha disintegrations 
was determined from the observed alpha-particle 
pulse-height spectrum, taking into account the low- 
energy tail arising from those particles emitted at a 
small angle to the layer. The mass was calculated from 
the alpha count rate using a half-life of 4.51 10° years, 
and a counting geometry factor of 0.52 to account for 
alphas back-scattered from the platinum foil.’ The 
isotopic concentration of the U** differed from unity 
by less than 1 part in 10°. The estimated uncertainty 
in the U8 mass is +3%. 

The mass of Li® in the scintillation crystal was 
determined by careful weighing of the crystal on an 
analytical balance, combined with an isotopic analysis 


2 R. L. Henkel and J. E. Brolley, Jr., Phys. Rev. 103, 1292 
(1956). 

'8 J. A. Crawford, The Transuranium Elements: Research Papers 
(McGraw-Hill Book Company, Inc., New York, 1949), Paper No. 
16.55, National Nuclear Energy Series, Plutonium Project 
Record, Vol. 14B, Div. IV, Part IT. 
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for Li® content (~ 96%). The Li® mass is considered to 
be known in all cases to +3%. 

The ratio ¢(U***)/¢(Li®) was calculated from the 
published angular distributions of source neutrons" 
and was checked by an experimental determination of 
neutron flux vs distance from the target. The calculated 
and experimental values of flux vs distance were in 
excellent agreement. It is also noted that the ratio 
¢(U**)/¢(Li®), applied to this experiment, is relatively 
insensitive to the neutron angular distribution (hence 
neutron energy). It is determined primarily by the 
spatial separation of the fission and Li® detectors and 
the source-to-detector distance, quantities which are 
well known from the dimensions of the apparatus and 
experimental arrangement. In a typical case, the ratio 
was 1.15 and is considered to be known to +3%. 

Finally, careful consideration was given to the 
question of neutron scattering and attenuation in the 
apparatus. The number of counts in the fast-neutron 
peak which arise from neutrons reflected to the crystal 
by the Lucite light pipe was estimated, on the basis of 
a Monte Carlo calculation, to be at most a few percent 
in each case. Experimental checks at several neutron 
energies were made to determine the transparency of the 
ionization chamber to neutrons and the effect of neutron 
scattering in the walls of the chamber. Calculations 
were made to determine the attenuation of the neutron 
beam, as well as the effects of elastic scattering, in the 
crystal. Each of the effects discussed above was small 
(a few percent or less) and, further, tend to compensate. 
An uncertainty of +3% in the evaluation of these 
corrections is included. 

In Experiment No. 3, in which the Np**” deposit was 
used, the calculation of on,a(E,) follows Eq. (1) with 
the U*** terms replaced by Np**’ terms. In this case it 
may be noted that 


Tn,a( En) & of iss” (En) X Mass(Np). (2) 


The mass of the Np”? deposit was determined by 
comparison fission counting at several neutron energies 
as indicated previously. Considering a comparison 
fission counting run at neutron energy E,, the Np? 
mass is obtained from 


Mass(Np) « Mass(U) 
Np fiss cts (E.) — otis” (E-) 


x— x manmween, (9) 
U fiss cts (E,) — otiss®?(E,) 





The Np mass was obtained by averaging the results of 
five runs at neutron energies between 4.3 and 8.0 Mev; 
the average mass was 1.627 mg with a standard devia- 
tion of 1.7%. Combining Eqs. (2) and (3) it is seen 
that the resulting expression for on, contains @fiss‘” 
both in the numerator and denominator. The evaluation 
of on.2 thus depends on the shape of the Np”? fission 
cross section vs energy curve, but is independent of its 
absolute magnitude. The Np’ fission cross section 





CROSS SECTION FOR *Ei* (ea) 1" 


REACTION 





0.35 








T(p,7)He’, EXPERIMENT NO. | 
T(p,7)He’, EXPERIMENT NO, 2 
T(p,0)He®, EXPERIMENT NO. 3 


Dig, )He®, EXPERIMENT NO, 3 











Fic. 4. Measured _ 0.20 } 
values of  Li®(n,«)H* 
cross section as a func- 
tion of neutron energy. 
The base of the triangles 
represents typical target 


o@(7,a) (borns) 




















; : 0.15 
thickness, i.e., full neu- 


tron energy spread. 














—— UNCERTAINTY IN ORDINATE = £7% 


——— 








{ 
| 
| 


| 


N b | 


' 





used in this analysis is that given in a recent experi- 
ment! based on measurements of the ratio o¢iss(Np””)/ 
Ot iss (U3), 

The errors associated with the determination of 
ona(E,) have been separated into two types: (1) 
standard errors from counting statistics and other 
uncertainties which vary from point to point, and (2) 
those uncertainties which are energy independent, e.g., 
the mass of the U*** deposit and the absolute magnitude 
of the U* fission cross section. The latter quantity is 
considered uncertain to +5%.'° Errors of type (1) were 
added as the square root of the sum of squares and are 
shown on the individual data points of Fig. 4. The 
uncertainty in the magnitude of co, arising from item 
(2) is estimated to be +7%. Finally, it should be noted 
that the shape of the curve in Fig. 4 is based directly 
on previous measurements of @¢iss(U*) and a ¢iss(Np”*”) 
vs energy. 


RESULTS AND DISCUSSION 


The results of the three runs described above are 
presented in Tables I and II and Fig. 4. The base of the 
triangles in Fig. 4 represents the total neutron-energy 
spread at typical points, determined from a calculation 
of the charged-particle energy loss in passing through 


4H. W. Schmitt and R. B. Murray (to be published). 
18 R, L, Henkel (private communication, 1958). 


NEUTRON ENERGY (Mev) 


the target. It is of interest to compare the results 
presented in Fig. 4 with those obtained from other 


TABLE II. Measured values of Li®(m,a)H® cross section using 
Np*7 fission monitor (experiment No. 3). Errors listed in column 
4 are point-to-point uncertainties of type (1) as discussed in text. 
Values of ofiss(Np*’) from Schmitt and Murray.* 


Neutron 
source 


@tiss (N p%7) 
(barns) 


En Crna 
(Mev)  atiss( Np?) 
1.20 0.163 
ize 0.183 
1.31 0.163 
1.44 0.152 

56 0.150 
.64 0.164 
72 0.155 
81 0.151 
.92 0.157 
2.04 0.148 
2.16 0.141 
2.39 0.140 
2.92 0.112 
3.82 0.0820 
3.60 0.0893 
3.82 0.0871 
4.33 0.0723 
4.88 0.0632 
5.37 0.0596 
5.90 0.0459 
6.39 0.0362 
6.90 0.0300 
7.42 0.0245 
7.96 0.0256 


On,a (barns) 


0.272 +0.025 
0.308 +0.023 
0.276 +0.025 
0.261 +0.018 
0.261 +0.017 
0.284 +0.014 
0.270 +0.014 
0.264 +0.013 
0.276 +0.014 
0.260 +0.013 
0.247 +0.012 
0.244 +0.012 
0.191 +0.010 
0.128 +0.007 
0.143 +0.008 
0.136 +0.007 
0.106 +0.006 
0.0892+0.0045 
0.0835+0.0044 
0.0714+0.0037 
0.0660+0.0036 
0.0596+0.0033 
0.0510+0.0028 
0.0546+0.0031 


T(p,n) He 
T(p,n) He’ 
T (p,n) He 
T (p,n) He’ 
T(p,n) He’ 
T (pn) Heé 
T (pn) He® 
T (p,n) He 
T(p,n) He’ 
T(p,n) He’ 
T (p,n) He® 
T (p,n) He’ 
T(p,n) He’ 
T (p,n) He’ 
D (d,n) He® 
D (d,n) He® 
D (d,n) He? 
D (d,n) He’ 
D (d,n) He® 
D (d,n) He’ 
D (d,n) He® 
D (dn) He® 
D (d,n) He* 
D (d,n) He® 


Sea 


wn AD 


a ee 


a 


* See reference 14. 
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experiments. In the experiment of Ribe! it was necessary 
to apply a correction to all of the measured cross 
sections, except two at 2.50 and 14.1 Mev, in order to 
account for the energy dependence of the long-counter 
detection efficiency. According to a recent communi- 
cation'® the long-counter correction factor as used in 
Table I of his report! was applied inversely. Thus, his 
corrected cross section at a neutron energy of 0.88 Mev 
should read 323+34 millibarns, that at 6.52 Mev 


16 F. L. Ribe (private communication, 1959). 
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should read 55+10 millibarns, etc. With this revision, 
the results of Ribe’s experiment! and those of the present 
experiment are in quite good agreement. The data of 
Gabbard ef al.5 are somewhat lower in the region 
1-3 Mev. 
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Measurements on two jets, a 0+36p jet with primary energy about 2500 Bev and a 44+29a jet with 
primary energy about 8000 Bev/nucleon, are presented. It is shown that the main features of the angular 
distribution of particles from these two primary jets and a third energetic secondary jet can be explained 
satisfactorily by the model in which mesons created in a nucleon-nucleon collision are considered to be 
radiated isotropically from two centers. Further implications of this model are discussed; in particular it 
is shown that definite restrictions are imposed on the values of energy and angle of emission of secondary 
particles; the average transverse momentum is predicted to be relatively constant but to have a lower value 
not only in the forward and backward directions but also around 90° in the center-of-mass system of the 
two colliding nucleons. While it is shown that the available experimental data are in accord with these 
predictions, more events with energies in the region of 1000 Bev must be studied before a definite conclusion 


can be reached. 


1. INTRODUCTION 


URING the past few years several dozen inter- 

actions of particles with energies above 1000 
Bev have been studied in nuclear emulsions. Of these 
events, those in which no or very few low-energy 
particles are emitted are most likely to represent single 
nucleon-nucleon collisions and therefore are more 
amenable to theoretical interpretation. Several theo- 
retical models have been proposed.'"® Recently Ciok 
et al.,7 Cocconi,*® and Niu® have shown that the majority 
of such interactions show an anisotropic distribution 
which can be interpreted as the emission of particles 


* National Research Council Postdoctorate Fellow. 

t Present address: Laboratory of Nuclear Studies, Cornell 
University, Ithaca, New York. 

1 Lewis, Oppenheimer, and Wouthuysen, Phys. Rev. 73, 127 
(1948). 

2 W. Heisenberg, Vortrdge tiber Kosmische Strahlung (Springer 
Verlag, Berlin, 1953), p. 148; and K. Symanzik, ibid, p. 563. 

3 E. Fermi, Phys. Rev. 81, 683 (1951). 

4S. Takagi, Progr. Theoret. Phys. (Kyoto) 7, 123 (1952). 

5S. S. Belenki and L. D. Landau, Suppl. Nuovo cimento 3, 15 
(1956). 

®W. L. Kraushaar and L. J. Marks, Phys. Rev. 93, 326 (1954). 

7Ciok, Coghen, Gierula, Holyfski, Jurak, Miesowicz, 
Staniewska, Stanisz, and Pernegr, Nuovo cimento 8, 166 (1958); 
Ciok, Coghen, Gierula, Holyfski, Jurak, Miesowicz, Staniewska, 
and Pernegr, Nuovo cimento 10, 741 (1958). 

8G. Cocconi, Phys. Rev. 111, 1699 (1958). 

°K. Niu, Nuovo cimento 10, 994 (1958). 


from two radiating centers created during the inter- 
action rather than from a single center stationary in the 
center-of-mass system of the two colliding nucleons. 

In the present work two interactions with energies 
over 1000 Bev have been studied; one is an event of 
type 0+36p (Fig. 1), presumably an interaction of a 
proton with a proton in the emulsion; the other is of 
type 4+ 29a. They were found in a stack of G-5 emulsion 
which had been exposed at 135 000 ft for 8 hours over 
Texas at a geomagnetic latitude of 41°N. The incli- 
nation in the emulsion of both events makes them 
suitable for study. These events and interactions of 
several of their secondaries are considered here from 
the standpoint of the two-center model. 


2. MEASUREMENTS 


The angles of emission of all tracks in relation to the 
direction of the primary were measured. The results for 
the proton jet and for the a-particle jet are given in 
Table I. Measurements were normally made with 
respect to a reference track whose direction relative to 
the primary was carefully determined. 

Tracks in the central core of the proton jet travel 
about 2.5 cm from the interaction in the same plate, 
and a total of about 12 cm in the stack. All the tracks 
in the proton jet were followed individually until they 
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interacted or left the first plate. A track length of 67.1 
cm was followed and three interactions were found 
of types 14+19p, 0+3p, and 4+139. A star of type 
3+4n occurred at a distance of 755 yw from the origin, 
due to a neutral particle emitted at an angle of 0.014 
rad. Another secondary interaction of a particle in the 
inner cone, of type 5+3%, was found in another plate. 
Two interactions, of types 0+14n and 2+99, were 
found in the core of the a-particle jet; as discussed in 
Sec. 4 it is possible that the neutral interaction is that 
of a neutron of the original a-particle. The core of the 
a-particle jet travels only 1.5 mm in the first plate and 
about 5 mm per plate for a total distance of 23 cm. 


3. KINEMATICS OF NUCLEON- 
NUCLEON COLLISIONS 


Before examining the interactions we shall state here 
the relations to be used in their interpretation as 
nucleon-nucleon collisions. 

If the velocity of the center-of-mass system of the 
colliding nucleons (the C-system) is 8, in the laboratory 
system and if a particle is emitted at an angle @ with a 
velocity 8, in the C-system, the angle of emission, a, in 
the laboratory system is given by 


sin 


, (1) 
(8./8,)+cosé 


7. tana= 


where y,-=1/(1—8,7)'. If 8./B.1 and @ is not too close 
to 180°, then 


7. tana=tan(6/2). (2) 


If equal numbers of particles are emitted forwards 
and backwards in the C-system, the median angle, 
@m, Of tracks in the laboratory system must correspond 
to an angle 6=90° in the C-system. In this case 


7- tana,,=1, (3) 


which permits the determination of y,. In the method 
of energy determination introduced by Castagnoli 
el al., the value of a corresponding to the center of 
gravity of the distribution in log tana is used in Eq. 


TABLE I. Angles of tracks from the primary jets. 


Jet Values of tana 


0.0014 0.0056 0.011 
0.0014 0.0056 0.0122 
0.0017. 0.0056 0.013 
0.0026 0.0085 0.0131 
0.0031 0.0087 0.016 
0.0052 0.0101 0.016 


0.016 0.056 
0.018 0.056 
0.019 0.061 
0.031 0.084 
0.040 0.106 
0.052. 0.11 


0.119 
0.176 
0.205 
0.323 
0.462 
0.510 





0+36p 


0.00015 
0.0002 
0.0003 
0.0004 
0.0004 


0.0034 
0.0036 
0.0039 
0.0049 
0.0059 


0.0083 
0.0168 
0.044 
0.045 
0.118 


0.27 
0.28 
0.45 
1.21 


0.0008 
0.0010 
0.0012 
0.0018 
0.0023 


0.121 
0.125 
0.13 
0.132 
0.23 


10 Castagnoli, Cortini, Moreno, Franzinetti, and Manfredini, 
Nuovo cimento 10, 1539 (1953). 
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Fic. 1. Projection 
tracing of 0+36p jet. A 
secondary interaction of 
type 4+3n which occurs 
755 w from the origin is 
shown. 
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(3) instead of the median value a,,. These authors have 
calculated the correction to be applied when the 
approximation 6,/8,=1 is not made; the equation for 
Y- is 

logy. + (log tana), +C=0, (4) 


where C is a term whose value may be calculated if the 
energy spectrum and angular distribution of the emitted 
particles are known. 

To a sufficient degree of accuracy, the laboratory 
energy of the incident particle is 2y2 Bev, which holds 
for any incident particle provided the target is a 
nucleon. 

The form of the distribution of C-system angles in a 
jet can be made readily apparent by plotting against 
log tana the value of log[ F/(1—F) ] for each secondary 
track, where F is the fraction of particles whose angles 
fall within the angle a. The distributions predicted by 
various models are shown in Fig. 1 of the paper by 
Cocconi.* Subject to the adequacy of the approximation 
8.=8,=1, a distribution which is isotropic in the 
C-system gives a straight line with slope 2, whereas the 
anisotropic distributions predicted by Heisenberg, 
Fermi, and Landau have slopes equal to or about 1. 

In each of the theories of Heisenberg, Fermi, and 
Landau, particles are considered to be emitted from a 
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Fic. 2. The angular dis- 
tribution of tracks from the 
0+36p jet and from inter- 
actions of its secondaries. F 
is the fraction of the par- 
ticles whose angles fall 
within the angle a. 











-% 


single center. In the theory of Takagi, developed by 
Kraushaar and Marks, and in the models shown to be 
successful by Ciok et al., Cocconi, and Niu, the inter- 
action of the nucleons is imagined to create two centers 
which radiate particles; these centers move in opposite 
directions in the C-system with Lorentz factors 7; in 
the laboratory system these centers have Lorentz 
factors y; and v2 and give rise to an angular distribution 
with inner and outer cones of particles. The angular 
distribution of particles emitted from each center is 
expected to be isotropic. 


4. THE ANGULAR DISTRIBUTION AND 
PRIMARY ENERGY 


The angular distributions of the particles from the 
proton and a-particle jets and from the interactions of 
their secondaries are shown in Figs. 2 and 3, where 
logl F/(1—F) ] is plotted against log tana. 


The Proton Jet 


Che median angle of the proton jet is 0.016 radian, 
and from Eq. (2) we obtain a value of about 60 for y., 
and an approximate energy of 7200 Bev for the primary 
proton. 

It is clear that the distribution of particles from the 
proton jet is not isotropic in the C-system. Since the 
average slope of the graph is about 1, an interpretation 


based on Heisenberg’s theory is possible. The tracks 
can however be divided into inner and outer cones— 
21 in the inner, 15 in the outer—and this makes it 
natural to consider the event as a case where secondary 
particles are emitted from two radiating centers. The 
inequality in the numbers of charged particles from the 
two centers is not statistically significant and appears 
smaller if we consider that any particles emitted with 
unusually low transverse momentum are likely to 
appear in the inner cone. 

On the basis of the two-center model it appears that 
particles are emitted from systems, moving backwards 
and forward in the C-system, which are characterized 
by Lorentz factors in the laboratory system y:~125 
and y2~10. The Lorentz factor for the C-system of the 
two nucleons is now determined by the relation 
¥- = (yr7¥2)!,° which gives y./~35 and a primary energy 
~ 2500 Bev. 

Taking the particles assigned to each system sepa- 
rately and plotting their angular distributions on similar 
diagrams, Fig. 4, we find that the slope of each graph 
is about 2, which is consistent with isotropic emission 
from each center. 


The e-Particle Jet and its Secondary 
Neutral Jet 
From Fig. 3 it is clear that the primary energy of the 
a-particle jet is even less well defined by the median 
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Fic. 3. The angular dis- 
tribution of tracks from the 
4+29a jet and from inter- 
actions of its secondaries. 


angle than is the energy of the proton jet. The dis- 
tribution of particles shows a more marked separation 
into inner and outer cones; 17 tracks are found in the 
inner cone and 12 in the outer. If we postulate separate 
radiating centers for the particles in each cone, these 
centers must have Lorentz factors yi~700 and y2~6, 
giving y.-~65 and a primary energy 8000 Bev/nucleon, 
assuming the event consists of one or more interactions 
of single nucleons of the a-particle with single nucleons 
of the struck nucleus. 

The interaction of type 0+14n of a neutral particle 
in the inner core also shows a double cone structure. 
If we postulate that emission of particles has taken 
place from two centers, the Lorentz factors we assign 
to the two systems are almost the same as those of the 
a-particle jet, y:~800 and y2~8, and the resulting 
value of y.’ is about the same: y,./~80. This similarity 
suggests that this event may be the interaction of a 
neutron from the original a-particle. 

The distribution of particles in each cone is con- 
sistent with isotropic emission in its own system, with 
the exception of the inner cone of the a-particle jet for 
which the graph has a slope of about 1; this could be 
caused by inclusion of some electron pairs among the 
tracks in the forward cone. 


The Secondary Interactions 


The angular distributions of tracks from the inter- 
actions of secondary particles are also plotted in Figs. 
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2 and 3. Estimates of the energies have been made from 
the median angle of the distribution, and in the case 
of the 3+4p event from relative scattering measure- 
ments on its two most energetic tracks. 

It can be seen that none of the secondary interactions 
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Fic. 4. The angular distribution of tracks from the proton jet, 
inner and outer cones being treated separately. 





TABLE IT. Summary of the jet characteristics. 


(a) Jets with energies over 1000 Bey 
Eq. nergy 
Type (3) yi y2 Ye’ 2ye"% K:/pr (Bev) 


2500 «1.9 1.7 
8400 1.7" = 1.94 
13000 1.4 0.9 


0+36p OO 125 10 2.0 35 
44+29a 200 700 6 § 65 
0+14n 20 800 8 : 80 


2500 
8000 
8000 


(b) Secondary jets. 
Energy 


tana Ye 
Eq. (3) (Bev) 


(107%) 


Type 


14+19p 3.1 15 450 
0+3p 5.6 
5+3p 10.1? 
3+4n 14 30 
4+13p 16 5 50 
24+9p 10 200 


* Assuming that the event represents a single nucleon-nucleon collision 


(except the 0+14n event discussed above) shows a 
clearly resolved double cone structure; this need not 
be taken to indicate a difference between the inter- 
actions of nucleons and of secondary particles (pre- 
sumably mainly + mesons). Ciok ef al.’ have shown 
with primary jets that the energy of separation of the 
two centers decreases as the energy decreases and makes 
the two branches unresolved for energies below about 
1000 Bev. This effect appears to operate also among 
these secondary interactions; the event of type 14+19p 
with energy about 450 Bev may have partially resolved 
branches, but those with lower energy have more or less 
isotropic distributions. 

Table II gives a summary of the characteristics of the 
jets considered above. 


5. INELASTICITY 


The inelasticity is the fraction of the total available 
kinetic energy which is radiated as new particles. Since 
the energies of all the particles have not been measured, 
we may estimate them by using the result that the 
transverse momentum has a fairly constant value of 
0.5 Bev/c..—-% 

Edwards ef al." have used the formula 


1 Spr » cscé 


2y. 


the numerator represents the energy of the secondary 
particles in the C-system and the denominator the 
energy of the primary particles also in the C-system, 
assuming that the target particle is a nucleon. In order 
to reduce the experimental error this calculation should 
be made with tracks in the backward cone only. 


4 Edwards, Losty, Perkins, Pinkau, and Reynolds, Phil. Mag. 
3, 237 (1958). 

2 Cooperative emulsion group in Japan, Suppl. Nuovo cimento 
8, 761 (1958). 

3A lower value of average transverse momentum, about 0.3 
Bev/c, is suggested by experiments in Chicago: E. Lohrmann 
and M. W. Teucher (private communication). 
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In the two-center model much of the energy released 
is accounted for as the energy of separation of the two 
radiating centers. The expression for the inelasticity is 
then 

1.5n,7ps, 3n.Vpr 
K,=— — << - . (6) 
2y- TY: 


we have used here the relation which holds for isotropic 
emission from one or more systems: j,=4pr/7. 

Values of Ki/pr and K2/pr for the jets with energy 
over 1000 Bev are given in Table II. 


6. THE ENERGY OF SECONDARY PARTICLES 
Predictions of the Two-Center Model 


It has been shown by the groups in Bristol! and 
Japan” that the transverse momentum of particles 
emitted from jets has an average value around 0.5 
Bev/c'* which is almost independent of the primary 
energy. If this result were taken to hold strictly, it 
would imply that the secondary particles have momenta 
in the laboratory system related to their angles of 
emission, a, by the equation 


p= pr, sina, 


This corresponds to the straight line in Fig. 5 where the 
total energies of secondary particles are plotted against 
their angles of emission, in the laboratory system. 
Obviously this relation must fail at very small angles 
since the energy cannot increase indefinitely. 


s 





=5 Bev/ 
>, c 


p,=0 5 Bev/c 
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Fic. 5. The calculated total energies of secondary particles 
plotted as a function of their angle of emission, both measured in 
the laboratory system. The particles are assumed to be pions 
emitted with momenta from 0.1 Bev/c to 5 Bev/c from two 
radiating centers whose Lorentz factors are y,;=10000 and 
2= 100. If the particles are emitted isotropically from each center, 
90% of them have angles lying along the continuous curves and a 
further 9% have angles on the broken curves. The straight line is 
a line of constant transverse momentum, pr=0.5 Bev/c. 
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If we assume instead that particles are emitted, all 
with the same energy, /,, in the radiating system, the 
relation between E’, the total energy in the laboratory 
system, and the angle of emission, a, is 


FE! =pe'y’ = Ey, (1+B.B» cos6) ; (7) 


y» is the Lorentz factor and 6, the velocity of the radi- 
ating system; no approximations have been made in 
this equation. If we put 6,=8,=1, we get 


2E sy 
E’= E,y»(1+cosé) =—————_-. (8) 
1+,” tan’a 


Assuming that the emitted particles are pions, we 
have calculated the energies of mesons in the laboratory 
system from Eq. (7) and the corresponding laboratory 
angles of emission from Eq. (1); we have taken values 
of the momentum, #,, in the radiating system from 0.1 
Bev/c to 5 Bev/c and made only the approximation 
that 6,=1. The resulting energies are plotted" loga- 
rithmically against log tana in Fig. 5 for emission from 
two centers with values of y, of 10000 and 100, repre- 
senting a jet with y.=1000 and 7=5. 

If emission from each center is isotropic, when 
emission velocities are sufficiently close to c, 90% of the 
mesons are emitted in a range over which tana changes 
by a factor of 10, and 99% in a range over which tana 
changes by a factor of 100. The effects of nonrelativistic 
emission velocities can be seen from the graphs to be 
important for mesons with ~,=0.1 Bev/c but un- 
important for 95% of mesons with p,=0.5 Bev/c. 

It is clear that a two-center model in which the 
momenta of emission from each center are approxi- 
mately constant and distributed isotropically can 
readily be reconciled with the apparent constancy of 
the transverse momentum; an average transverse 
momentum of 0.5 Bev/c will result from isotropic 
emission with momentum p,= (4/r)0.5=0.63 Bev/c. 
This model predicts not only the existence of a maxi- 
mum energy and a maximum angle but also that 
particles emitted at angles around 90° in the C-system 
should have lower values of transverse momentum than 
the majority of those emitted in the main parts of the 
inner and outer cones. 

Further, if emission from the center is isotropic, 90% 
of the mesons fall within definite regions in Fig. 5. 
Clearly, if appreciably more than the expected number 
of particles were found to fall outside the predicted 
regions, the two-center model as conceived here would 
be untenable. 

We note here also that the model predicts that if 
particles as heavy as nucleons are emitted from the 

4 Two different methods of plotting the energies of secondary 
particles emitted from a radiating system have been used by 
H. Huzita [Nuovo cimento 6, 841 (1957) ] and Ya.I. Granovsky 
and I.Ya. Chasnikov [Zhur. Eksptl. i Teoret. Fiz. 36, 1119 


(1959) ], but their functions of energy and angle and the resultant 
plots are Jess convenient than those used here. 
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Fic. 6. The energies of secondary particles which interact, 
derived from their interactions, plotted against angle of emission; 
the results for several primary jets are plotted on the same 
diagram by making a definite assumption for the Lorentz factor 
of the emitting system and normalizing the energy and angle by 
means of this factor, The curves represent energies calculated for 
the common radiating system in the same way as for each of the 
systems in Fig. 5. The sources of the data used here are as follows: 
Ottawa 0+36p: present work; Warsaw 0+14a: Ciok, Danysz, 
Gierula, Jurak, Miesowicz, Pernegr, Vrana, and Wolter, Nuovo 
cimento 6, 1409 (1957); Minnesota 1+6a: provided by P. S. 
Freier (private communication to G. Cocconi); Bristol 0+4¢9, 
0+32p and 0+11p: reference 11, and private communication 
from D. H. Perkins to G. Cocconi. 


radiating centers with momenta less than about 1 
Bev/c, they will appear in the forward halves of both 
the inner and outer cones with higher energies in the 
laboratory than pions of the same momenta in the two 
systems. 


Comparison with Experiment 


The restriction on values of energy and angle of 
secondary particles as well as the lowering of transverse 
momentum around the median angle are effects which 
should be amenable to experimental examination. To 
test this we study here the available measurements of 
secondary energies. In Fig. 6 are plotted energies which 
have been derived from the interactions of the secondary 
particles. The results for three secondaries of the 0+36p 
jet have been plotted on the same diagram as results 
from several other jets by making a specific assumption 
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about the value of y, for the appropriate emitting 
center; where available the values given by Cocconi 
have been used. More extensive data on secondary 
energies is available from scattering measurements; 
Fig. 7 shows the results from three events on which 
details have been published. In Figs. 6 and 7 the 
straight line corresponds to constant transverse mo- 
mentum of 0.5 Bev/c, while the curves represent the 
energies of pions with p,=0.1, 0.2, 0.5, and 1.0 Bev/c, 
emitted from postulated radiating centers. 

Figures 6 and 7 show that the experimental results 
are consistent with the model discussed above. If 
particles are assumed to be emitted from a single center, 
their momenta in this system, the C-system, do not lie 
within the same range for all directions of emission; 
the two-center model, on the other hand, allows us to 
assume that particles are emitted from the two centers 
with momenta in approximately the same range in all 
directions. 

The momenta of particles emitted from the centers 
appear to lie mainly between 0.2 and 0.5 Bev/c. In the 
cases where the energies have been derived from the 
angular distribution of secondary interactions the 
average momentum appears rather higher than in the 
cases where energies have been obtained from scattering 
measurements. This can be explained as a consequence 
of the inadequacy of the approximation 8,=1 which 
leads to an overestimate of energy.!® 


2+15p CHICAGO 








5+39p TURIN 


7. CONCLUSIONS 


We have seen that a model in which mesons are 
considered to be radiated isotropically from two centers 
explains satisfactorily the main features of the angular 
distribution of particles from three of the jets studied 
here. In these interactions, due to a proton, ana particle, 
and a neutral particle (probably a neutron), the primary 
energy (per nucleon) is over 1000 Bev, and there is a 
clear separation between the inner and outer cones. 
The median angle however is not well defined, i.e., is 
highly sensitive to fluctuations in numbers of particles 
£2 6g mpeces in the two cones; therefore the use of the angle where a 
break between the two branches occurs, rather than of 
the median angle or the mean value of log tana, leads 
to a more acceptable estimate of the primary energy. 

The angular distributions of tracks from interactions 
of secondary particles from these jets do not show a 
clear two-branch structure; those with energy below 
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Fic. The energies of secondary particles from three jets, 
named. roy measurement of scattering. The jets represented 
here are as follows: Chicago 2+15p: Glasser, Haskin, Schein, 
and Lord, Phys. Rev. 99, 1555 (1955) ; Turin 5+-39p: Debenedetti, 
Garelli, Tallone, and Vigone, Nuovo cimento 4, 1142 (1956); and 
Moscow 2+14p: Boos, Vinitskii, Takibaev, and Chasnikov, 
Zhur. Eksptl. i Teoret. Fiz. 34, 622 (1958) [translation: Soviet 
Phys. JETP 34, 430 (1958) ]. The solid points represent particles 
whose energies were obtained by relative scattering. The line 
corresponding to pr=0.5 Bev/c is shown, and curves corre- 
sponding to various momenta of emission from two centers have 
tentatively been added. 














NUCLEAR INTERACTIONS 
100 Bev correspond well to isotropic emission from a 
single center; the one event with energy between 100 
and 1000 Bev does not have an obvious break in the 
angular distribution but might be explained as emission 
from two centers. 

In considering the energy of secondary particles as a 
function of their angle of emission, we have examined 
the predictions of the two-center model under the 
assumption that the momentum spectrum of emitted 
particles is isotropic in each of the two systems. This 
model predicts the apparent constancy of transverse 
momentum, but makes the further prediction that the 
average transverse momentum should be lower, not 
only in forward and backward directions, but also 
around 90° in the C-system. The model also places 
definite restrictions on the values of energy and emis- 
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sion angle which the majority of the particles can have. 
These predictions should be susceptible to experimental 
test, particularly in jets with a wide separation between 
cones. The available experimental results from this and 
other published work have been studied and are seen 
to be readily understood in terms of the model, but are 


not yet sufficient to permit a definite conclusion. 
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Heavy Primary Cosmic Rays at Geomagnetic Latitude of 41°N 


O. B. Younc AND H. Y. CHEN 
Southern Illinois University, Carbondale, Illinois 
(Received April 21, 1959) 


This report includes the results from 9 balloon flights at geomagnetic latitude 41°N of altitude range from 
70000 to 100000 feet. Only primaries of Z 210 are considered. 2410 tracks are involved, in Ilford G-5 and G-0 
emulsion exposures. Given are the charge spectra, flux, mean free paths, and angular distributions. 


INTRODUCTION 
HIS paper represents an expansion of an investi- 
gation previously published.! The results are now 
based on an increase in the number of heavy nuclear 


tracks from 1082 to 2021. The emulsions containing 
these tracks were exposed at altitudes above 90 000 feet. 
Also herein included are the results of an additional 
389 tracks in emulsions exposed in the 67 000-foot to 


TABLE I. Percent of each Z for each plate. 


Plate number 


1423 3200 


36.2 
14.7 
8.6 


Nuclear charge Z HHNII 


10 21:5 
11 13.6 
12 16.6 
13 905 
14 
15 
16 
17 
18 


2601 4001 
20.2 
12.0 
19.5 
11.6 
5.6 
5.6 
3.4 
22 
2.6 
3.0 
2:2 
1.9 
3.0 
3.0 
2.2 
0.4 
1.5 


267 


2588 


16.4 
12.6 
11.4 
13.9 
11.4 
3 


37 Aand B 


18.8 
12.4 
16.1 


LHNII 
26.8 
14.2 
18.6 

9.1 
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No. of tracks 

Flux, Zo: multiply by 
10 part./cm? 
sec sterad 


Altitude (g/cm?) 
Peaking angle 


2.41+0.46 
32.40 
45°+5° 


2.50+0.80 
52.80 
45°+5° 


2.57+0.41 
7.70 


55°+5 


2.50+0.30 
15.43 


57°35 


2.40+0.40 
15.40 
50°+5° 


2.40+0.41 
11.43 
48°+5° 


2.534+0.40 
17.46 
54°+5 


2.70+0.40 
7.74 
60°+5 


2.66+0.31 
9.98 
54°+5° 


10. B. Young and F. E. Harvey, Phys. Rev. 109, 529 (1958). 
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TABLE II. Flight and plate data. 


Flux value, Jo Attenuation Area Time of ad . 
No. of (10™¢ part./cm? MFP(A) Altitude scanned flight const. Thickness Type of 
Plate tracks sec sterad) (g/cm?) (g/cm?) Date (cm?) (hours) a b (microns) _ plate 


HHNII 753 2.664031 — 21.4340.67 





09.98 February 11, 1956 97.46 8.4 0.082 0.2 600 G-5 
267 =. 2.70 4-0.40 24.80+1.0 07.74 June 3, 1955 21.70 9.0 0.082 0.23 600 G-5 
362 -2.57+0.41 21.29+0.95 07.70 June 3, 1955 33.44 9.3 0.06 0.18 1000 G-5 

79 = 240+0.41 24 11.43. November 18, 1954 23.45 6.2 0.065 0.2 600 G-0, G-5 

200, 400 

168 =2.50+0.30 15.43. May 8, 1952 32.14 : 0.075 0. 600 z-s 
258 2.40+0.40 15.40 October 7, 1954 60.00 ‘a 0.081 0. 400 - 
134 =2.5340.40 17.41 February 5, 1954 34.70 7.0 0.057 0. 600 3- 
157 2.41+0.46 23.39+0.91 32.40 February 11, 1956 106.13 8.0 0.085 0. 600 -§ 
232 =2.50+0.80 24 52.80 January 24, 1955 280.0 8.0 0.072 0. 600 


Average J (without No. 2601) =2.53+0.44X 10™ particles/cm? sec sterad (incomplete flight curve) 
Average Jo (with No. 2601) =2.52+0.43 X 10™ particles/cm? sec sterad 





73 000-foot altitude range. In the previous paper con- 
Picte No. No.of Trocks Alt & _Locefion siderable attention was given to plate HHNII. One 
HHNII 753 90 000-100 O00ft pic nage “a rs = s alhiale cain 

260! 134 41° N. Lat. (Texas) section containing 176 tracks gave results which were 
real a lower in attenuation mean free path than the figures 


a 4 from other plates. However, after HHNII was com- 
362 pletely scanned, yielding 753 tracks, the results were 


raised from 19+0.7 g/cm? to 21.43+0.67 g/cm’. 
EXPERIMENTAL METHOD 
The experimental method has remained essentially 
unchanged. Angular distribution measurements have 
been added. While angles have been measured to 
within one degree, they are plotted more roughly, as the 
smaller divisions are not generally needed. 
RESULTS 
The results are given in Tables I and I and Figs. 1-5. 
Flux, Jo, has been obtained by extrapolation to the 
top of the atmosphere. The various values agree well, 
regardless of the altitudes of the balloon flights. 
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Plate - 37 

Altitude - 07.70 gm/cr® 

Date - June 3, 1955 

Area Scanned - 33.44 cné 

5.0 G.M._ Lat. and Location- 4/° N, Texas 
.OF Eff. Flight Time - 9,3 hr 

4.0 Emulsion, Type and Thickness - G5, 1000 p 
“ No. of Tracks - 362 


Fic. 1. Charge spectrum for 2021 tracks. 





Piote No. No of Tracks Alt. & Location J 
LHNII 157 67 000-79 OOOft 
3200 232 41° N. Lat.(Texas) 

= 4 5 Flux (top of atmos , Z 2 10): 2.572.41 
389 
I 0.06 
b=.18 
MFP = 21,294.95 gm/cm* 
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Fic. 2. Charge spectrum for 389 tracks. Fic. 3. Flux for plate 37. 





HEAVY PRIMARY 





Plates: 37A8B 
Altitude (gnanth 770 15.43 15.40 11.43 1746 


Plates: 400! HHNIlI 
Altitude (gnven? 7.74 9.98 


Total No. of Tracks: 2021 
Geomagnetic Lotitude: 41° N (Texas) 


1423 1480 2588 260! 
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Fic. 4. Angular distribution for 2021 tracks. 


The charge spectrum is given for each plate in 
Table II. The results are combined for all plates 


exposed at about the same altitude in Figs. 1 and 2. 
For the data contained in Fig. 1 the altitudes of exposure 
were in the 90 000-foot to the 104 000-foot range. For 
the other, the altitudes were lower, being in the 67 000- 
foot to 79 000-foot range. The relative abundance in 
percent is given for each Z. However, a reader must 
observe that the lower the altitude of exposure, the 
less reliable the abundance ratios become, owing to 
the more rapid absorption of the heavier elements by 
the atmosphere and to the heavy nucleus breakups 
which have fragments large enough to be included 
among nuclei of Z 210. This effect is clearly shown by 
comparison of the two charge spectra. 

The angular distributions for each plate are given. 
The peaking angle, measured from the zenith, for each 
plate is listed in Table II. Absorption by the atmosphere 
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Fic. 5. Angular distribution for 389 tracks. 


causes the peaking angle to become smaller as the 
altitude decreases. 

The attenuation mean free path, A, has been deter- 
mined separately for six of the nine plates listed, with 
the results listed in Table I. These six values average 
23.1 g/cm. The consistency is within about 6% from 
the mean. 
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Interference Effects of the Retardation Term in Pion Photoproduction* 


A. M. WETHERELLTT 
California Institute of Technology, Pasadena, California « 


(Received April 13, 1959) 


It is shown that the difference in behavior of the high-energy (>450 Mev c.m.) total photoproduction 
cross sections for x* and x can be explained by the presence of the retardation term in the case of the x* 
production. The analogy with the behavior at the (3, $) resonance is noted. The discussion naturally pro- 
vides an explanation for the difference in center-of-mass energies of the lower high-energy peak found in 
x -p scattering and the corresponding peak in the r+ photoproduction. It is felt that the discussion con- 


tributes some evidence for the resonance nature of the peaks. 


I. INTRODUCTION 


ECENT experiments on the photoproduction of 
single charged and neutral pions from hydrogen! 
have indicated the presence of a resonant-like increase 
in the total cross sections in the photon energy region 
of 700-800 Mev. This increase in cross section has been 
ascribed to the existence of a new isobaric state of the 
proton’ characterized by isotopic spin 3, angular mo- 
mentum 3, and a center-of-mass energy of about 600 
Mev. Since this suggestion was advanced, the spin and 
parity of the state have been under discussion’ and 
methods for determining these quantum numbers have 
been proposed.‘ Experiments on the polarization of the 
recoil protons from r° photoproduction® have lately 
indicated that the state may, possibly be specified as Dj. 
Negative pion-proton scattering experiments® have 
exhibited two high-energy peaks in the total scattering 
cross section. The lower of these is to be associated with 
those found in the photoproduction cross sections, 
although it has been noted, by Burrowes ef al.,® that 
the c.m. energy at the scattering peak is about 75 Mev 
higher than the corresponding energy at which the 
m+ photoproduction peak is observed by Dixon and 
Walker.! 

A striking feature of the high-energy maxima in the 
m* and r° photo cross sections is their difference in 
shape. The r+ peak, at about 700 Mev, is sharp and 

* This work was supported in part by the U. S. Atomic Energy 
Commission. 

+t Commonwealth Fund Fellow. 

t Now at CERN, Geneva, Switzerland. 
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falls steeply on the high-energy side, while the ° cross 
section shows a broad and smooth increase with a 
maximum apparently somewhat above 700 Mev. This 
difference has been locally considered to be due to 
interference between the resonant 7=} state and a 
T= state with the same spin and parity. Such an 
interference between a resonant T=} state and a 
resonant T7=3 state, at a higher energy, has been in- 
voked in a discussion of pion pair production.’ 

The purpose of this note is to present an alternative, 
and perhaps more satisfactory explanation for the 
difference in behavior of the + and 7° photoproduction 
cross sections and, also to account for the discrepancy 
in c.m. energies of the r+ photopeak and the z~ proton 
scattering peak. The procedure used to achieve these 
aims is felt to provide some additional evidence for the 
resonance character of the T=} state. 

A natural explanation of the effects, without arbi- 
trarily introducing extra states, may be given by the 
interference between the T= } state and the retardation 
part of the photon-meson current interaction, which is 
present for r+ photoproduction, but not, in the same 
order, for x° production. The importance of including 
the retardation term in the analysis of charged-pion 
photoprocesses has been emphasized by Moravcsik.® 
The Born approximation to the term has the same form 
in all the theories proposed and its effects have been 
observed recently in experiments at around 300 Mev.’ 

So far the qualitative analysis of the high-energy 
data? has ignored contributions from the retardation 
term. Inclusion of the term, which contains states of all 
angular momenta, leads to considerable complication in 
the analysis of the r+ photoproduction differential cross 


sections. The complication is caused by the interference 
of the states contained in the retardation term with all 
others present and the result is that it is not yet possible 
to make a firm statement about the states involved 


from the x* differential cross sections alone. A discussion 
of the total cross sections is, however, simpler as inter- 


7A. B. Clegg, see reference 3. 

8M. J. Moravesik, Phys. Rev. 104, 1451 (1956). 
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ference terms between states of different spins and 
parities do not contribute. 

The next section discusses an interference effect in 
the total pion photoproduction cross section at the 
(3, $) resonance. The point brought out has been re- 
marked by Chew,” but it was thought useful to show 
the magnitude of the effect using a specific theory to 
calculate the magnitude and to exhibit the similarity 
of the behavior with that found at the second resonance. 

Section (3) describes a simple resonance model deter- 
mined by reference to pion proton scattering data. The 
model is used to compute the contribution of the re- 
tardation-resonance interference to the m* cross section 
at the second resonance. 

The last section compares the result of the calculation 
with the experimental data and discusses the signifi- 
cance of the result. 


II. THE (3, }) RESONANCE 


A study of the r+ and 7° total photoproduction cross 
sections at the (3, 3) resonance shows a small relative 
displacement between the maxima, of the order of 
25 Mev. It will be shown that the difference is caused 
by interference between the resonance and the retarda- 
tion term in the r+ photoproduction. 

In order to attempt to compare the total cross sections 
associated with the low-energy resonance it is necessary 
to subtract out from the w* cross section the contribu- 
tions due to the nonresonant states present. This has 
been done by calculating the sum of the contributions 
due to the S; state, the retardation term and their 
mutual interference using the relevant terms contained 
in the complete photoproduction amplitude to order 
1/M given by Chew ef al.'! This amplitude is obtained 
from an application of the dispersion theory and con- 
tains an attempt to include rescattering and recoil 
processes together with the dominant resonant ampli- 
tude. In the evaluation the coupling constant f? was 
set equal to 0.081, the S-wave phase shifts used were 
those suggested by Puppi,'? and the small amplitude 
F® was multiplied by (1+w/M)~'. Nonresonant 
states other than those mentioned above were neglected. 

The result of the subtraction is shown in Fig. 1 in 
which the experimental data are compiled from several 
sources.’ Also shown is the r° total photoproduction 
cross section scaled down by an isotopic spin weight 
equal to 2. It should be noted that the r° cross section 
contains small contributions from rescattering and 

0G. F. Chew, Phys. Rev. 95, 1669 (1954); G. F. Chew and 
F. E. Low, Phys. Rev. 101, 1579 (1956). 
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Fic. 1. Total 7+ photoproduction cross section and total 7° 
photoproduction cross section, the latter scaled down by a factor 
of 2. Curve (a) is the result of subtracting the s wave and re 
tardation contributions from the r* total cross section. 


recoil which have not been subtracted. In the form 
shown in Fig. 1 the shift of the e+ maximum relative to 
the r° is quite marked. 

The contribution due to the interference between the 
retardation and (3, $) terms was computed from the 
dispersion theory amplitudes taking the resonance 
energy «,= 2.0 (pion mass units) and using (3, $) phase 
shifts calculated from the effective-range formula of 
Chew." 

The major effect of the interference term, as noted 
by Chew,” arises from the resonant part of the ampli- 
tude and involves cosé33 sinds3; which changes sign on 
going through resonance. Apart from this there are 
smaller effects due to rescattering and small magnetic 
moment Born terms contained in the complete ampli- 
tude. The rescattering terms arise as corrections to the 
renormalized Born approximation to the meson current 
term. The effect of these contributions is to shift the 
point at which the interference changes sign to an 
energy slightly lower than the resonance energy. 

The shape of the interference contribution is shown 
in Fig. 2 together with points taken from the difference 
between the curves indicated in Fig. 1. The general 
agreement is good and clearly indicates the origin of 
the relative shift in the x*, 2° peaks as being due to the 
interference term. 


III. THE SECOND RESONANCE 
(a) Scattering Peak 


The maxima in the r~ proton scattering cross section 
were given by Burrowes e/ al.° as occurring at 615+40 
and 775440 Mev c.m. The maximum in the a* total 
photoproduction cross section, according to Dixon and 
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Fic. 2. Contribution to total x* photoproduction cross section due 
to interference between resonance and retardation term. 


Walker,’ occurs at about 540 Mev c.m. It is rather 
difficult to place the peak in the x° photoproduction 
cross section, but the subtraction of a resonant-like tail 
due to the low-energy resonance, as performed by 
Vette,! seems to indicate that the maximum due to 
contributions other than the (3, 3) state lies at about 
590 Mev. This agrees, within the uncertainties involved, 
with the position of the lower peak in the scattering 
experiments, but the 2* position is low. Photon energies 
high enough to explore the region of the 775 Mev c.m. 
maximum have not yet been attained, but the 1000-Mev 
photon energy (720 Mev c.m.) m* angular distribution 
of Dixon and Walker' indicates the presence of a 
strong high-angular-momentum wave which is probably 
associated with the large scattering cross section ob- 
served at 775 Mev c.m. The change in character of the 


mr angular distributions between 785 and 940 Mev, as 
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Fic. 3. Form of resonance used in discussion; it is normalized to 
the peak of the curve given by Burrowes ef al. 


observed by Vette,! may also be evidence for the 
presence of another state. 

In order to discuss phenomena connected with the 
615-Mev c.m. peak in the scattering cross section, 
a simple single-level resonance type of description'*” 
will be used and the state concerned specified as D}. 
The procedure for constructing the resonance is set out 
in the following formulas. 

For a state with j=3%, the total scattering cross 
section is given by 

|e” sind |? 
o,=8r|——_ , (1) 
lq 


where 6 is the phase shift and g the c.m. momentum. 
A resonant phase shift may be characterized by’® 


5. 
tané = ———_, (2) 
W,—W 


where W is the total c.m. energy and W, is the resonance 
energy. I’ is the total resonance width and is given by 


I= 2(qga)ond; (3) 


here a is the channel radius, A is the reduced width, 
and v is the penetration factor. For a state with /=2, 
v is given by’® 


v= (ga)'/[9+3(qa)*+ (ga)*]. (4) 


Setting W,=615 Mev and using (1) through (4), the 
parameters a and \ were chosen to give a total width 
at half height of about 200 Mev, as required by the 
T= cross section shown in Fig. 2 of the Letter of 
Burrowes ef al.° The values for a and A used were, a= 1 
pion Compton wavelength, A=0.19 (pion mass). The 
maximum elastic scattering cross section observed by 
Crittenden ef al.6 was about 30 mb compared with the 
absolute maximum of 44 mb for a j= 3 state at a c.m. 
energy of 615 Mev. This indicates absorption of the 
j= wave, and the resonance calculated has been 
fitted to a maximum absolute value of about 30 mb. 
The result is shown in Fig. 3 with the curve drawn in 
Fig. 2 of Burrowes et al.® 


(b) Interference With the Retardation Term 
in Photoproduction 


The considerations of the last subsection are con- 
sidered to have determined the Dy elastic scattering 
amplitude in terms of the phase shift, given by Eq. (2), 
and the adjusted parameters a and X. 

A resonant £1, D; photo matrix element, involving 
the same scattering amplitude, may be written down 


4K. A. Brueckner, Phys. Rev. 86, 106 (1952); M. Gell-Mann 
and K. M. Watson, Annual Review of Nuclear Science (Annual 
Reviews, Inc., Palo Alto, 1954), Vol. 4. 

's Feshbach, Peaslee, and Weisskopf, Phys. Rev. 71, 145 (1947). 
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in the form 
e” sind 
IM = —1efC— 
q° 


(5) 


(30-qq:e—@- eq"). 
¢ 


The amplitude term has been formed with reference to 
Watson’s remarks'® on final-state interactions and the 
angular part has been taken from the general form given 
by Chew ef al."' C is considered a constant which is 
adjusted to give a fit to the contribution of the £1, 
D, state to the total x photoproduction cross section 
and e and f have been included explicitly for con- 
venience. 

The extraction of the £1, D; contribution from the 
experimental data has been done in a rather arbitrary 
way. The tail of the (3, $) cross section was calculated 
from the dispersion amplitude, using experimental 633 
phase shifts given by Blevins ef al.!’ The result was 
subtracted from the experimental cross section and the 
remainder considered to be due mainly to the D, 
resonant state. Specifically, the maximum Dy cross 
section was taken to be about 70% of the maximum 
remaining after subtraction, the other 30% being pre- 
sumably due to small contributions from rescattering 
processes induced by the nonresonant S wave and 
retardation interactions, and higher partial waves. The 
low-energy side of the high angular momentum state 
indicated by the data of Dixon and Walker! and Vette! 
probably contributes substantially at the upper end of 
the range of measurements. In the absence of further 
information the above procedure was followed and C 
obtained by comparing 


q WV \—* 
1-( )(+—) f mie, 
k M 


with the estimated D, cross sections. 

In (6) k is the c.m. photon momentum and W the 
total c.m. energy. 

Figure 4 shows the shape of the 2° resonance calcu- 
lated from Eq. (6) with the subtracted cross section. 
The position of the calculated 7° maximum is consistent 
with the experimental points. 

The Born approximation to the retardation part of 
the m* meson current matrix element is 


(6) 


o:-(k—q)q:e 
R=v2ief aa 
gok(1—B cos) 


(7) 


The contribution to the total r+ photoproduction cross 
section due to the interference of the resonance and 
retardation matrix elements is calculated from the 
cross term between V2 and R. 

Explicitly it is given by the expression 


(cosé sind) Wr? /q 
3= 4e? P?C— (1+ ) ( )a. 
: M k 


q° 
‘6K. M. Watson, Phys. Rev. 88, 1163 (1952). 
'7 Blevins, Block, and Leitner, Phys. Rev. 112, 1287 (1958). 


(8) 


PTON ) ‘PHOTOPRODUCTION 





4 Mc Donald et ol. ref, (13) 
© Vette ref, (I) 


x Worlock C.1.T. Thesis 











Ki ab 
Fic. 4. High-energy total x® photoproduction cross section. 
Curve (a)—calculated tail of (3, 3) resonance; (b)—difference 
between experimental curve and curve (a); (c)—Dy resonance 
calculated from scattering amplitude, maximum normalized to 
70% of maximum of (b). 


The result of the angular integration, A, is given by 


17k? 
A = 2wy*k*282(1+7) —3]—2— (1-6) 


1-s 
X (3— 2687) inf — ), (9) 
1+, 


where B=g/qo, Y=qo/k and qo is the c.m. pion total 
energy. As 9 contains cosé siné, the term changes sign 
at resonance and reaches a maximum around 6=45°. 
The curve calculated from (8) and (9) is shown in 
Fig. 5. Also shown is the sum of 9 and 2X the 7 
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Fic. 5. Estimated contributions to r* total photoproduction 
cross section. Curve (a)—calculated interference contribution ; 
(b)—calculated r® resonance; (c)—sum of (a) and 2X (b). 
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Fic. 6. Estimated x* total photoproduction cross section given 
by curve (c) of Fig. 5 added to a “background”’ cross section 
discussed in the text. 


photoproduction cross section calculated from (6). The 
maximum in the sum is seen to be about 50 Mev 
below the maximum in the 7° cross section and also the 
peak falls off sharply on the high-energy side due to 
the change in sign and rapid variation of the inter- 
ference term in the cross section. 

In order to compare the result of this discussion 
directly with the experimental data, a guess must be 
made as to the cross section due to other processes. 
Figure 6 shows the result of adding on a ‘‘background” 
cross section composed of an essentially energy-inde- 
pendent contribution, a resonance tail from the (3, 3) 
state and a tail from the 775-Mev c.m. resonance. 


IV. DISCUSSION AND CONCLUSIONS 


The shape of the estimated #+ photoproduction cross 
section, shown in Fig. 6 is in good agreement with the 
experimental data. There is a hint that the estimated 
peak is a little too high in energy and this could be 
remedied by choosing the resonance energy a little less, 
say 600 Mev, than the 615 Mev taken from the scat- 
tering data. On the other hand, other effects could 
cause a further small shift in the peak and such effects 
due, for example, to rescattering processes were re- 
marked in the case of the (3, $) resonance. Another way 
of saying this is that the Born approximation to the 
retardation term is inadequate. It is certainly clear, 
however, that the difference in c.m. energy between 
the maximum in the + photoproduction cross section 
and that in the #~-proton scattering cross section can 
be essentially explained by the present discussion. Of 


course, the absolute value of the resonance and inter- 
ference cross sections have been determined in a rather 
arbitrary way, but the general shape of their contribu- 
tion is not too sensitive to variation of the normalizing 
constant C. Another arbitrary procedure was the addi- 
tion of the background cross section shown in Fig. 6. 
Apart from its absolute magnitude, the main assump- 
tion was that it did not vary rapidly with energy and, 
therefore, did not distort the general shape of the peak. 
This assumption is probably not unreasonable. 

It is easy to verify'’ that the present analysis leads to 
consistency with the sign of the polarization found in 
the recent x° experiment.® According to Sakurai,‘ the 
polarization is supposed to be due to interference be- 
tween the D, state and the tail of the (3, 3) resonance. 
Taking the phase of the P; amplitude to be moving 
from 90° to 180° and using the D; phase assignment of 
the present discussion, one finds that the polarization 
of the recoil nucleon should be positive in the sense of 
the direction (qxk). The measurement of Stein® shows 
that this is the case. The correlation between the sign 
of the shift of the r+ photopeak and the polarization of 
the recoil proton in r° production provides useful in- 
formation for determining the possible states present 
and should also be applicable to the case of the ‘‘third”’ 
resonance when experimental data is available. 

Finally, the present discussion brings out the im- 
portant point that the difference in c.m. energies of 
7 ~-proton scattering peak and the 2+ photoproduction 
peak, together with the singular shape of the latter, 
can be explained in terms of a state characterized by a 
single resonant phase shift and the presence of the 
charge dependent meson current. Insofar as this is 
true, the argument can be said to contribute some 
evidence that the rise in the pion photoproduction and 
scattering cross sections at around 600 Mev c.m. is 
caused by a state with a phase shift which passes 
through 90°. 


ACKNOWLEDGMENTS 


I am indebted to Professor R. L. Walker for sug- 
gesting an investigation of the total photoproduction 
cross sections and for his constant help and encourage- 
ment. Discussions with Professor M. Gell-Mann have 
been much appreciated. I should also like to thank 
Professor R. F. Bacher for his interest and hospitality. 

I am grateful to the Commonwealth Fund of New 
York for a grant. 


'8 T am indebted to Professor M. Gell-Mann for this remark and 
to Professor R. L. Walker for an independent check on signs. 





PHYSICAL REVIEW VOLUME 


23, 


NUMBER 6 SEPTEMBER 15, 1959 


K*‘-Deuteron Scattering in the Impulse Approximation* 


ErasMO M. FERREIRAT 
Imperial College of Science and Technology, London, Iingland 
(Received September 17, 1958; revised manuscript received March 25, 1959) 


It is suggested that a phenomenological analysis using the impulse approximation of the processes 
occurring in the scattering of K*+ mesons by deuterons may be used to get the phase-shifts for the 7=0 
isotopic spin state. Typical curves are given for the elastic, elastic plus inelastic and charge exchange scat 
tering differential cross sections of 100-Mev K+ mesons by deuterons on the assumption that only S-waves 
contribute and for various ratios of the T=1 and T=0 isotopic spin states phase shifts. 


INTRODUCTION 


LL existing experimental results! which give in- 

formation on K+ meson-nucleon scattering are 
consistent with an isotropic and energy-independent 
differential cross section at energies between 40 and 
200 Mev. These results come both from pure hydrogen 
bubble chamber and from nuclear emulsions experi- 
ments. 

The pure A+ meson-proton processes, as observed in 
hydrogen bubble chambers, can only give information 
on the amplitude for the scattering in the T= 1 isotopic 
spin state. At the energy of 100 Mev a negative S-wave 
phase shift siné,=—0.37 (with all the other phase 
shifts equal to zero) fits well the observed repulsive 
potential at low energies and the isotropy in the 
scattering, the total cross section being of 4m sin*6,/k? 
= 16 millibarns. 

The direct and charge exchange scattering of K* 
mesons by neutrons observed on nuclear emulsion 
material depend on the amplitude a» for scattering in 
the T=0 state, as well as on the amplitude a, for 
scattering in the 7=1 isotopic spin state. Assuming 
charge independence, in a material with a number of 
protons equal to f times the number of neutrons, the 
rate of charge exchange to total (non-Coulombian) 
scattering is R= (a,;—do)?/[2(a;2+ ao?) +4 fa,? ]. If there 
is no interaction in the state T=0, that is, if ag=0, we 
get R=1/6 for f=1 and R=1/7 for f=5/4. As the 
experimentally observed ratio is consistent with these 
values of R, it has been suggested? that at low energies 
only the T=1 isotopic spin state is responsible for the 
scattering. 

These processes observed in nuclear emulsion give 
the only available information on A+ meson-neutron 
processes and on the contribution of the 7=0 state to 
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1 


~ 


/ 


7 


< 


the A* meson-nucleon interactions. However the com- 
plexity of the emulsion material makes the analysis of 
the experimental results rather difficult. Deuterium 
bubble chambers and diffusion cloud chambers now in 
use provide a more convenient sample material con- 
taining neutrons. 

The impulse approximation’ has been used with suc- 
cess by Fernbach, Green, and Watson‘ and by Rock- 
more® to express the differential cross sections for 
scattering of pions by deuterons in terms of the ampli- 
tudes of elementary processes of pions with protons and 
neutrons. In Rockmore’s formulas the cross sections for 
elastic, elastic plus inelastic, and charge-exchange 
scatterings of the m-mesons by deuterons are expressed 
in terms of the S- and P-wave phase shifts for scattering 
in the charge independent T=3/2 and T=1/2 states 
of a pion-nucleon system. Comparison of Rockmore’s 
results with experiment® has shown the reliability of 
the application of the impulse approximation to his 
problem. 

We now then suggest that a calculation in impulse 
approximation similar to that of Rockmore, and experi- 
ments on the scattering of a K*+-meson beam by the 
deuterium of either a bubble or a diffusion chamber, be 
used together to get information on the phase shifts 
for the T=0 isotopic spin state of the A*+ meson- 
nucleon system. 

In this paper we present the formulas, which are 
analogous to those of Rockmore, giving the differential 
cross sections for elastic, elastic plus inelastic, and 
charge exchange scattering of K+ mesons by deuterons. 
In particular we present the curves for these differential 
cross sections for K+ mesons of 100-Mev energy in the 
cases of T= 1, S wave only contributing to the scattering 
and of T=0, S-wave adding a small contribution to the 
above (two different signs of the phase shift are con- 
sidered). 

We have suggested explicitly the work with the 
positive K meson for the obvious reason of the greater 
simplicity (as compared with the negative K) of its 

3G. F. Chew and M. L. Goldberger, Phys. Rev. 87, 778 (1952). 

4 Fernbach, Green, and Watson, Phys. Rev. 84, 1084 (1951). 

5 R. M. Rockmore, Phys. Rev. 105, 256 (1957). Typographical 
errors in Rockmore’s formulas were later corrected: see R. Rock 
more, Phys. Rev. 113, 1696(E) (1959). 
pt C. Rogers and L. M. Lederman, Phys. Rev. 105, 247 
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Fic. 1. Differential cross section for elastic scattering of 100- 
Mev K* mesons by deuterons. Only S waves are assumed to 
contribute. Curve A corresponds to scattering purely in the 7=1 
isotopic spin states. Curves B and C include smaller contributions 
from the 7=0 state. 


interaction with nucleons. The absence of absorption 
processes greatly simplifies the phenomenological analy- 
sis, and in this respect, the At interaction is even 
simpler than that of the x mesons. 

The multiple-scattering and potential corrections to 
the impulse approximation calculation have been shown 
by Rockmore’ in the pion case to give contributions of 
opposite signs to the cross sections, so that even if they 
cannot separately be neglected, their added effect 
seems not to alter appreciably the values calculated in 
the pure impulse approximation. In our case of scat- 
tering of K mesons the estimation of these corrections 
by the methods used in the pion case would necessarily 
be unconvincing. In fact the knowledge of the inter- 
actions of K mesons and nucleons is not good enough to 
provide us with a field-theoretic method for the calcu- 
lation of the effects of binding and multiple scattering. 
On the other hand, the Brueckner model, which assumes 
fixed (infinitely heavy) nucleons, cannot be used in the 
K-meson case due to the large mass of these particles. 

However, we can make some estimates and present 
some qualitative arguments which show that we can 
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rely on the applicability of the impulse approximation 
to this problem. We expect the deuteron binding effects 
(in other words, the nucleon-nucleon interaction in the 
final state) to be smaller in the scattering of K mesons 
than they were in the scattering of pions of about the 
same energy (few hundred Mev). This is because the 
potential corrections are larger for smaller nucleon 
recoil energies, and these tend to be larger in a collision 
by a heavier particle. 

As to the multiple-scattering effects, a simple esti- 
mation can be made by considering the value of the 
amplitude of the wave scattered by one nucleon, 
evaluated at the average position of the other nucleon.’ 
In the case of scattering of 100-Mev K mesons we thus 
estimate (assuming a A-nucleon cross section of 20 
millibarns and the usual value for the radius of deu- 
teron) that the error due to neglect of multiple scatter- 
ing will be probably less than 10%. 

On the other hand, the interaction time of a Kt- 
nucleon system is certainly shorter than that of nucleon- 
nucleonic or pion-nucleonic systems (which are of the 
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Fic. 2. Differential cross section for elastic plus inelastic scat 
tering of 100-Mev K* mesons by deuterons. Curve A corresponds 
to scattering of S waves only in the 7=1 isotopic spin state. 
Curves B and C admit contributions from the S waves in the 
T=0 state. 


7G. F. Chew and G. C. Wiek, Phys. Rev. 85, 636 (1952). 
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Fic. 3. Differential cross section for charge exchange scattering 
of 100-Mev AK* mesons by deuterons. Curve A corresponds to 
scattering only in 7=1 state. Curves B and C are calculated with 
small contributions from the S waves of the 7=0 state. 


order of #/m,c*) and can be neglected compared to the 
much longer time involved in the nucleon motion inside 
the deuteron. So the neglect of the effects of the forces 
between the two nucleons during the ‘“‘collision time” 
(this is the properly called impulse approximation) is 
well justified here, as it was in the nucleon-deuteron and 
pion-deuteron cases (see reference 7). 

Summing up, we estimate the main error, due to 
multiple scattering, to be less than 10%. Other effects 
should not change this error by more than a few 
percent. 


FORMULAS AND CURVES 


Our notation is the same as that of Fernbach* and 
Rockmore.’ Phase shifts are indicated in a manner 
similar to that introduced by Fermi, the first index 
being either 1 or 0 according to whether the isotopic 
spin state is 7=1 or T=0. The second index is omitted 
for S waves, and for P waves is either 3 or 1 according 
to whether the total angular momentum is 3/2 or 1/2. 

We define 


Jo= [ 5( Es ko)q°dq 


Ef 


the kinematical factors being those of a collision of a 
K meson with a nucleon, and 


Ji _ f 6( E;/- Ey) q°dq, 
Ej’ 


where the kinematics is that of a collision of a K meson 
with a deuteron. 

The curves are traced for 100-Mev energy K mesons 
and for the following set of phase shifts. Curves labeled 
by A correspond to siné,;= —0.37, all the other phase 
shifts being set equal to zero (only T7=1, S-waves con- 
tributing to the scattering). They correspond to a ratio 
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R of the charge exchange to the total number of 
processes equal to 1/6 (for f=1). 

Curves labeled by B correspond to siné;=—0.37, 
sindy= —0.1, and in this case we have R= 1/5. For the 
curves C we have siné;=—0.37, sind>=+0.1, and 
R=1/9. 


The differential cross section for elastic scattering is 
do E MoS oH (6) {| Qa 


= - oma — 
dQ 4k*wWy cab) v sin? (6, 2) 


| 2 
+ (63+ 2nos+3ni11+701) COS | 


+3 sin’ | 3m3+n03— 3m 1 No1 . 


For elastic plus inelastic scattering we have 
da E+! JoM qo | 


= ‘| oh 21311) COSAy 
k*wW nad) 


a 
-—— —_ +sin4o| 3-11)? 
20 sin?(o/ 2) | 


+4 m+ not (2m3+2n03+n1+101) cos6, | . 


+4 sin’69|3+n0s— 1-01 | 


H: Jo! Mqo 


= (ann 
Sktw IWrwaavy | 


a 
v sin?o/2 


1/2 a 


+ c0s6o(6n13+ 2no3+: Smut am)) — 
" 


a 


Pies 2 + cos6o(2m13— 
v sin? (Oo 2) 


2nostm jag 


+43 sin”, 3mist+no3s— 3m11— N01 2 


—$ sin’6o|m13—n03—mi+M01|? 


and for the charge-exchange differential cross section 
do©— MqvJo 
(1 — HH») ni No 
dQ ARN 4 IW wad) 
+ (min it2ni—- 2n03) CosOy | 2 
ms3t+nos| 2 | 
“+ 4 ( 1 +H») sin6y 117 Nor 


2 sin*6o|m11— 01 — 
mi3tNos | 1. 


Note added in proof.—While the present work was in 
the process of publication, an equivalent paper by M. 
Gourdin and A. Martin appeared in Nuovo cimento XI], 
670 (1959), 
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Strange Particle Production by Bevatron Neutrons in Propane* 
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A liquid propane bubble chamber was exposed to a beam of neutrons with energies up to 6 Bev from the 
Bevatron. 10 000 pictures of interactions in the hydrocarbon were scanned to detect neutral heavy unstable 
particles. 349 neutral V-events were found, most of which came from the stainless steel walls of the chamber. 
86% of these events could be identified as one or the other or either of the neutral strange particles: A° or 
0,°. The A®/6,;° ratio is about 0.6. 

8200 stars of 2 or more prongs formed by neutrons interacting in the liquid propane were observed in the 
chamber and 17 of these produced V°’s. An additional 5 V°’s were formed in single-prong events produced 
by neutrons, and 8 others were produced in events in the propane caused by charged particles. 

The energy spectrum of the incident neutrons was estimated from study of x-meson production inter- 
actions in the hydrogen. The distribution shows that the neutrons had energies up to 6 Bev with a mean 
value of about 4 Bev. For the energy range 1 to 6 Bev, the production of strange particles occurs in about 


1% of all inelastic interactions of neutrons with hydrogen and carbon. 


I. INTRODUCTION 


LTHOUGH neutral strange particles were first 

produced artificially by neutrons hitting complex 
nuclei,' most studies of these V° particles in the collisions 
of elementary particles have been in 2-p interactions. 
For energies up to 3 Bev, V° production by pions 
striking both free protons and complex nuclei** appears 
to be greater than production by nucleons and very 
few cases of nucleon-nucleon production have been 
directly observed.*>.* This paper reports results of a 
production study for neutrons with energies up to 6 Bev 
interacting both with free protons and with the nucleons 
within carbon nuclei. 

Details of the experimental equipment and _ the 
analysis of the V° events are discussed in Sec. II; the 
detection efficiency for V° events in this liquid propane 
bubble chamber is calculated in III. Section IV de- 
scribes the neutron beam and its interactions in the 
hydrocarbon and Sec. V discusses the production of 
neutral strange particles in these interactions. 


II. EXPERIMENTAL EQUIPMENT AND ANALYSIS 


This investigation of the interactions of neutrons 
with energies up to 6 Bev was done with pictures from 
the University of California Radiation Laboratory 
13-inch liquid propane bubble chamber’ in a magnetic 


* Supported in part by the U. S. Atomic Energy Commission. 
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for the degree of Doctor of Philosophy. 
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field of 12.9 kilogauss. The chamber was operated and 
the pictures supplied by the Radiation Laboratory 
group directed by Professor Wilson Powell. The 
chamber was exposed to a beam of neutral particles 
traveling forward from a metal (beryllium or copper) 
target struck by the 6.2-Bev internal proton beam of the 
Bevatron. The beam traveled about 13 feet in the 
magnetic field of the Bevatron quadrant so that charged 
particles were swept out, and the bubble chamber was 
a distance of about 93 feet from the target, so that 
most of the unstable neutral particles, except the long- 
lived 0,°, should have decayed. The beam, assumed to 
be almost purely neutrons with energies up to 6 Bev, 
was collimated so that it would enter the bubble 
chamber through the 1-in. diameter }-in. thick “thin- 
window” section of the 2-in. thick stainless steel wall. 
Some of the beam hit the thick wall producing a back- 
ground of charged particles throughout the chamber, 
and some of the neutral particles entering the chamber 
were secondaries of interactions in the chamber walls. 

The 10000 pictures taken of the bubble chamber 
exposed to the neutron beam were completely scanned 
twice, once by a physicist and once by a nonscientist, 
with all types of interactions being recorded on file 
cards. The possible V°’s found in the scanning numbered 
about 700 and were measured on a 3D reprojector. 336 
pictures were found that had V°’s decaying within the 
propane. Thirteen pictures had two V°’s, with some 
appearing to be “associated production” (the same 
source for both V°’s). The planes of the Vs were 
checked for any “related” events (those with an inter- 
action as apparent source upstream of the V° decay) ; 
about 50 pictures had V°’s that seemed to be related to 
“neutral” stars (events with two or more prongs caused 
by an incoming neutron particle) or the beginning of 
single prongs. Momenta of charged particles were 
determined from the curvature of the tracks in the 
magnetic field. Nevertheless, there were errors in 
momentum measurements of the V° prongs amounting 
to 10 to 15%, due predominantly to multiple scattering 
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in the liquid propane. These errors could be tolerated 
since the search was only for particles fitting the well- 
known decay schemes: 


\® = p+n-+36.9 Mev, 0,°— r++2-+214 Mev. 


The identification of A° and 0,° was made by comparing 
the values of the momenta of the two prongs of the V® 
and the included angle of these prongs as measured on 
the 3D reprojector directly with curves plotted at the 
Radiation Laboratory.* Not all of the 349 possible V° 
events were analyzed because of the length of time 
consumed in each microscope measurement, but prefer- 
ence was given to the V® events that appeared to be 
associated and related. Altogether 126 V°’s were ana- 
lyzed and of these 86% fit either the A° or the 0,° decay 
schemes. The others may have been (a) decays of the 
0,°, (b) “pseudo-V” events produced by neutrons 
hitting loosely bound carbon neutrons to form n+-n — n 
+pt+nr- or n+-n—-n+n+n++7, or (c) events for 
which momentum errors were very large. 

Further analysis revealed 30 V°’s that had sources in 
the propane, but there were no definite associated- 
production pairs. To check for associated production 
events, the lines of flight of the neutral decaying 
particles were calculated by balancing momentum to 
determine if the two Vs in the same picture had a 
common origin. In 12 of the 13 pictures with two V°’s 
this was not the case, and in the 13th both lines of 
flight could possibly have started at the same point 
in the chamber wall. The number of pictures containing 
two unassociated V°’s is that expected from random 
distribution of 336 events through 10000 pictures. 
Possible related events were checked by comparing the 
line of flight calculated from the V° prong momenta 
with the line from the apparent source to the V° decay 
measured on the 3D reprojection screen. The results of 
this analysis were 30 V°’s that were either A° or 0,° that 
were related to events in the propane: 8 from interac- 
tions formed by charged particles, 17 from neutral stars 
with 2 to 7 prongs, and 5 from single-prong neutral 
stars. These are summarized in Table I. 


III. DETECTION EFFICIENCY 


The detection efficiency for V°’s in the chamber is 
taken to be the product of four factors: scanning, 
angular distribution, range, and neutral decay. 


A 


The detection efficiency for the scanners at Yale was 
found assuming that all V°’s had the same probability 
of being detected. Of the 349 Vs, 336 were found by 
the two scanners, 271 by one and 277 by the other, 
indicating a combined efficiency of 95%. The other 13 
events were found in the preliminary scanning of some 
of the pictures at Berkeley. 


8B. Armstrong, University of California Radiation Laboratory 
Report UCRL-3470, 1957 (unpublished), 
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V° decay kinematics fit: 
Source of V® Ao 6,° Either Total 
A. Multiprong stars, 2; 9; > +2" 7 ¢ 
. Multiprong stars, 2; ¢;=0 
’. Multiprong stars, 2; g;= +1 
. Single-prong stars (¢= +1) 
?. Charged events 


F. Total 9 


G. All analyzed V°’s 20 108 


32 
(G-F). Analyzed wall events 23 14 78 


* gi =electronic charge of a prong (+1 or —1). 


B 


To obtain an estimate of the number of V°’s missed 
because the decay plane of the V° was steep, the dis- 
tribution with respect to the angle between the normal 
to the decay plane and the vertical optic axis was plotted 
for 200 V°’s apparently formed in the chamber walls. 
Determination of this angle does not require knowledge 
of the prong momenta, so that this plot includes un- 
identified V’s and some that do not fit A° or 0,° decay 
kinematics. A uniform distribution over all angles is 
expected for an isotropic distribution of paths about 
the beam, as in the case here where the decay distance 
is small compared to the chamber dimensions. The 
observed distribution appears uniform over all angles 
except for a decrease near the vertical decay plane which 
indicates about 10% of the V"’s were missed. 


C 


The effect of V° particle path length can be seen in 
the formula for the probability of observing the charged 
decay mode before the particle escapes from the visible 
region of the chamber’: 


P=exp(—lm,/r:pic) —exp(—lom,/ripic), 


/, and /, are the minimum and maximum distances the 
neutral particle can travel from its production to 
recognizable decay, 7; is its lifetime at rest, m; its mass, 
and p; its momentum, which varies in this experiment 
from 0.1 to 5.0 Bev/c. For this chamber the extremum 
distances for particles produced in the upstream wall 
are 1;=5 cm and /.=25 cm (allowing 5 centimeters for 
identification) and the formula gives a momentum dis- 
tribution in agreement with the measurements made on 
wall-produced V°’s. This distribution was used to weight 
the probabilities found for particles produced within the 
chamber. In this latter case /; approaches zero, but since 
the V° vertex must be distinguished from the vertex of 
the source star, /; is taken as 0.3cm. /, requires a 
complicated averaging over all points inside the visible 
region and over all directions to the bounding planes 
of this region, using an appropriate distribution with 


9D. Gayther and C. Butler, Phil. Mag. 46, 467 (1955). 
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Fic. 1. Star prong distribution, neutron events 
in carbon and hydrogen. 


respect to angle with the incident beam direction. For 
rough approximation with a cosine-squared distribu- 
tion,* /, takes the values 2.5 to 5.0 cm, in agreement with 
the average of 4.9 cm for the 30 V° events produced in 
the chamber. P does not change appreciably within the 
above range of /, and averages to 60% over a momentum 
spectrum typical of the wall-produced V°’s. 


D 


Correcting for neutral decay products, which occur 
in 4/7 of the cases of 6,° decay and § of the A° decays," 
the observed ratio of almost twice as many identified 
@,"’s as A°’s gives the average probability of the charged 
decay made by a V° in this experiment of 50%. 

The product of these four factors is 26% for the 
over-all detection efficiency. 


IV. THE NEUTRON BEAM AND ITS INTERACTIONS 


In order to get an estimate of the neutron flux and 
to determine the ratio of carbon to hydrogen events, 
the stars produced by neutral particles in the 10000 
pictures were counted, using the two sets of scanning 
cards. Selection of events was such that the star prongs 
indicated an incident energy for the neutron of at 
least 0.5 Bev, appreciably above the threshold for 


Eisler, Plano, Samios, Schwartz, and Steinberger, Nuovo 
cimento 5, 1700 (1957). 
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pi-meson production. This count gave 8200 such stars 
with two or more prongs. The distribution with respect 
to prong number, n, is shown in Fig. 1, including esti- 
mates of single and zero prong events. This distribution 
is a decreasing curve from n=3 to n=11, but with 
peaks (note the logarithmic scale of the number of 
events) at all odd values of n. These peaks are con- 
sidered to be inelastic hydrogen events (n-p), for which 
the net charge of outgoing prongs is +1. The estimate 
of single prong (v=1) events was made from a sample 
count which was corrected for elastic n-p events. The 
total number of events shown in Fig. 1 is 12 000 inter- 
actions of neutrons with propane nuclei, producing 
stars with 0 to 13 prongs, with 9300 of these being 
carbon events and 2700 hydrogen events. 

The ratio of carbon to hydrogen interactions may 
also be estimated from the neutron interaction lengths 
in the two materials. For the energy range 0.5 to 6 Bev, 
average values for the interaction cross sections are 220 
and 20 millibarns for carbon and hydrogen, respec- 
tively.” The visible length of the chamber comprises 
9 g/cm? of carbon and 2 g/cm? of hydrogen. Thus for 
each 41 traversals of the chamber by a neutron there 
would be 5 stars in the propane, with a ratio of 4 carbon 
stars to 1 hydrogen star. However, the observed ratio 
is about 3.4 carbon stars for each hydrogen star, 
indicating that there is some error in the chosen cross 
sections and/or the star identification process. Since the 
phase space calculation discussed below indicates a 
lower carbon to hydrogen ratio (1.6) the 9300/2700 
division appears to be a reasonable one. 

The total flux of neutrons with energies above 0.5 Bev 
is found from the above result: 12 000 stars would be 
produced by about 1X 10* neutrons. 

The energy spectrum was found for a sample of the 
stars. The ++ — events were analyzed to obtain the 
momentum resultant of the three outgoing prongs. 
Such events include the n-p interactions (1) n+p — p 
+pt+n, (2) nt+p—nt+ptartt+nr (3) n+p— ptp 
+2~+7°, and the production of three or more 7 mesons. 
Interaction (1), which has been found to occur in about 
é of all cases of ++ — stars," has all outgoing prongs 
charged and thus the momentum resultant of these 
three prongs lies in the direction of the incident neutron 
and represents the incident neutron’s momentum. One- 
sixth of the analyzed ++— events had an angle 
between beam direction and resultant of <4° and were 
considered to be interactions of type (1). The resulting 
energy spectrum is shown in Fig. 2(a). 

Simple phase-space calculations" for pion production 
in nucleon-nucleon interactions were made for pion 
multiplicities up to 6 to determine the fraction of all 


1 W. Hess, Revs. Modern Phys. 30, 368 (1958). 
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Letters 2, 168 (1959). 
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Laboratory Report UCRL-8559, December, 1958 (unpublished). 

4G, Fialho, Phys. Rev. 105, 328 (1957), 
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cases comprised by the single pion production case 
studied. This fraction was used as a correction to the 
energy spectrum of Fig. 2(a) and gives the spectrum 
of Fig. 2(b) for all neutron interactions involving 
m-meson production (the 1% or so of strange particle 
production is neglected). The total of 400 ++ — events 
indicates that in this experiment there should have 
been observed a total of 4000 neutron stars in hydrogen, 
a somewhat larger number than that resulting from the 
estimates given above. It is possible that some of the 
400 ++— events actually occurred at the surface of 
carbon nuclei. 

The above phase-space correction for the total 
neutron spectrum shifts the peak from 3 to 4 Bev in 
agreement with measurements in a similar Bevatron 
beam. This reveals that about 5% of the incident 
neutrons in this experiment were in the energy range 
0.5 to 1.0 Bev [shaded in Fig. 2(b)] and were thus 
incapable of producing strange particles. This reduces 
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the number of possible V° sources in the chamber to 
11 400 neutron stars. 


V. STRANGE PARTICLE PRODUCTION 


The A°/O,° ratio can be determined from results 
given in Table I. Of the V°’s that were analyzed. 64% 
were clearly identified as either A° or 0, with a A°/0,° 
ratio of 0.42; additional less definite cases increase the 
ratio to 0.63. This ratio of less than unity suggests that 
there was considerable V° production by the V+N — N 
+N+K-+K reaction, for which the threshold is 2.4 Bev 
for free N+N and 1.7 Bev for bound V+.V, compared 
to N+N — N+ Y+K with a 1.5- to 1.1-Bev threshold. 
Although the statistics are poor, the A°/6,° ratio appears 
smaller for the carbon events than for hydrogen. It may 
be that while A°’s are still within the carbon nucleus 
they interact to form 6s by A+N > N+N+K. The 
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decrease from H to C agrees with other experiments!*~"” 
which indicate that, for energies both just above and 
well above the K+K production thresholds, the A°/6,° 
ratio is greater than or equal to unity for hydrogen, 
decreases somewhat for light complex elements, and 
then increases as atomic number increases to elements 
such as lead. 

Study of the possible V° sources indicates a ratio of 
carbon to hydrogen stars of about 34. In the analysis 
of the neutron interactions, the net charge of +1 was 
taken as the identification of neutron-on-hydrogen 
stars. This would make the sources of the Vs 13 
carbon stars (lines A and B, Table I) and 9 hydrogen 
stars (lines C and D). However, since there are 4 
(line B) of 13 carbon events which are apparently 
neutron-neutron interactions that do not disturb the 
remaining nucleons in the carbon nuclei, there must 
also have been some neutron-proton events at the sur- 
face of the carbon nuclei. For this reason the events are 
estimated to be 16X (1-40.25) carbon and 6X (10.25) 
hydrogen events, where the errors are those for the 
statistical fluctuations for such small numbers of random 
events. 

The relative probability for strange particle produc- 
tion is calculated as: (related V° events)/[ (stars that 
could produce V°’s) X (detection efficiency) |. For 2600 
+250 hydrogen and 8800+250 carbon stars as deter- 
mined from the energy spectrum correction, assuming 
a 20% error in the detection efficiency, the results are 
(0.940.5)% of the hydrogen and (0.7+0.3)% of the 
carbon interactions produce neutral V° particles. When 
allowance is made for the associated production of two 
strange particles which may be either neutral or 
charged, the detection efficiency changes slightly but is 
within the limits given above. One other possible source 
of error is zero-prong stars that are sources of V’s. The 
ratio of “‘wall-produced” V°’s to “‘propane-produced”’ 
Vs is somewhat higher than is predicted from the 
equivalent thicknesses of the two materials, so that 
possibly some of the Vs had their origin in carbon 
events where the m-n interaction produced all neutral 
products. Such events, although unlikely, might raise 
the upper limit of the fractional cross section for 
carbon to about 2%. 


VI. SUMMARY 


The approximate value for the production of neutral 
strange particles by neutrons in the energy range 1 to 
6 Bev, with peak at 4 Bev, in interactions with both 
carbon and hydrogen is 1% of all inelastic events. The 
A°/0,;° ratio for production by these neutrons in both 
hydrogen and carbon as well as in the iron of the 
chamber walls is less than or equal to unity. 
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A series of high-altitude balloon flights was carried out in 1957 
and 1958 to study the flux of primary cosmic-ray protons and 
a particles during variations in the total cosmic-ray intensity. The 
following results are obtained for a particles with energies exceed- 
ing 530 Mev/nucleon under 13.5 g/cm? of air: (a) During a large 
Forbush-type decrease the a-particle and proton intensities were 
closely correlated. This demonstrates that a modulation mecha 
nism is operating on both components. (b) At certain times 
variations in the a-particle intensity were observed within a few 
hours which were not accompanied by corresponding changes in 
the proton flux. This is tentatively ascribed to an anisotropy in the 
a-particle flux that reaches the earth. (c) While there existed an 
intensity decrease in the proton flux between 1957 and 1958 which 
is also observed in the neutron monitor station data, no such 
variation occurred in the a-particle flux. A division of the a 


I. INTRODUCTION 
ARIATIONS in the total primary cosmic-ray 
intensity have been observed and studied for a 
number of years. The main tools for these investigations 
are permanently installed cosmic-ray monitors that are 
able to record the intensity over long periods of time. 
More recently, this work was complemented using air- 

craft and balloon-borne instruments. 

It was possible to show that most variations occur in 
the primary flux of cosmic-ray particles and are not due 
to local influences such as geomagnetic or meteoro- 
logical effects, and it could be demonstrated that the 
mechanisms that give rise to such variations are solar 
controlled and must operate within or near the solar 
system. Extensive work has been carried out to find 
some model that may account for the observed phe- 
nomena, but no unique explanation is as yet available. 

Most investigations that were carried out so far on 
the intensity variations of the primary cosmic radiation 
are concerned with the total flux of incoming particles. 
We know, however, that approximately 86% of the 
primary flux are protons, about 13% consist of alpha 
particles, while the remaining 1% are still heavier 

* This research is supported by the U. S. Committee for the 
International Geophysical Year and by the Air Force Office of 


Scientific Research, Air Research and Development Command, 
under contract AF 18(600)-666. 


particles into two energy groups (450 Mev/nucleon < £; £960 
Mev/nucleon and £2>960 Mev/nucleon) shows (a) that the 
Forbush decrease is of the same magnitude in both energy groups, 
(b) that the hourly flux increase observed in some flights is about 
the same in both energy groups, and (c) that from 1957 to 1958 
the flux in the low-energy group increased, while it decreased in 
the high-energy interval, contrary to the well-known behavior of 
the proton flux. These independent a-particle flux variations 
cannot be explained by any of the modulation mechanisms so far 
proposed. We suggest that occasional solar production of a 
particles may be responsible for our results. 

The absolute flux of a particles with energies exceeding 560 
Mev/nucleon at the top of the atmosphere was measured on 
five different days. 


nuclei. The information on intensity variations of the 
different components of the primary radiation is still 
very limited. A more detailed knowledge of their flux 
and of their energy spectrum as a function of time can 
be used to test the models that were proposed to explain 
the changes observed in the total cosmic-ray flux. The 
emphasis of the present experiment is to move a step in 
this direction through an investigation of some of the 
time variations of the primary proton and alpha- 
particle fluxes and a study of the correlation between 
the two components. If, for example, the total flux of 
the low-energy primary cosmic radiation is of galactic 
origin and the intensity modulation takes place in the 
solar system due to disordered magnetic fields, one 
would expect very similar changes for each rigidity 
interval in the proton and a-particle component. 
Should, on the other hand, solar production of particles 
with cosmic-ray energies play a more important role 
than is generally assumed, variations in the ratio of 
a-particle flux and proton flux are likely to occur. For 
two reasons, it is advantageous to choose the cosmic-ray 
alpha particles as the prototype of the heavy primaries. 


1. They are, next to the protons, the most abundant 
component, contributing more than 10% to the primary 
flux of particles. 
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2. They can readily be studied with electronic 
detectors with an excellent discrimination against 
protons and the still heavier components. 


Work on the changes in intensity and energy spec- 
trum of the primary cosmic-ray alpha particles in 
correlation with the total cosmic-ray intensity changes 
and the 11-year activity cycle of the sun was carried out 
in recent years using photographic emulsions and 
electronic detectors. A rather close correlation between 
the changes in a-particle and proton fluxes was found 
and it was concluded that the same mechanism is 
responsible for the intensity modulation of both 
components.! 

The question we wished to ask in the present in- 
vestigation concerns primarily the relationship between 
the short-term variations of primary protons and 
alpha particles. In this group would fall the 27-day 
variation, the sudden and sharp Forbush-type intensity 
decreases, and possibly 24-hour variations. We, there- 
fore, had to select properly the times for which measure- 
ments were to be made to obtain data at periods of 
highest and lowest total cosmic-ray intensity. The 
cosmic-ray neutron monitor at Chicago* was used for 
the purpose of choosing the appropriate times of 
measurement. 


Il. INSTRUMENTATION 


An electronic detector capable of measuring alpha- 
particle fluxes over an appreciable range of energy has 
been described by McDonald.’ His technique of using a 
combined scintillation counter and Cerenkov counter 
for discrimination between alpha particles and protons 
has, with slight modifications, been used in the present 
experiment, and we refer to the paper by McDonald for 
a detailed discussion of the capabilities of this detector. 
Figure 1 shows a schematic cross section through the 
apparatus used in the present experiment. Counter 1 
and Counter 3 are cylindrical plastic scintillators } inch 
thick with diameters of three inches and six inches, 
respectively. Each is viewed by a two-inch Dumont 
photomultiplier through a conical air light pipe with 
reflecting walls. Counter 2 is a Cerenkov counter con- 
sisting of a cylindrical Lucite radiator, one inch thick 
and four inches in diameter, mounted in optical 
contact with a five-inch Dumont photomultiplier. The 
top of the Lucite was painted black in order to obtain 
minimum light output from upward moving particles. 
A geometry factor of 8.30.4 steradXcm? is obtained 
by requiring a coincidence between Counter 1 and 
Counter 3. This geometry factor varies slightly between 
different sets of equipment, the error being unchanged. 
For each coincidence, the light output of Counter 1 and 


! Freier, Ney, and Fowler, Nature 181, 1319 (1958) ; Freier, Ney, 
and Waddington, Bull. Am. Phys. Soc. Ser. I, 3, 221 (1958); 
F. B. McDonald, Phys. Rev. 107, 1386 (1957); Bull. Am. Phys. 
Soc. Ser. II, 3, 220 (1958). 

2 Simpson, Fonger, and Treiman, Phys. Rev. 90, 934 (1953). 

3F. B. McDonald, Phys. Rev. 104, 1723 (1956). 
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Fic. 1. Schematic dia- 
gram of the counter tele- 
scope. Counters 1 and 3 are 
plastic scintillators ; Counter 
2 is a Cerenkov counter. 





Counter 2 are independently measured by displaying 
the electronic pulses on two separate cathode ray tubes 
and photographing them. Each particle passing through 
the counter telescope is, therefore, characterized by two 
numbers, one giving its energy loss in the scintillation 
counter and. one giving its output of Cerenkov light. 
The plastic scintillator material was chosen because of 
its small density and because it does not require any 
protective windows. The proportionality between light 
output and energy loss of the passing particles is 
satisfactory within the range of energy loss covered in 
the experiment.‘ Using Lucite with an index of re- 
fraction 1.5 as the material for the Cerenkov radiator, 
alpha particles with energies down to 450 Mev/nucleon 
are well discriminated against low-energy protons. The 
telescope is easily calibrated with relativistic cosmic-ray 
mu mesons at sea level. In this case, the scintillation 
detector shows a pulse-height distribution with a half- 
width of about 30% of the average pulse height pro- 
duced by the mesons, and the Cerenkov detector has a 
half-width of around 40%. 

During the balloon flights the entire apparatus was 
placed in an aluminum cylinder and kept at atmospheric 
pressure. 


Ill. THE EXPERIMENTS 


Two series of measurements at high geomagnetic 
latitude were carried out to investigate the proton and 


‘(C, N. Chou, Phys. Rev. 87, 903 (1952). 
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TABLE I. The proton flux, a-particle flux, and Climax neutron monitor intensity during five balloon flights. 


Average 
pressure 
during flight 
(mm Hg) 


11.3 
10.2 
10.2 
0.4 
9.0 


of protons 
counted 
in flight 


41 940 
48 546 
51 424 
75 828 
80 701 


Climax neutron 
monitor 


Date intensity 


2841417 
2480+ 1°, 
278041% 
270941% 
2694+1% 


August 16, 1957 
August 30, 1957 
September 16, 1957 
July 12, 1958 

July 22, 1958 


a-particle flux We chose Prince Albert, Saskatchewan 
(53° 13’ N, 105° 41’ W) as the balloon launching site 
in 1957 and Neepawa, Manitoba (50° 16’ N, 99° 27’ W) 
in 1958. The geomagnetic cutoff rigidities are 0.6 Bv 
and 0.7 By, respectively, according to Quenby and 
Webber.*® They are well below the low-rigidity cutoff of 
the primary spectrum during the years of this experi- 
ment, and we can expect that small changes in latitude 
during a flight will not noticeably influence the flux 
measurement. The actual change in latitude never 
exceeded 1°. For all balloon flights the altitude was 
chosen to be approximately 100 000 feet, corresponding 
to a residual layer of 13 g/cm? of atmosphere above the 
equipment. Allowing for the additional amount of 
matter to be traversed in order to reach Counter 3, 
one finds that particles with energies exceeding 150 
Mev/nucleon are capable of triggering the counter 
telescope. While in 1957 the balloons remained at 
constant altitudes for about eight hours, this time was 
extended to 20 hours in 1958. The pressure was moni- 
tored throughout the flight with a Wallace-Tiernan 
gauge type FA 160 which, together with a clock, was 
photographed at regular intervals on the data re- 
cording film. 

Twelve days were selected for balloon flights using the 
neutron monitor data from Chicago. Seven of those 


August, '9® eptember, '957 


Fic. 2. The total cosmic-ray intensity as a function of time 
measured by the Climax neutron monitor during the period of the 
1957 balloon expedition. The days of balloon flights are indicated 
by arrows. 


5 J. J. Quenby and W. R. Webber, Phil. Mag. 4, 90 (1959). 


Total number 


a-particle flux 
at the top of 
the atmosphere 
(E >560 
Mev/nucleon) 
(m~? sec"! 
sterad™') 


136.449% 
124.449% 
153.9497 
138.0489; 
140.148% 


a-particle flux 
corrected for 
13.5 g/cm? and 
background (E >530 
Mev/nucleon) 
(m~? sec™ 
sterad™') 


103.044, 

94.044; 
115.1+4°, 
103.743; 
105.2+3°% 


Total 
number of 
a-particles 

counted 

in flight 

(E >530 
Mev/nucleon) 


Proton flux 

corrected for 
13.5 g/cm? 
(m7? sec™ 
sterad™') 


191042 
1662+2' 
1921+2‘ 
1627+2' 
1588+ 2‘ 


2275 
2265 
3224 
5094 
5836 


flights were satisfactory in both balloon and equipment 
performances and five of them are included in the 
present analysis. They took place on August 16, 1957, 
August 30, 1957, September 16, 1957, July 12, 1958, 
and July 22, 1958. In Figs. 2 and 3, the total cosmic-ray 
intensity as measured by the Climax neutron monitor 
is plotted as a function of time for the two periods of 
balloon measurements. The arrows indicate the days on 
which balloon flights were made. The period in 1957 
showed about the largest intensity changes for the 
entire year. Especially around September 1, 1957, solar 
activity was at a very high level. 


IV. THE RESULTS 


In Table I we give the values of the total cosmic-ray 
intensity as measured with the Climax neutron station, 
the proton flux and the a-particle flux for the five 
selected balloon flights. These are averaged over the 
periods in which the measurements were carried out. 
The following corrections were taken into account. 
The flux values were corrected for altitude changes of 
the balloon during each flight. These amounted to less 
than 1 mm Hg in the flights that were selected except 
for one occasion where the pressure increased by about 
2 mm Hg for four hours (August 16, 1957) and another 
increase by 5 mm Hg during the last hour of the flight 
on July 12, 1958. 

In the case of the a particles we normalized all data 
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Fic. 3. The total cosmic-ray intensity as a function of time 
measured by the Climax neutron monitor during the period of the 
1958 balloon expedition. The days of balloon flights are indicated 
by arrows. 
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TABLE II. The relative changes in proton flux, a-particle flux, and the neutron monitor intensity at three stations 
with respect to the values of August 16, 1957. 








Balloon data 
Proton flux 


Date % change 


a-particle flux % change 
E>530 Mev/nucleon R=0.98 Bv 


Neutron monitor station data 


Sulphur Mountain Climax 
B R=2.71 Bv 


Sac Peak 
R=4.7 Bv 





August 16, 1957 0 0 
August 30, 1957 —13.0+-2 —9+4 
September 16, 1957 +0.6+2 +1244 
July 12, 1958 —14.8+2 

July 22, 1958 —16.8+2 


to an altitude corresponding to 13.5 g/cm? of air above 
the detector using an absorption mean free path® of 
45 g/cm? for @ particles in air and we corrected for a 
general background in the a-particle region (see Sec. D). 

A similar procedure was followed to obtain corrected 
proton flux values. We again normalized to 13.5 g/cm? 
of air overhead using the altitude dependence of the 
proton flux which was measured during the ascents of 
all flights. All singly charged particles able to penetrate 
the scintillation counter telescope were included in the 
proton group. The lowest energy accepted is, therefore, 
determined by the low-rigidity cutoff in the primary 
spectrum at the time of the measurement. 


A. The Changes in a-Particle and Proton Flux 
during the Forbush Decrease of 
August 30, 1957 


The largest intensity changes between days of 
measurements occurred during the 1957 balloon flight 
series. Alpha-particle flux measurements were carried 
out before, during, and after the large Forbush type 
decrease of August 30, 1957. Some preliminary results 
that we observed in connection with this phenomenon 
have been reported earlier.6 The main question we wish 
to investigate is the relative change of the proton flux 
and the alpha-particle flux during this interesting period 
of large total intensity change. 

We begin by comparing the changes in total intensity, 
proton intensity, and intensity of a-particles with 
energies exceeding 530 Mev/nucleon at the altitude of 
observation. 

Using the values that were obtained on August 16, 
1957, before the Forbush decrease occurred, as a 
reference, we observe the changes given in Table IT. It 
should be noted that the lowest rigidity of protons or 
a particles admitted in this measurement is not deter- 
mined by the geomagnetic cutoff at the location of the 
balloon flights. In the case of protons, it is given by the 
low-rigidity cutoff in the primary cosmic-ray spectrum 
in the years of the experiment. It should, therefore, be 
at about 1.8 Bv. The lowest a-particle rigidity included 
in the figures of Table IT is 2.3 Bv and is determined by 
our selection of the pulse-height region. 

Within the accuracy of our experiment, from August 
16, 1957 to August 30, 1957, the same relative changes 


6 Peter Meyer, Bull. Am. Phys. Soc. Ser. IT, 3, 221 (1958). 


0% 
—12.7% 
ais O7, 
toe 
—5.2% 


0% 
—7.9% 


0 
0% 


—1.6% 


—2.8% 





occurred in the averaged proton and a-particle in- 
tensities. Within two weeks after the decrease, both 
components returned to a high intensity value, the 
a-particle increase being considerably larger than the 
corresponding change in the proton component. We 
suspect that this high a-particle flux value is connected 
with the large intensity increase that was observed in 
the a-particle component during the measurement on 
September 16, 1957. This will be discussed in Sec. C. 

If one compares the relative intensity changes of 
various neutron monitor stations’ (Table II) between 
the days of balloon measurements, one notices that up 
to 4 Bv there exists very little rigidity dependence for 
this particular event. Within the limits imposed by the 
velocity sensitive range of the Cerenkov counter one 
can investigate the rigidity dependence of this variation 
for the a particles. We have, therefore, subdivided our 
a-particle fluxes into two intervals, the first including 
a particles with energies between 450 and 960 Mev/ 
nucleon and the second for energies exceeding 960 
Mev/nucleon. These data are included in Table III. 
The Forbush decrease is observed in both energy 
intervals with about equal amplitude. Within the error 
of the experiment there is no evidence for a strong 
energy or rigidity dependence. We conclude that within 
the energy interval covered by this experiment, a 
common modulation mechanism operated on both 
primary particle components during the large Forbush- 
type decrease of August 30, 1957. 

The rigidity dependence observed in this particular 
Forbush-type decrease is strikingly different from the 
rigidity dependence of the 11-year cycle variation 
(see Sec. B) and our result adds to the increasing 
evidence that the mechanisms responsible for the two 


TABLE III. The a-particle flux in two energy 
intervals during the five balloon flights. 


a-particle flux a-particle flux 

450-960 Mev/nucleon >960 Mev/nucleon 

corrected for 13.5 g/cm? corrected for 13.5 g/cm? 
(m~? sec sterad~) 

85+5% 

7245% 

8745% 

77.544% 

81.8+4% 


August 16, 1957 
August 30, 1957 
September 16, 1957 
July 12, 1958 

July 22, 1958 


7 We wish to thank Dr. D. C. Rose for his permission to use the 
data from his cosmic-ray monitor network. 





PETER 


ei ae | a 


+2400 _ 
- 


— 2000 


— 1600 


| 


~" sterad 


Particle Flux (E>530 Mev) 
Nucleon 
Corrected For 135g/cm“) 
@ Proton Flux 


August 16, 1957 


Protons (m 2 sec 


oo oe ee oo es oe ee | 


0800 1200 1600 CST 


° 


Fic. 4. The flux of @ particles and protons under 13.5 g/cm? 
residual atmosphere as a function of local time during the balloon 
flight of August 16, 1957. (These data are not corrected for back- 
ground in the a-particle region.) 


types of intensity variations are not closely related. 
The rigidity independence of the Forbush decrease in 
the low-energy region can be explained on the basis of 
modulation by disordered magnetic fields. It turns out 
that the scale of the disordering is equal to the Larmor 
radius at the energy below which the independence 
occurs (Parker*®). The measurements at aircraft altitude 
made by Simpson*® before, during, and after the Forbush 
decrease of August 18, 1951, also showed this rigidity 
independence in the same rigidity interval. We may 
point out here that our experiment is not capable of 
discriminating between an energy dependence and a 
rigidity dependence of these variations. 


B. Changes in the Primary Particle Intensity 
and Spectrum between 1957 and 1958 


During the period between the two balloon flight 
expeditions, noticeable changes occurred in the primary 
cosmic-ray intensity and energy spectrum as exhibited 
by the monthly averaged intensities recorded by 
neutron monitor stations at various latitudes.’ Further- 
more, measurements using aircraft established" that 
the low-rigidity cutoff of the primary cosmic-ray 
spectrum changed to higher rigidities between the 
summer of 1956 and the beginning of 1958. These 
changes follow the pattern now known as the 11-year 
cycle of cosmic-ray intensity and spectrum and one 
expects the largest intensity changes to occur in the 
low-energy end of the spectrum.” This is indeed 
observed if one compares the relative changes in 
intensity as measured by neutron stations (Table IT) on 
the days of balloon flights and the changes in proton 

8 E. N. Parker, Phys. Rev. 103, 1518 (1956); 110, 1445 (1958). 

9» J. A. Simpson, Phys. Rev. 94, 426 (1954). 

J. A. Simpson (unpublished results). 

" P, Meyer and J. A. Simpson (unpublished). 

2 Peter Meyer and J. A. Simpson, Phys. Rev. 99, 1517 (1955); 
106, 568 (1957). 

18H. V. Neher and H. Anderson, Phys. Rev. 109, 608 (1958). 
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intensity at balloon altitude. A strong rigidity de- 
pendence is present in contrast to the observations of 
the Forbush decrease discussed in the previous section. 
A decrease in intensity, comparable to the one observed 
in the proton component, was also observed at high 
altitude and latitude, using balloon-borne neutron 
equipment.'4 

We did, however, not find a comparable decrease in 
the flux of primary a particles with energies exceeding 
530 Mev/nucleon, which remained almost unchanged. 
Therefore, the ratio of the proton flux at 13.5 g/cm? 
over the a-particle flux at 13.5 g/cm’, which had an 
average of 17.6 in the 1957 flights, became 15.4 in 1958. 
The division of the a-particle flux into two energy 
intervals (Table III) seems to indicate that a decrease in 
intensity does take place in the high-energy group, 
while the low-energy flux increases, although the errors 
are rather large. This behavior is exactly the opposite of 
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Fic. 5. The flux of a particles and protons under 13.5 g/cm? 
residual atmosphere as a function of local time during the balloon 
flight of August 30, 1957. (These data are not corrected for back- 
ground in the a-particle region.) 








what one would expect on the basis of the proton 
flux data. 


C. Rapid Variations in the Primary 
e-Particle Flux 


An outstanding advantage of an electronic counting 
device for the measurement of the primary cosmic-ray 
flux is the possibility of studying in detail the intensity 
as a function of time during the periods the equipment 
stays at altitude. We have divided the flight periods 
into equal intervals, each lasting for approximately 
90 minutes. The results so obtained for the a-particle 
flux and for the proton flux as a function of time are 
presented in Figs. 4 through 8. During all of the 
measurements carried out in 1957 we noticed an increase 
in the a-particle flux as a function of time which was not 
accompanied by a similar change in the proton intensity. 


4K. B. Fenton (private communication). 
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This effect was most remarkable in the measurement of 
September 16, 1957. All flights in the 1957 group of 
experiments were launched in the early morning hours 
and therefore cover about the same period of local time. 
The total cosmic-ray intensity was measured inde- 
pendently with a vertical Geiger counter telescope 
during those flights!® and shows the same time de- 
pendence as the proton flux. The a-particle increase is 
not only observed at energies exceeding 530 Mev 
nucleon, but occurs with comparable amplitude in the 
two energy intervals of 450-960 Mev/nucleon and 
> 960 Mev/nucleon. 

This observation raises the question whether a 
24-hour variation or anisotropy in the a-particle flux 
may occasionally be present.® It should be noted that a 
similar effect has previously been observed by McDonald 
in his a-particle measurement of March 13, 1956. Ours 
as well as McDonald’s observations were carried out at 
a time of enhanced solar activity as exhibited by the 
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Fic. 6. The flux of @ particles and protons under 13.5 g/cm? 
residual atmosphere as a function of local time during the balloon 
flight of September 16, 1957. (These data are not corrected for 
background in the a-particle region.) 


occurrence of a large Forbush-type decrease in total 
cosmic-ray intensity. 

The presence of an independent a-particle intensity 
variation would be highly interesting because it can 
certainly not be achieved by any of the modulating 
mechanisms that have so far been considered. In spite 
of the large number of a particles counted in this 
experiment, the evidence for the independent a-particle 
intensity variation is subject to a substantial error. 
For example, the probability that the measurement of 
September 16, 1957 is consistent with identical a-particle 
and proton variation amounts to about 75. 

In order to study this variation in more detail, the 
1958 flights were designed to remain at altitude for 
about 20 hours, but no similar independent a-particle 
intensity change could be observed, as seen from Figs. 7 

'8 This equipment was built and is used in a different research 
program by Mr. F. Jones and Mr. K. Yates. [ wish to thank them 
for participation on my flights and for providing me with the data. 


PROTON 





140 


sterad~' 


o a-Particle Flux (E >530 Mev/Nucleon) 
(Corrected For 13.5 g/cm) 


e Proton Flux 
July 12,1958 


Se 
0600 1000 400 1800 
cst 


Protons (m~@ sec 





ee 
0200 


2 


Fic. 7. The flux of a particles and protons under 13.5 g/cm? 
residual atmosphere as a function of local time during the balloon 
flight of July 12, 1958. (These data are not corrected for back- 
ground in the a-particle region.) 


and 8. We plan to carry out some further experiments 
to study this phenomenon. 


D. The Absolute Flux of Primary a Particles 
with Energies in Excess of 560 
Mev/Nucleon 


The total number of a particles measured during each 
flight of the present experiment is substantially higher 
than in any previous investigation, due to the large 
geometry factor and the length of time of each measure- 
ment. It, therefore, seems worthwhile to use the data for 
a determination of the absolute flux of primary a@ 
particles. If one wishes to obtain an absolute measure- 
ment and to extrapolate the a-particle fluxes to the top 
of the atmosphere, the following quantities have to be 
known: 


1. the absorption length of @ particles in air; 

2. the average path length in air traveled by a 
primary particle before reaching the detector; 

3. the contribution of a-particle fragments from 
heavier nuclei colliding above the apparatus; 

4. the contribution by a@ particles originally not 
passing through the entire telescope but producing a 
fast knock-on electron capable of triggering Counter 3; 

5. the general background due to showers, nuclear 
interactions, etc. in the pulse-height region in which the 
a particles fall. 
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The absorption mean free path of a particles has been 
determined by McDonald* and we used his value of 
(45+4) g/cm’. The average layer of air traversed by 
the primary particles turns out to be 2% higher than 
the vertical path for our geometry. The number of 
a-particle fragments from heavier primaries contribut- 
ing to the measured a-particle flux at the altitude of the 
equipment was estimated using the equations given by 
Noon and Kaplon"® and extending the solution of these 
equations to the a-particle flux. In order to obtain a 
numerical result we modified the heavy primary flux 
values of Danton, Fowler, and Kent,!’ so as to account 
for the changing primary intensity with the solar 
activity cycle.'* We used the fragmentation probabilities 
of Rajopadhye and Waddington’® and the interaction 
mean free paths published by Piischel.” 

An estimate of the fraction of a particles which do not 
fall into the angle of acceptance of the telescope but are 
counted after producing a knock-on electron which 
triggers Counter 3 can be obtained using the results of 
Webber™ with w mesons on a very similar telescope 
which had guard counters installed. Webber estimates a 
correction of about —2%. The fact that the energy a 
knock-on electron has to exceed in order to trigger the 
bottom counter in our telescope is more than twice as 
high as in Webber’s arrangement, leads us to a correc- 
tion of less than —1°%. We, therefore, did not correct 
for this effect. 

Two factors contribute to a background in the 
a-particle region. First, a tail due to the pulse-height 
distribution of singly charged particles may extend into 
this region and, second, there will be a general back- 
ground due to showers and nuclear interactions. The 
first effect can be eliminated by observation of the pulse- 
height distribution of sea-level 4 mesons. A measure- 
ment of this kind yielded a negligible contribution in 
the pulse-height region occupied by a@ particles with 
energies exceeding 530 Mev/nucleon. In order to obtain 
an estimate of the general background underlying the 
a-particle region, we extrapolated the background in 
the two-dimensional pulse-height matrix into this 
region. This procedure leads us to a correction of about 
—5% with an accuracy of not better than 30%. 

After including the above-mentioned corrections and 
all errors, we arrive at the absolute primary a-particle 
fluxes at the top of the atmosphere that are presented in 
the last column of Table I. Taking into account an 
energy loss of 30 Mev/nucleon during the traversal of 
the residual atmosphere and the equipment, these flux 


16 J. H. Noon and M. F. Kaplon, Phys. Rev. 97, 769 (1955). 

17 Danton, Fowler, and Kent, Phil. Mag. 43, 729 (1952). 

18 Freier, Ney, and Waddington, Phys. Rev. 114, 365 (1959); 
Aizu, Fujimoto, Hasegawa, Koshiba, Mito, Nishimura, Yokoi, and 
Schein (to be published). I wish to thank these authors for 
providing me with their results prior to publication. 

#V. Y. Rajopadhye and C. J. Waddington, Phil. Mag. 3, 19 
(1958). 

” W. Piischel, Z. Naturforsch. 13a, 801 (1958). 

21 W. R, Webber, Nuovo cimento 4, 1285 (1956), 
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values contain all primary a particles with energies 
exceeding 560 Mev/nucleon. 


SUMMARY AND CONCLUSIONS 


I'rom a series of cosmic-ray a-particle and proton 
flux measurements carried out during the years 1957 
and 1958, the absolute a-particle flux at the top of the 
atmosphere was obtained for five different days. On 
August 30, 1957 a large Forbush-type intensity decrease 
occurred and flux measurements could be made before, 
during and after this event. The following results were 
obtained : 


1. The primary proton and a-particle intensity show 
the same relative change during the Forbush decrease. 
This demonstrates that the mechanism responsible for 
this intensity change is operating in the same way on 
both components and is a modulation mechanism. No 
energy dependence could be found. 

2. The relative change in proton intensity at altitude 
is of the same amount as that observed at the Climax 
and the Sulphur Mountain neutron monitors. This 
means that there is no noticeable rigidity dependence in 
this Forbush decrease up to a rigidity of 3 Bv. 


The measurements in 1958 took place at a period with 
no large cosmic-ray intensity changes. We can compare 
the proton and a-particle fluxes on the days of balloon 
flights in 1957 and 1958 and we are led to the following 
results: 


1. The proton intensity at balloon altitude and high 
latitude decreased by about 15% between the days of 
flights from 1957 to 1958. We compare this number with 
the changes in intensity observed by neutron monitor 
stations on the days of balloon flights and find a 
decrease of 7% for Sulphur Mountain and 5% for 
Climax. This indicates a noticeable change in the low- 
energy portion of the primary proton spectrum between 
flight days in the two years which undoubtedly is 
connected with the solar cycle variation of the primary 
cosmic radiation. 

2. No decrease, comparable to the change in proton 
intensity, could be observed in the a-particle com- 
ponent. The flux of a particles with energies exceeding 
530 Mev/nucleon remained unchanged between the 
flights of 1957 and 1958. A subdivision of the a-particle 
flux into two energy groups (450 Mev/nucleon < £; ¢ 960 
Mev/nucleon and E,>960 Mev/nucleon) leads, with 
less statistical accuracy, to the conclusion that an 
intensity decrease is present in the high-energy interval, 
while the low-energy flux increases from 1957 to 1958. 
The resulting change in spectrum is opposite to the 
observations made with protons. 


During several flights in 1957, while solar activity 
was at a very high level, we observed an increase in 
a-particle flux within a few hours which was not 
correlated to a change in proton flux. In each case this 
increase occurred during the daylight hours. We 
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tentatively ascribe this observation to a 24-hour 
variation in a-particle intensity, but further experiments 
are necessary to substantiate this effect. 

Both the observation of an independent hourly 
a-particle intensity variation as well as the change in 
the ratio of a particle to proton flux between the balloon 
flights in 1957 and 1958 cannot be explained by the 
action of a modulation mechanism only, which operates 
on both primary components. One is led to suspect the 
possibility of occasional production of primary a par- 
ticles by the sun. The experimental evidence is scarce 
and it cannot be justified to draw a more detailed 
conclusion at the present time, but further experiments 
should be directed towards answering this question. 
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The analytic properties of two-particle transition amplitudes as functions of both energy and momentum 
transfer are examined in perturbation theory. The modified Nambu representation previously proposed by 
the author for expressing these properties is discussed in a little more detail. It is shown that, as long as the 
masses do not satisfy certain inequalities connected with the existence of anomalous thresholds, the fourth- 
order terms, calculated in the usual manner, satisfy the representation. The spectral functions are calculated 
explicitly for spinless particles. The proof can be extended to the sixth order, but is not worked out here. 
The modifications necessary when there exist anomalous thresholds are mentioned. 


1, INTRODUCTION 


N a previous paper,! a representation was proposed 

for two-particle transition amplitudes when both 
the energy and the momentum transfer become complex. 
This representation exhibits analytic properties of the 
transition amplitude which are generalizations of the 
analytic properties expressed by the usual dispersion 
relations, in which one of the variables is kept fixed. 
The representation is similar in appearance to one 
proposed earlier by Nambu? for Green’s functions; 
however, it differs in detail and its validity is postulated 
in a much more restrictive form. 


* This research was supported by the U. S. Air Force under a 
contract monitored by the Air. Force Office of Scientific Research 
of the Air Research and Development Command. The initial 
stages were supported by the Atomic Energy Commission contract 
of Columbia University. 

1S. Mandelstam, Phys. Rev. 112, 1344 (1958). 

2 Y. Nambu, Phys. Rev. 100, 394 (1955). 


Double dispersion representations of this type have 
not thus far been proved from the general principles of 
quantum field theory. The usual dispersion relations 
can be proved by examining the restrictions imposed by 
causality on the four-point Green’s function, provided 
that the momentum transfer is sufficiently small.*~* It is 
unlikely that a corresponding proof can be carried out 
in our case, or indeed that the representation follows 
from these requirements alone. The general principles 
of field theory contain much more information, since 
the causality condition enables one to deduce analytic 
properties of all the Green’s functions, which are related 
to one another by the unitarity conditions. It is there- 
fore very possible that the representation is a coase- 
quence of the general principles of field theory, but it 

5 Bogoliubov, Medvedev, and Polivanov (unpublished). 

4Bremermann, Oehme, and Taylor, Phys. Rev. 109, 2178 
(1958). 

5H. Lehmann, Nuovo cimento 10, 579 (1958). 
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seems at present a matter of considerable difficulty to 
carry out such a proof. 

In the absence of a rigorous treatment making use of 
all the information available from the general principles 
of field theory, therefore, it should be useful to examine 
the analytic structure of transition amplitudes in 
perturbation theory. It is the aim of the present paper 
to carry out such an investigation and, in particular, to 
verify that the representation mentioned above is valid 
in the lower orders. In Sec. 2 the representation is 
discussed for the general reaction A+B— C+D, and 
certain features, such as the form of the representation 
when subtractions are necessary, are discussed in more 
detail than in reference 1. In Sec. 3 it is shown that the 
fourth-order perturbation theory terms satisfy this 
representation, and an explicit form for these terms is 
given if the particles are scalar. The masses are re- 
stricted to have values for which there are no anomalous 
thresholds. It is pointed out that the sixth-order terms 
can also be shown to satisfy the representation, but the 
method used does not permit extension to general order. 

In the final section the analytic form of the fourth- 
order transition amplitude is examined when the masses 
are allowed to take on values corresponding to the 
existence of the anomalous thresholds of Karplus, 


Sommerfield, and Wichmann.® These authors dis- 


tinguish two types of anomalous thresholds, those that 
depend on one of the variables only and correspond to 


similar thresholds in the vertex problem, and those that 
depend on both the variables and have no analog in 
the vertex problem. If only the first type of anomalous 
threshold occurs, it will be shown that the double 
dispersion representation still holds. The form of the 
regions in which the spectral functions are nonzero is 
slightly altered. If, however, the second type occurs, 
ihe double dispersion representation breaks down, and 
there will be singularities in the complex plane. 


2. ANALYTIC REPRESENTATION OF 
TRANSITION AMPLITUDES 


Kinematics 


We take the general case of the interaction of four 
particles, A, B, C and D, whose masses are M;, Mo, M;, 
and M,, though we shall frequently simplify the treat- 
ment by assuming that some of the masses are equal to 
one another. A single Green’s function will describe 
three possible reactions 


A+B 
A+D 
A+C 


-C+D, (I) 
>B+C, (ID) 
>B+D. (II) 


Let the four-momenta of the four particles be denoted 
by pi-+-p4; they will be taken as positive for incoming 

‘ J 8 
particles, negative for outgoing particles. The squares 


® Karplus, Somerfield, and Wichmann, Phys. Rev. 114, 376 
(1959). 
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of the energies for the three reactions will be given by 
s= (pit p»)’, (2.1a) 
(= (pit ps)’, (2.1b) 
u= (pit ps)’, (2.1c) 
(time-like vectors are taken as positive). They are 
connected by the relation 
sti+u=M/4+M274+M?4+M¢. (2.2) 
Owing to this relation, the scattering is characterized 
by only two independent invariants. However, it will 
prove most convenient to use all three quantities s, /, 
and u, and to take different pairs of these as independent 
variables at different times. When an expression such 
as A(s,/,u,) is written down, it is implied that s and ¢ 
are the independent variables, and 1, is to be defined as 
a function of s and ¢ by (2.2). Similarly, in expressions 
such as A(5,t;,u) or A(s,,t,), t, and s; are to be defined 
as a function of the other variables by (2.2). 

It should be pointed out that the variable ¢, besides 
being the square of the energy for the reaction II, is 
also the square of the momentum transfer between A 
and C in the reactions I and II (space-like if negative) 
and w is similarly the square of the momentum transfer 
A and D in the reactions I and III. 


(reaction I) 
(reaction IIL) 


(reaction IT) 


between 
Ordinary Dispersion Relations 


The usual dispersion relation, in which ¢ is kept 
constant, has the form (for scalar particles) 


1 é A i(s" f.14) 
I ds’ 
TJ) Ma s'—S 
1 > M2—-—M? 


= 


va 


A(s,t,;) 


(2:3) 


where, as usual, the denominators are given small 
negative imaginary parts. 4, and A, are the ‘‘absorptive 
parts” associated with the reactions I and II, respec- 
tively. M, and My, are the lowest masses which can 
occur in the continuum of intermediate states in the 
two reactions. If, as is often the case, the lowest inter- 
mediate states are discrete, the absorptive parts will 
have 6-functions at the corresponding values of s; for 
simplicity, we shall not write them explicitly. The 
dispersion relations will have this simple form only 
if A tends to zero as s becomes infinite; otherwise they 
will have to be modified by subtractions in the usual way. 

Except for the case of forward elastic scattering, the 
absorptive parts in (2.3) always contain contributions 
from the unphysical region. This unphysical region may 
be divided into two parts. If s approaches sufficiently 
near the threshold and / is kept fixed, the angle of 
scattering will become complex. If ¢ is not too large, the 
value of the absorptive part can be found in this 
unphysical region from the angular-momentum expan- 
sion. In addition, the lowest masses of the intermediate 
states may be below the sum of the masses of the 
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particles participating in the reaction. Between these 
two mass values we shall always be in the unphysical 
region. Such as unphysical region occurs, for instance, 
in nucleon-antinucleon scattering or nucleon-anti- 
nucleon annihilation into two pions, when it extends 
from the square of the mass of the lowest intermediate 
state (4u?), to 4M?. 

If ¢ is positive (time-like momentum transfer), the 
greater part of, if not the entire, region of integration 
in (2.3) lies in the unphysical region, and dispersion 
relations have not been considered for this case. We 
shall see, however, that they follow from the general 
analytic representation for the transition amplitude. 

As an absorptive part A» in (2.3) is associated with 
the reaction IT, it is more appropriate to use the variable 
uw instead of s; the equation then becomes 


I Ax(s’,t.u;) 1 Ao(s;,t.u’) 
ye J ds’ + f du’ : 
wT! Ma? s’—s r/ M2 u—4 


(2.4) 


By interchanging the roles of the particles in the 
reactions, dispersion relations can also be obtained when 
u or sis kept constant; they take the form 


1 is A,(s"t),u) 1 x A3(5,,l',u) 
A= f ds’ +- f dt’ 
TJ M s'—5 Tr! M, '-1 


a’ 


l 4" Ao(s,t;,4’) 1 f% As3(s,t',161) 
A= f du’ + f dt’ : 
wJ My? u—U wr) M2 '—t 


(2.6) 


A; is the absorptive part associated with the reaction 
IL. 

In the case of elastic scattering, two of the reactions, 
I, II, and III will be identical. If, for instance, A and C 
are nucleons, B and D pions, then the reactions I and 
II will be identical. In the dispersion relation (2.4), 
where the momentum transfer between the two pions 
(or the two-nucleons) is kept constant, the absorptive 
part associated with reaction III, the pair-annihilation 
reaction, will not occur at all. In the dispersion relation 
(2.5), however, in which the momentum transfer 
between the incoming pion and the outgoing nucleon is 
kept constant, the integrand in the “crossing” term 
will involve A;, the absorptive part associated with 
reaction III. Owing to the contribution from the large 
unphysical region, this dispersion relation has not thus 
far proved useful in phenomenological analyses. 


Double Dispersion Representation 


The proposed representation of the transition 
amplitude, as an analytic function of two complex 
variables, is obtained by generalizing the analytic 
properties given by the ordinary dispersion relations 
in the simplest possible manner. It is assumed that the 
transition amplitude is analytic in the entire space 
of the two variables except for cuts along certain planes, 
the location of the cuts being determined so as to lead 
to the dispersion relations (2.4)~—(2.6). Cauchy’s 
theorem then leads to the following analytic represen- 


IN PERTURBATION 


THEORY 





Fic. 1. Region in 
which the spectral func- 
tion Aj3 is nonzero. 











tation of the transition amplitude: 


1 A3(s’,t’) 1 Ay3(U,u’) 
fosae +: fecau 
a (s’—s)\('—-1 wr? (t'—1t)(u’—1u) 


1 A 0(s’u’) 
+ fasaw 2 

r (s’—s) (1! —1) 
As in the ordinary dispersion relations, there will in 
practice be subtraction terms and the representation 
will be rather more complicated than (2.7). The 
spectral function A,; will be nonzero in a region of the 
form shown in Fig. 1 (to the right of the curve C). The 
equation of C has to be determined from the unitarity 
condition; all that we can say now is that the curve 
approaches asymptotically the lines s= M,? and /=M /. 
Aj. and A»; will be nonzero in corresponding regions. 
It should be noted that the spectral functions are zero 
in the physical region for any of the reactions, where 
one of the invariants—the square of the energy—is 
positive, while the other two-—the the 
momentum transfers—are negative. 
* The dispersion relations (2.4)—(2.6) follow from (2.7) 
if we define the absorptive parts by the equations 


1 A,3(s,t’) 1 A 1o(s,u’) 
A, fue t few : 
‘a '-1 T u’—u 
1 Ayo(s’ 1) 
fw : 
1 u’—U 
Ais(s’,t) 1 Ao3(t,u’) 
: 4 fow 
s’ § T u’ —tU 


This may be verified by a direct substitution of Eqs. 
(2.8)—(2.10) in any one of the dispersion relations, when 


squares of 


(2.8) 


(2.9) 


(2.10) 


(2.7) results. From (2.8)—(2.10) we see that the absorp- 
tive parts satisfy dispersion relations in the momentum 
transfer for their reaction when the energy is kept fixed. 
The weight functions are Aj3;, A», and Aj. which, as 
we have pointed out, are in the unphysical region. The 
cuts therefore begin at points on the curve C in Fig. 1 
and corresponding curves for Aj; and Aj, i.e., a finite 
distance above the thresholds M,?, M,?, and M2. On 
the other hand, as long as C approaches asymptotically 
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the lines s=M,? and ‘=M 2’, the absorptive parts A, 
A», and A;, as given by (2.8)—(2.10), will begin at the 
expected thresholds. 


Representation with Subtractions 


Let us return to consider the form the representation 
(2.7) takes when—as is always the case in practice— 
there are subtractions. Proceeding as we do for ordinary 
dispersion relations, we may write the first term of (2.7) 
in the form 


(s—So)(t—to) A\3(s’,t’) 
fasar 
r (s’—sy)(s’—s)(t'—to) (’'—1) 
S—So 7” fils’) 
+ f ds’ 
mw Sma: (s’—59)(s'—S) 


f(t’) 


t—lo og 
+ f dt’ t 
wr Ju. (t'—ty)(t'—2) 


(s—so)(t—to) A 43(s’,t’) 
A= fesar se + 
r° (s’—s»)(s’—s)(t'—to) ('—1) 
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where 


A i3(s,t’) 

pee 
t'—lo 

A 13(8”,l) 

fos : 
s’—So 


1 A,;(s’,t’) 
(=— f ie, 
Tr (s’— sy) (t'—to) 


If, now, the scattering amplitude remains finite at 
infinity, the first term of (2.7) can still be replaced by 
(2.11) but fi, fe, and ¢ will no longer be given by the 
integrals (2.12), which may not even converge. On 
treating the other terms of (2.7) in the same way, we 
find that A is given by the representation 


(£— to) (4— Uo) Le 
f dt'du’ jeceehtentctaionaninit 
” (t!— 0) (t! —1) (14’ — tuo) (u’ — 4) 


(s—so)(u— Up) A j2(s’,u’) s—So f°” fils’) 
+ fasaw ; — ; +— J , eS 
r (s’—50)(s’—s)(u'—uo)(u’'—u) oe SMa? = (s’—50)(8’—5) 


U— ty ” 
+—— f du’ + 
xr JMy (u’—Uo)(u'—U) 


it is found that the amplitude for scattering of scalar 
particles (coupled with fermions) does remain finite at 
large values of the variables, so that the representation 
has the form (2.13). 

For future reference we note that the dispersion 
relations (2.8)—(2.10) must now be written 


The first three terms will be referred to as ‘double 
dispersion integrals,” the fourth, fifth, and sixth as 
“single dispersion integrals.” If any of the reactions can 
occur through a discrete intermediate state, so that 
there are poles in the transition amplitude, they will be 
represented by 6-functions in fi, fz, or f:, below the 
thresholds M,?, M,?, and M2. In perturbation theory, 


A,3(s,t’) 


i—ie 
A,= fi(s)+ fa - + 
7 (t'—to) (t’—1) 
5—Se A 12(s’,a) 
an _ fo —_—. 
T (s’— se) (s’—s) 


U— Uy Ao3(t,w’) 
+ f ieee, 
Tr (u’ — uo) (u’ —u) 


ii A(t’ ,u) 
»= fo(u)+ fu ae 
(t’—ty) (t’—2) 


S—So Ay3(s’ ju) 
A; f3(t) + fos 
T 


As the fourth term in (2.13) does not depend on ¢, it 
affects only the S waves in the reaction I. The succeed- 
ing terms are purely real in the physical region, so that 
the single dispersion integrals affect the absorptive 
part of only the S wave. The absorptive part of all the 
other waves is determined completely by the double 
dispersion integrals. 


(s'—-5, )(s’—s) 


f(t’) 


fo(u’) 
(t'—to) (t'—2) 


+r. (2.13) 


t—lo + 
f dt’ 
T M,? 


uU— ty A jo(s,u 
few —__——., (2.8a) 
T (u’ — 19) (u’ — 1) 


(2.9a) 


(2.10a) 


The functions /, f2, and fs will of course depend 
on the values of so and ¢) chosen. The subtractions can 
be performed in such a way that these functions are 
just the S-wave absorptive for the three reactions. To 
do this, we note from (3.8) that the S-wave absorptive 


part for the first reaction is given by 
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1 


1 
a d(cos@) A 1(s,/,11) 
ZY 1 


tot (8) Rater ala 1 ubl (8) A jo(s,u) 
ecereen tal (8) i—t per eon, satin a ul —U 


t'—ty1(s) 1 u’ — Up (Ss) 
ae — far. A3(s,t’) In — =f dul Als) In (2.14) 
“ 'iet= tars) t’ —tar(s) wf stor(s) —Mar(5) | ut’ — Ua (S) 


tar(s) and ¢,;(s) are the minimum and maximum values of ¢ for the reaction I, and similarly for war(s) and 
ust(s). As ¢ or « depends linearly on cos@ when s is fixed, the integrations over cos# has been replaced by 
integrations over these variables. We now see that we can rewrite (2.7) in the form 


1 1 1 1 
== fasaranio,n(- -tnit’))(— —his') )+— fata Ante 
7 s'—s '—t r 
1 1 1 1 
x{— -inww't))(— hint’) ) += f d'un) ( —In(w's)) 
F F r s'-s 


t—t u“—U 


1 S—So f°” A,(s’) u—Uy f°” A, (u’) t—to 
x (<a +—f if bccn ete +-- f du/—_—_—_—_ +- 
u’—U aw JMq? (s’—S50)(s'—S) a My: (u’—m)(u'—u) or 
A; (t’) 
xf dl’- py (2.15) 
Me: (t’ ~ ty) (t’—12) 


Let us now consider the number of subtractions 
required when the particles A and C are fermions of 


1 v—t at (s’) Me ; ; ¥ ; : 
In spin 3. If we write the invariant scattering amplitude 
lar(s’)—ty(s’) v —~ty(s") in the form 


where we have written /;(s’,t’) for 


The absorptive parts for the S waves then depend only —A+Hiy: (po— ps) B, (2.16) 


on the single dispersion integrals and those for the 
higher angular-momentum waves only on the double 
dispersion integral. 


we find that the perturbation terms have the following 


asymptotic behavior: 


t—- const, s— A-—const, B- const/s. 


s— const, toruw— ©, Re(A) and Re (B)—const, 
m(A) and Im(B)->const/t (const/u). 


s,t,andu— ~, A and B — const/s. 


s can be replaced by wu in these limits since interchange of these variables simply interchanges the two spin-zero 
particles. It is then easy to see that the representation must have the form 


5—S0 A,3(s’ st) u— Uo Ao;3(t',u’) 
A= = fay dt/—— --+-—— ‘far du’—— 
(s’ — 50) (5! —s)( t! —1) - (ae’ — uo) (u’ —u) (t'—1) 


fan 


1 A jo(s’,u’) le? | ats) Di a(n’) Yn s(t ¢) 
+— § ds'du’— + J ds'— +.. du’ + “fa x 
Mat Ss > w—uU M 


a (s’—s)(u’—u) s'—s 2I My? 


B,;3(s’,t’) 1 By3(t’,u’) 1 Byo(s’,u’) 
ds'dt/——__—— =f araw —————-+— § ds'dw’ - 
(s’ —s)(—-1 i ('—t)(u’—u) (!—3)(a'—w) tg 
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We observe in particular that, in this case, there is no 
over-all subtraction constant independent of both 
variables. It is in fact true independently of pertur- 
bation theory that the number of over-all subtraction 
terms in a double dispersion representation is limited 
(one for the case where all the particles are scalar, none 
for all other The presence of one over-all 
subtraction term implies that the S-wave amplitudes for 
the three reactions behave like a constant at infinity, 
whereas more than one over-all subtraction term 
(involving polynomials in the variables) would require 
at least one angular-momentum state for one of the 
reactions to increase at infinity. However, the form 
—2n(e"* sind)w/qwyws of the scattering amplitude for 
an individual partial wave (Imé 20) indicates that it is 
bounded at infinity. On inserting the kinematical 
factors relating it to the invariant scattering amplitude, 
we find that the latter must tend to zero at infinity, 
except for scattering of scalar particles when it may 
remain finite. We thus arrive at the result just stated. 


cases). 


3. EXPRESSION OF FOURTH-ORDER FEYNMAN 
GRAPHS AS DOUBLE DISPERSION 
INTEGRALS 


Outline of the Method 


We now wish to evaluate the fourth-order Feynman 
diagrams and to show that they can be expressed in the 
form of the representation we have been describing. In 


ig! f 
AG) =— 
(2r)'4 [(pi-—q)?— M* JL ( pit p2—q)?— M? JL (ps+q)?— M2] (g?— M?) 


The 4 in the superscript is the order of the perturbation 
theory, and the superscript (i) is to indicate that we are 
not dealing with the entire fourth-order amplitude. 

The integral (3.1) is a function of the two invariants 
s and ¢. The method of procedure will be to show that, 
if ¢ is fixed, real and negative, this integral is an analytic 
function of s with no singularities in the complex plane. 
A dispersion relation can therefore be written for A“. 
The imaginary part which appears in the integrand is 
next calculated explicitly by examining (3.1). It will 
then be seen that A“: can be brought into the form 
of the first term of (2.7). 

The easiest way of showing that A“: is an analytic 


Fic. 2. Fourth-order per- 
turbation graph for the re- 
action A+B -— C+D. 
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order to simplify the discussion, all particles will be 
taken to be scalar and the A¢! interaction term will be 
omitted. Though greatly reducing the amount of algebra 
to be performed, this assumption does not change any 
of the essential features of the result. 

The reducible graphs do not present any difficulty. 
They are all functions of one of the variables only, and 
may be represented by modifying a vertex part or an 
internal line in a second-order diagram. The modified 
vertex part or internal line can then be expressed as a 
dispersion integral in the mass of the internal line. That 
it is possible to do so is now a well-known fact for the 
internal line, and has been proved in perturbation 
theory for the vertex part,’~* though in certain cases 
there may be anomalous thresholds. On inserting these 
dispersion integrals into the formulas for the scattering 
diagrams in question, it follows at once that the scatter- 
ing amplitude satisfies a dispersion relation in the 
variable on which it depends, again with the possible 
occurrence of anomalous thresholds. 

The reducible graphs thus contribute only to the 
single dispersion integrals in (2.13). In this case they 
tend to zero at infinity, so that they can be written 
without subtractions. 

We are left with the irreducible diagrams, which are 
all topologically identical with Fig. 2. For simplicity we 
shall suppose all masses to be equal. The amplitude 
corresponding to this diagram is then given by the 
formula 


d‘g 
(3.1) 


function of s for fixed negative / is to parametrize it in 
the usual way [see, for instance, Karplus, Sommerfield, 
and Wichmann,® Eq. (9) ]. If s is complex, it is not 
difficult to see that the denominator of the integrand 
never vanishes over the range of integration of the 
parameters. This is true whatever the values of thé 
eight masses in Fig. 2, as long as each particle is 
energetically stable, i.e., its mass must not be greater 
than the sum of the masses of the other two particles 
meeting it in one of the vertices. If in addition the 
condition for the existence of anomalous singularities is 
not met, the denominator also never vanishes if s is 
real and below the expected threshold. It then follows 
immediately that the integral is an analytic function of 
s with no singularities in the complex plane or below 
the expected threshold. A“: can thus be written in 
the form 
1 7” ImA (4:1) (5’ fa) 
AD (5 104) = f ds’ ——— 


> m . 
WH (2M)? 5S —S 


(¢real and <0). (3.2) 


7 Karplus, Sommerfield, and Wichmann, Phys. Rev. 111, 1187 
(1958). 

8 Y. Nambu, Nuovo cimento 9, 610 (1958). 

*R. Oehme, Phys. Rev. 111, 1430 (1958). 
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It should be noted that not all dispersion relations 
can be proved in fourth order with such ease. The 
dispersion relation for A“: with instead of ¢ fixed, for 
instance, does not follow from this type of reasoning. 
Grisaru!”’ has examined a comparable case for nucleon- 
nucleon scattering. We shall not look at such cases 
directly, as all the ordinary dispersion relations follow 
from the double dispersion representation. 


Calculation of Im(A) 


Returning to consider the imaginary part of (3.1) 
which must be inserted into (3.2), we proceed along the 
lines of reference 10. We take (3.1) in the center-of-mass 
system of the particles, with the momentum transfer 
in the y direction and the z direction perpendicular to 
the scattering plane. As we have pointed out, part of the 
range of integration in (3.2) is in the unphysical region 
and represents scattering through an angle whose 
cosine is less than — 1. In this region, the x-components 
of p: and ps3, which occur in the first and third factors of 
(3.1), will be purely imaginary. However, the imaginary 
components occur only in the form (f;,—q,)? and 
(psz— xz)”, and the remainder of the integrand is an 
even function of g,. On integrating over g, from — © to 
co, therefore, the imaginary terms arising from this 
source cancel out. 

Each of the four factors of (3.1) really consists of the 
sum of a principal part and an imaginary 6 function. It 
might therefore naively be supposed that, if these two 
parts were separated and the resulting expression 
multiplied out, those terms containing an even number 
of 6 functions would give the imaginary part. The 
integral over the product of the four principal parts can 
easily be shown to vanish. The product of the four 
6 functions also vanishes in our case, as it will be seen 
below that their arguments can never be zero simul- 
taneously. We are left with all possible terms which are 
products of two principal parts and two 6 functions. By 
looking more carefully at the integral, we find that this 
procedure is not correct and must be modified as 
follows: write each factor in (3.1) in the form 


1 r 


9 


(qo— po)?— (q—p)’— Mie (qo— po)?— (q—p)?- M? 


in 
— - 5{go- potL(q—p)?— M?— ie |} 
£40 


in 
aor 5{go— po—L(q—p)?— M?—ie}*}. (3.3) 
<qo 


Expand out the product, and examine the terms 
containing two principal parts and two 6 functions. If, 
in a particular term, the arguments of the two 6 func- 
tions have infinitesimal imaginary parts of opposite 


10M. T. Grisaru, Phys. Rev. 111, 1719 (1958). 
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sign, multiply the term by two. If they have imaginary 
parts of the same sign, do not include the term. 

We have therefore to investigate what products of 6 
functions from two different factors of (3.1) can be 
nonzero, i.e., can have arguments which are simul- 
taneously zero. The argument of the 6 function is zero 
when the four-momentum of the corresponding internal 
line in Fig. 2 is on the mass shell. Two adjacent internal 
lines cannot be on the mass shell simultaneously, as 
this would imply that one of the vertices had three real 
lines, which is impossible, even if the ’s can have their 
first component purely imaginary. Further, the two 
opposite lines / and F cannot be simultaneously on the 
mass shell and have 6 functions which contribute to 
ImA “:), as, assuming this to be the case, we can easily 
derive a contradiction. The arguments of the 6 functions 
are 

ee pitgo+l (pita 2+ M?— ie |}, 
and 


psot+ go+[ (ps+q)?+ M?— ie |. 


If they are to be zero and have imaginary parts of 
opposite sign, the two fourth components pio— qo and 
psot+go must have the same sign. As the two lines are 
both on the mass shell and therefore timelike, and the 
fourth components of their momenta have the same 
sign, the sum of their momenta must be time-like. 
However, the sum of their momenta is equal to ¢ and 
is therefore, by assumption, space-like. It is therefore 
impossible for them to be simultaneously on the mass 
shell and to have 6 functions which contribute to 
ImA“, 

We are left with the case where G and H are simul- 
taneously on the mass shell. The arguments of the two 6 
functions are then 


go— (pit po)o+(G?+M?—ie)}, (since pitpe=0) 


and 
go+ (q?-+ M?—1€)?. 


If their imaginary parts are to have opposite sign, the 
energies of the lines G and H, equal to (pi+p2)o— qo 
and go, must have the same sign, and must therefore be 
positive, as the incoming pion-nucleon state has 
positive energy. On combining the 6 functions with the 
principal part of the other two factors, and multiplying 
by @ functions to restrict the signs of go and (p1+ p2)o— Yo 
to be positive, we obtain for ImA “' 


. 1 
ImA 4:1) = foo 
8x? {(pi—q)’— M*}{ (ps+q)?-M} 


X4( go) Af (pi + po)o — Joyo q° _ M?) 


X5{ (pit pe—g)?—M*}, [s>(2M)*]. (3.4) 
The arguments of the 6 functions vanish when the 


internal particles G and H are on the mass shell, and we 
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have seen that they must have positive energies. We 
may therefore consider G and H to form a real inter- 
mediate state. As four-momentum is conserved at the 
vertices of a Feynman diagram, and the initial and 
final states are taken to be in their center-of-mass 
systems, this intermediate state is also in its center-of- 
mass system. The two integrations in (3.4) remaining 
after the 6 functions have been taken into account 
reduce to integrations over the direction of the particles 
in the intermediate state, which we shall denote by 
n;. There will be a factor from the 6 functions equal to 
1/(4qw), where g is the center-of-mass momentum and 
w the center-of-mass energy (equal to s!). The term 
(pi:—q)? in the denominator of (3.4) is now just ¢;., the 
square of the momentum transfer between the incoming 
particle A and the intermediate particle G. (p3+4)? is 
similarly ¢;., the square of the momentum transfer 
between the intermediate particle G and the final 
particle C. Thus (3.4) takes the form 


g'g 1 Qn 
ImA 4.) =— — f def do 
32nd» 4% = [292(s:e—1) - M2] 2g°[ 221+ (1—2?)(1—3,2) cosp—1]— M2} 


The integral can be evaluated and, after transferring 
from z to ¢ by (3.6) and expressing g in terms of s by the 
formula 

g’= (s—4M?)/4, (3.9) 


we find that 


1 a(s,t)+(q/s')[x(s,0) }! 
In— < ’ 
L6m{x(s,)}! — a(s,t)—(g/s*)[x(s,t) }! 
[s>(2M)*] 


ImA 41) = 


(3.10) 


=0, [s<(2M)*] 


(3.11) 
(3.12) 


«(s,t) = 4sf[ st—4M2(s+t)+12M*], 
a(s,t)=st—2M*s—4M%+6M4. 


Expression of A as a Double Dispersion Integral 


It now follows at once that ImA“:”) satisfies a 
dispersion relation of the form (2.8). The right-hand 
side of (3.10) is an analytic function of ¢ in the complex 
t-plane, except for a cut along that portion of the real 
axis where x and ¢ are both positive. The discontinuity 
across the cut is —1/{4[x«(s,f) ]4}. Hence we may write 


1 A,3)(s,t’) 
ImA “@ H(s44u)=- f de - 


T {'-t 


(3.13) 


where 
A33(s,t)=—1, {8| x(s,t) }*}, 
xk>0, 
=() otherwise. 


t>0, s>(2M)? (3.14) 


Our final step is to substitute (3.13), which has the 


MANDELSTAM 


mg 1 
32r%w (tie— M2) (t,.— M2) 


[s>(2M)2]. (3.5) 


In order to perform the integrations, ¢;, and ¢;. must 
be expressed in terms of the cosines z;- and 2; of the 
corresponding angles by the relations: 

tie( io) = 24?(2ie( io) — 1). (3.6) 
Zio is given in terms of z;, and z, the cosine of the angle 
of scattering from the initial to the final state, by the 
relation 
Sio= 22iet (1—27)}(1—2,.7)! cos, (3.7) 
@ being the azimuthal angle between the plane of 
scattering and the direction n,, with the direction of the 
incoming particles taken as the pole. On substituting 
the expressions for ti, tio, and 2. into (3.5), and 
rewriting the integral over n; as an integral over its 
co-ordinates 2;, and ¢, we obtain 


1 
(3.8) 


form of the dispersion relation (2.8), into (3.2), to give 


1 Ay; (s',t') 
A) (5 11) =- fovae - : 
r (s’—s)(t'—1) 


We have thus verified that 4“:) can be written in the 
form of the representation (2.7), with only the first 
term contributing. The boundary of the region in which 
A; is nonzero is given by 


st—4M?(s+t)+12M‘=0. 


(3.15) 


(3.16) 


Equation (3.16) has the form of the curve C of Fig. 1, 
and approaches asymptotically the lines s=4M? and 
t=4M?. 

Equation (3.13) has only been established for ¢ real 
and negative. We can however define an analytic 
continuation of A for all values of ¢ by this equation. 
It is then necessary to verify that A is given correctly in 
the physical region for reaction III, where ¢ is positive, 
by (3.13). We could do this directly by repeating the 
same procedure with Fig. 2 turned through 90°, it is 
then found that A“: is given by (3.15) as long as s 
is real and negative. This, however, is not really 
necessary, as the Feynman integral (3.1) and the 
expression (3.15) are both analytic functions of s and 
t, so that, if they are equal in a region, they are equal 
everywhere as long as we are on the same branch. Now, 
on passing from the region in which / is real and negative 
to the physical region for the reaction III, the only 
branch point of (3.15) which has to be crossed is at 
!=4M?, the threshold for this reaction. The sign of the 
small imaginary part to be included in the second 
denominator of (3.15) will depend on the manner in 
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which we go round this point, and it is not difficult 
to verify that, corresponding to the small negative 
imaginary parts in the denominators of the Feynman- 
Dyson integral, we must maintain our convention and 
insert a small negative imaginary part here. The 
scattering amplitude is therefore given correctly by 
(3.13), with the usual convention concerning the 
imaginary part, in the physical region for the reaction 


The other fourth-order diagrams are obtained by 
interchanging some of the external lines in Fig. 2. 
This has the effect of interchanging a pair of the 
variables s, /, and u, and these diagrams will therefore 
correspond to the second or third terms of (2.7). The 
spectral functions will again be given by (3.14), with 
the appropriate change of variables. The property 
of fourth-order perturbation theory that one Feynman 
diagram corresponds to a single term of (2.7) is not 
maintained in higher orders. 


The General Mass Case 


We shall conclude this section by generalizing the 
procedure to the case where all eight masses are 
different. Instead of (3.5), we now have 


8'qi 1 
ImA 4) =- fan "y 
327*w (tie— M7") (tio— M3) 


[s> (Ms+M,)? | (3.17) 
gi denoting the center-of-mass momentum of the 
intermediate state. The equation connecting the 
momentum transfer and the cosine of the angle is, in the 
general case, 


lap = 2qa98%a8— Ja? — 98° 
+[(Ma?-+qa’)!— (Me?+-qe?)*P, 


where ga and gg are the center-of-mass momenta, and 
M, and Mg the masses, of the particles between which 
the momentum transfer is to be found. It is more 
convenient to work in terms of cosines of angles than 
of momentum. As /;, and ¢; are linearly related to zie 
and z,., with constants of proportionality g.qi and q.9i 
(qe and gq. being the center-of-mass momenta of the 
initial and final states), Eq. (3.17) may be written 


(3.18) 


g' 
ImA 4.1) =—— 
3299 -GogiW 


(3.19) 


, 


Ud 
io %io ) 


where z;-. and z;,,’ are the cosines of the (complex) 
angles corresponding to momentum transfers t;.=M;’ 
and t;-= M;?. According to (3.8), they will be given by 


til =(ge-+ge+M? 


—[(M2+97)'—(Mse+q?) P}/(2qig-), (3.20a) 


IN 
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$i = {go+ ge+ Me 
—[(M?+9.")!— (M+q?)'P}/(2qigo), (3.20) 


and there will be a corresponding equation connecting 
z with ¢: 


s={qe+q?+t 
—C(M3+ 92)'— (MP+q)*P}/(2qeqo). 
The formula connecting g with s is 


g=[s?—2s(M2+M3)+(M2—M)?Y (45), 


(3.20c) 


(3.20d) 


where M, and M, are the masses of the particles in the 
state in question (M, and Mz for the initial state, Ms 
and M, for the intermediate state, and M3; and M, for 
the final state). 

KOn expressing z;, by (3.7) and performing the 
integration over 2;, and ¢, we obtain the formula 


. g I 
ImA 4.4) 


16mq.qogiW {R(2,2:0' 2:0) }3 


ues ‘Bio +LR(S,2:¢' 2:0’) J} 
¢In—- - 


Seer % 4) 3) 
S— Bie Bio —[h(2,8i0' Bio’) }! 


where 


et iD) 12 f ee 
R(2,2%¢ 210 )=S +2Zie +20 —1- 222:¢ Zio + 


(3.22) 


The right-hand side of (3.21) is an analytic function of 
z, except for a cut running from the value 2;, where 


Z1= Zie Zio + (Sie ?— 1)4(z;,/2—1)}, (3.23) 


to infinity. The discontinuity across the cut is 
—1/{4R(z,2:-',2i0’) ]*}, so that (3.21) may be written 
g 
ImA “4! 

Sq eGo W 


i" 1 1 
xf dz’ ’ 
a. {R(2' Sie’ Zio) 4 2’ —3 


where the lower limit 2; is given by (3.23). 

Since ¢ is linearly related to z by (3.20c), Eq. (3.24) 
may be rewritten as a dispersion integral in ¢ instead of z. 
As in the simple case, we can then substitute this 
dispersion relation into (3.2) and obtain (3.15), with 
A,;“ now given by the equation 


(3.24) 


1 
ola ‘ (23> 23) 


SgeqgogiW { k(z,5:' 2:0) 4 


0. (s<2). (3.25) 


z, is again the value of z in (3.23). The curve C of Fig. 1, 
which bounds the region in which A,;“ is nonzero, is 
thus given by (3.23). On inserting the values of z;.’, 
zi and z from (3.20) into (3.25) and (3.23), and 
expressing qe, Jo, gi and W in terms of s and the masses, 
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Fic. 3. Definition of the mo- 
menta used in Eq. (3.26). 











we can obtain A,;“?, and the curve C which bounds the 
region in which it is nonzero, in terms of s and ¢. 

For most cases of practical interest, in which not all 
the masses are different, it is probably more convenient 
to work out (3.20) with the masses involved, and then 
to substitute into (3.25) and (3.23), rather than to use 
these formulas expressed in terms of s and ¢ and general 
masses M,---Ms. However, for the general case, the 
result of the substitution assumes a neat form in the 
notation of reference 6, so we give it here. The masses 
and momenta are now re-defined according to Fig. 3, 
together with the equations 


pis= piot pos, 
pos= post pss. 
We further define the variables" 


pecnon’ 
‘ , (3.27) 


xij= (pi?—m?—m?)/(2mmj;), (i,j=1---4). 


Apart from proportionality factors and constant terms, 
the variables x12, 23, %34, and 24 correspond to the 


external masses, X24 corresponds to s and x; to /. 
Equation (3.25) then reduces to the formula” 


1 
A,;" = or 
8m ymomym,4(Detx)? 


and the region in which it is nonzero is that portion of 
the %13— 24 plane which lies to the right and above the 
curve: 

Xu4> 1. (3.29) 


Det x=0, x3>1, 


If the condition for anomalous singularities is fulfilled, 
the inequalities in (3.29) have to be modified. As long 
as this is not the case, the equation, when expressed 
in terms of s and ¢, has the form of the curve C in Fig. 1. 


4. FOURTH-ORDER TERM WITH 
ANOMALOUS THRESHOLDS 


We have assumed thus far that the masses of the 
particles were such that no anomalous thresholds of 


11 Tn order to have no reversal in sign between our new variables 
and s and ¢, we have defined the x’s to be opposite in sign to the 
y's of reference 8. 

2 This can most easily be shown by a method due to T. W. B. 
Kibble (to be published.). 
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the type discussed by Karplus, Sommerfield, and 
Wichmann* occur. In that case the curve C of Fig. 1 is 
always bounded by its asymptotes as shown, and the 
representation (2.7) has its simple form. We now wish 
to discuss what happens when there are anomalous 
thresholds. No detailed discussion will be given, but 
we shall merely indicate in what way the representation 
must be modified. 

We again use the variables (3.27). x12, 223, X24, and 
414 correspond to the four external masses, «24 to s, and 
*13 to ¢. The normal thresholds for the reactions I and 
III are X4= A, *13> l. 

If the region in which Aj; is nonzero is to extend out 
of the region bounded by its asymptotes, there must be 
some point of horizontal or vertical tangency on the 
curve bounding it. The equation of this curve is given 
by (3.29), and it is shown in reference 6 that the 
equations for the horizontal or vertical tangents are 
(if x24 is plotted horizontally, x; vertically) : 


K.=0, or K,4=0, 
K;=0, 


(4.1a.) 
(4.1b) 


Horizontal tangents 


Vertical tangents K,=0, or 


where 


K,=Detx; 1,j=1---4; i,j7¥r. (4.2) 


(4.1a) is the equation for the possible anomalous 
thresholds which Karplus ef al.® find for the reaction 
III, (4.1b) the equation for the possible anomalous 
thresholds in the reaction I. We note that K,=0 is the 
equation for the anomalous threshold of the vertex 
part obtained by contracting the line m, (Fig. 3) out of 
the diagram. 

If (4.1a) is solved for aj, it is found that the solutions, 
which we shall denote by LZ, and L4, are always between 
1 and —1. If one or more of the masses is varied so that 
the condition for an anomalous threshold of the reaction 
III is approached, the solution of one of Eqs. (4.1a) 
approaches 1; let us suppose it is 2. It reaches 1 when 


A14+634=7, (4.3) 





























Fic. 4. Behavior of the spectral function when 
there are anomalous thresholds. 
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where 


6;;= cos! (—4,;). (4.4) 


It then decreases again from 1, and now represents an 
anomalous threshold. Meanwhile, Zs increases towards 
1, and the two solutions will cross at the point where 


912 +623+654+614= 2a 


If (4.1b) is solved for «24, the solutions will behave in 
the same manner. As (4.5) is symmetric in the indices, 
the solutions of the two equations (4.1a), and those of 
the two equations (4.1b), will cross simultaneously. 
Beyond this point, we have the threshold behavior of 
case (iii) of reference 6. 

Let us now study how the spectral functions behave 
in these cases. We have seen that, in the normal case, 
the curve bounding the region in which Aj; is nonzero 
has the form of Fig. 4(a). As long as a horizontal or 
vertical tangent does not meet one of the asymptotes, 
it must continue to have this form. 

Figure 4(b) indicates the behavior when Ly» has 
reached, and moved away from, the line 4;;=1. The 
region in which Aj; is nonzero is now not bounded by 
the asymptotes, but the scattering amplitude can still be 
written in the form (3.15). This may be verified by 
analytic continuation in the masses from the normal 
case. A;, given by (2.10), is now nonzero if «3 is 
between ZL» and 1. In other words, we have a typical 
anomalous threshold. 

When Ls and Z,; are both anomalous thresholds, the 
situation is as indicated in Fig. 4(c). We have shown 
in addition the lines x1;3= 14, %24= 3. The curve drawn 
below and to the left of these lines is also given by 
(3.29), but it does not yet play a part in the analytical 
structure of the scattering amplitude. 

At the point given by (4.5), we have the situation 
indicated in Fig. 4(d). All the points of tangency now 
coincide—if this were not the case, the equation (3.29), 
if solved for x04 when x13= 2, would have two double 
rvots, which is impossible as it only involves x24 quad- 


(4.5) 
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ratically. The scattering amplitude is still given by 
(3.15), and the curves below and to the left of the lines 
¥13= L4, X24= Ls play no part. 

When the sum of the angles in (4.5) becomes greater 
than 27, we again have the situation in Fig. 4(c), 
except that Ly and L4, and L, and Ls, are interchanged. 
However, if we assume that (3.15) is true before we 
reach this condition, and continue it analytically past 
the point where the situation indicated in Fig. 4(d) 
holds, the analytic continuation will no longer be of 
this form (3.15). The scattering amplitude now has a 
nonzero imaginary part in the area to the right and 
above ABCD, That part of the curve given by (3.29) 
between B and C plays the role, therefore, of an 
anomalous threshold which depends on all the variables 
and has no counterpart in the vertex function. This has 
previously been shown by Karplus, Sommerfield, and 
Wichmann® by examining the parametrized Feynman 
integral. For values of 24 between L; and L3, the 
absorptive part A, will have a similar analytic form, 
as a function of x13, to the normal case, [Eq. (3.10), 
modified for nonequality of the masses | and its singu- 
larities will again be given by (3.29). Now, however, the 
solution of this equation will yield complex values of 
x13 (for real x24), so that 4; cannot be written as a 
dispersion integral in x3. 

It is thus evident that, once the sum of the angles 
612, 83, O54 and 6)4 is greater than 27, the representation 
(2.7) will no longer hold. One can still construct the 
fourth-order term by analytic continuation in the 
external masses from the normal case. 
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The analyticity properties of transition amplitudes are used in conjunction with the unitarity requirements 
to generate successive terms in the perturbation series, without referring to a specific Lagrangian. In the 
sixth and higher orders, production is neglected in the unitarity condition; subject to this approximation, 
it is found that the series can be so constructed. No analyticity properties which have not been rigorously 
proved need be employed, and the terms are found to satisfy the double dispersion representation. By 
examining the connection between this method and the conventional calculation of the perturbation series, 
the types of spectral function corresponding to different Feynman diagrams can be found. Formulas are 
given for the regions in which the spectral functions are nonzero. 


1. INTRODUCTION 


QUESTION of great current interest in quantum 

field theory is whether the dispersion relations, 
together with unitarity, can be used to calculate 
measurable quantities in terms of a small number of 
coupling constants and masses, without the introduction 
of a specific Lagrangian.’ An approximation method for 
a calculation based on such principles has been outlined 
by the author,? but would meanwhile be of interest to 
see if one could generate the perturbation series term 
by term in this way. One can obtain much more definite 
answers in a perturbation than in a nonperturbation 
approach, and the results obtained should give one 
insight into the problem which may prove useful in a 
more general treatment. 

We shall begin in the following section by considering 
the fourth-order terms. Previous attacks on the problem 
based on ordinary dispersion relations, have suffered 
from the difficulty that the result contained, as a 
“subtraction term,’ an unknown function of the 
momentum transfer. In the present paper, however, 
we shall put in more than the ordinary dispersion 
relations, and shall assume analytic properties of the 
transition amplitude as a function of both the energy 
and the momentum transfer. A representation which 
expresses these properties has been postulated by the 
author’? and discussed in the previous paper’; if this 
representation is assumed to be correct, the fourth- 
order terms can be calculated in a form free from 
arbitrary functions. There will be an arbitrary constant 
only when all four particles are scalar. By comparing 
the new to the conventional calculations we shall be 
able to relate terms calculated by unitarity to partic- 
ular Feynman diagrams. We shall also be able to see, 


* This research was supported by the U. S. Air Force under a 
contract monitored by the Air Force Office of Scientific Research 
of the Air Research and Development Command. 

1 This was first suggested by M. Gell-Mann, Proceedings of 
the Sixth Annual Rochester Conference on High-Energy Physics, 
1956 (Interscience Publishers, Inc., New York), Sec. III, p. 30. 

2S. Mandelstam, Phys. Rev. 112, 1344 (1958). 

5S. Mandelstam, preceding paper [Phys. Rev. 115, 1741 
(1959) ] 


without explicit calculation, which of the two-dimen- 
sional spectral functions corresponds to a given diagram, 
and to find the asymptotes of the region in which the 
spectral function is nonzero. 

Though the calculation of the perturbation terms 
is most straightforward if the double dispersion 
representation is assumed to be correct, it is not 
necessary to assume as much as is implied by it. We 
shall, in fact, show, by re-examining our calculation, 
that it is sufficient to assume analyticity properties 
that have been proved rigorously by Lehmann,‘ 
Kallén and Wightman,‘ and others. The result is then 
found to satisfy the double dispersion representation. 
We thus have another proof that the representation is 
true in fourth order. 

In Sec. 3 we shall extend the calculation to higher 
orders. The unitarity condition would then involve 
production, so that the analyticity properties of 
multiparticles transition amplitudes would be required. 
If, however, one neglects the production terms in the 
unitarity condition, one can construct the perturbation 
series term by term to any order. As in the fourth-order 
case, it is unnecessary to assume any unproved analytic 
properties, and the result is found to satisfy the double 
dispersion relation in all orders. 

The calculations of this and the preceding paper 
enable us to specify in more detail certain features of the 
double dispersion relation that were previously left 
undetermined. More specifically, the boundaries of the 
regions in which the spectral functions are nonzero 
can only be obtained with the aid of the unitarity 
condition. In Sec. 4 the equations of the boundaries 
are given for some frequently occurring cases, and the 
type of diagram corresponding to each spectral function 
is indicated. 

A few final remarks are added in the last section with 
regard to the general validity of the double dispersion 
representation. 


4H. Lehmann, Nuovo cimento 10, 579 (1958). 

5G. Kallén and A. S. Wightman, Mat. Fys. Skrifter Danske 
Videnskab. Selskab 1, No. 6 (1958). Only the general reasoning in 
Sec. III of their paper will be required, and this section is appli- 
cable to all Green’s functions. 


1752 





CONSTRUCTION OF 


2. FOURTH-ORDER TERMS 
Kinematical Preliminaries 


The kinematical notation will be the same as in the 
previous paper. The momenta for the four particles 
A, B, C, and D are represented by 1, po, ps3 and py, 
(Fig. 1), we then define the three invariants 

s= (pit po)’, 
u=(pitpu)’, 
t= (pit ps)”. 


(2.1) 


These are, respectively, the squares of the energies for 
the reactions 

A+B—C+D, (I) 

A+D— B+C, (ID 

A+C— B+D. (Ii) 


They are related by the equation 


sti+u=M/’+M?7+M3+M¢. (2.2) 


We continue to adopt the convention that, in an expres- 
sion written as A(s,t,4,), s and ¢ are regarded as in- 
dependent and u is to be expressed in terms of them by 
(2.2). Expressions written as A (s,f1,#) or A (s1,t,u) are 
to be similarly understood. For future reference, we 
may quote one or two kinematical relations. The 
square of the momentum transfer ¢ between two 
particles is connected with the cosine 2 of the angle of 
scattering by the equation 


t= 2gagat—qa?—4e°+[ (Ma?-+ ge?) !+ (Mp?-+ qs)", (2.3a) 


where ga and gs, M,, and Ms are the center-of-mass 
momenta and masses of the particles involved. If all 
masses are equal, the formula simplifies to 


t= 2g?(2—1). (2.3b) 


The momentum gq is given in terms of the corresponding 
square of the energy by the formula 


g=([s?—2s(M 2+ M.*)+ (M?— M7)? ]/4s,  (2.4a) 


where M, and Mz are the masses of the two particles 
forming the state. If 4/;=M2, the formula is simply 


g?=45—M?. (2.4b) 





Fic. 1. Kinematics for the 
reactions. 
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On the assumption that the transition amplitude 
behaves like a constant at infinity, the double dispersion 
relation is 


(s—so)(t—lo) A is(s’,t’) 
A= wile f PT : 
rr (s’— 50) (s’—s) (t’—to) ('—2) 


(t—to) (u— uo) 
i eens ee ~f dt'dw’ 
7 


Ag; (U',u’) (s— so) (u&— to) 


('—to) (t’— 2) (’ — up) (w’ — u) T 


Ao(s’,u’) 
xf ds'du)———__—______—__—_ 
(s’— 50) (s’—s) (1 — uo) (u’ — 2) 


Fe 


(s’— so) (s" ae 


U— Uy fo(u’) 
+----- f du’ - 
ig (10 — 149) (’ — 4) 


ilo fs(U’) 
san far - =A. (2.5) 
T (t’—to) (t'—1) 


The scattering amplitude will satisfy dispersion relations 
(if we neglect subtraction terms) 


1 A,(s’,t,u:) 1 A,(s,,1,u’) 
sicher, tp Acie 
ss 1 u'—U 
1 A,(s',ti,u) 1 A3(51,t’,u) 
A= -f ds'—————_+ -f dt'———_—_, 
T s'-s ra '—t 
1 A4(s,t,0’) 1 A;(s,t’ ,u;) 
A= f du’——— + f dt'—— : 
ra u'—U T U-t 


The corresponding dispersion relations for the absorp- 


(2.6a) 
(2.6b) 


(2.6c) 


tive parts Ay, Az and A;, associated with the reactions 


I, I, and IIT, will be 


1 A 13(5,t’) 1 A 12(s,u’) 
f f+ J a, 
Ut T u'—U 
Ao3(t',w) 1 A j2(s’,) 
‘ + fo ’ 
—t T s'—s5 


1 A,3(s’,t) 1 A o3(t,u’) 
A 3=— J ds’ + f du’ a 
1 s'-s 7 u'—U 


A,=- (2.7b) 
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or, with subtractions, 


I—ly A43(s,t’) 
Ay fils) + fe 
T (t'—to) (t'!—2) 


Uu—Uy Ajo(s,u’) 
+ J du’ : 
( 


tg u’ — uy) (u' — 1) 


j {—lo A o3(t',u) 
As= fo(u)+ fu 
T (t!—t)(t/—1) 
S—So A yo(s'u) 
+ fos s 
7 (s’—59)(s’—s) 
§-- Se A43(s’,t) 
fa 
T (s’—sy)(s’—s) 


U— Kh Ao3(tu’) : 
fw . AZT 
rg (1’ — uo) (u’ — 4) 


According to the procedure outlined in the introduction, 
we shall begin by assuming all these formulas to be 
true and shall then return to re-examine the assumptions 
which it is really necessary to introduce. 


(2.7d) 


Calculation without Subtraction Terms 


As in the last paper, the calculation will initially 
be done with all the masses equal and with scalar 
particles. Besides reducing the amount of algebra, 
this second simplification has the effect that all disper- 
sion relations can be written down without subtraction 
terms, so that the functions satisfying them can be 
determined from their imaginary parts along the real 
axis. We shall return later to consider how the calcula- 
tion can be carried out when there are subtraction terms. 

Our method of approach will be to determine the 
absorptive parts by unitarity and to bring them into 
the form of the dispersion relations (2.7). The spectral 
functions Aj43, A»3;, and Ay to be inserted into the 
fourth-order expression for (2.5) are thereby deter- 
mined. If states with more than two particles are 
neglected, as they may be in fourth order, the unitarity 
equation for the absorptive part A, takes the form 


A,{s,t(z),u} =[gq, (3240) ] fda hse) 
X A sof 5,t(i0) 11}. ( 


A,; and Aj, are transition amplitudes from the initial 
state of the reaction I to a two-particle intermediate 
state, and from the intermediate state to the final 
state. There are similar equations for A» and A3. 
n,; denotes a unit vector, z the cosine of the angle of 

® We do not assume that the particles taking part in the reaction 


are necessarily identical, so that A,; and Aj may be different 
from A. 
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scattering, 2;- the cosine of the angle between the 
direction of the incoming particles and n,, and 2; the 
cosine of the angle between the direction of the out- 
going particles and n;. The expression A {s,t(z),m} 
indicates that / is to be expressed as a function of 2 
by (2.3), so that A; is a function of s and z. 3j is 
connected with z;, and ¢, the azimuthal angle between 
n, and the plane of scattering, by the formula 


Zio= 22ie+ (1—27)'(1—32,.7)! cos. (2.9) 


Since we require A; in fourth-order perturbation 
theory, it will be sufficient to take the second-order 
terms for A* and A. As an example, we shall suppose 
that 


A .,O*= g?/(tie—M"), Aio=8"/(tio—M"). (2.10) 


We exclude for the moment the case where there may 
be terms of the form g*/(s—M7?) or constant terms, 
since all dispersion relation would not then be free of 
subtractions. As we shall see later, the terms excluded 
correspond to reducible graphs in the scattering ampli- 
tude, which we know to have the form of single dis- 
persion integrals. 

The expressions for A,; and A, in (2.10) have now 
to be substituted into (2.8). When this is done, however, 
we arrive precisely at Eq. (3.5) of the previous paper. 
The expression can be evaluated by expressing 2,5 in 
terms of 2 and z;, by (2.9), integrating over ¢ and 3;, 
(which is the same as integrating over n;), and finally 
expressing z in terms of ¢ by (2.3b). We then find that 
the resulting value of A; can be written as a dispersion 
integral (2.7a) and that the spectral function is given by 


Ay3 (8,0) = —1/{8[x(s,0) }'}, 
k>0, (>0, and s>4M?, 


0 otherwise, 


(2.11) 


where 


«(s,t) = 4st st—4M?(s+1)+12M*]. (2.12) 


We omit the details of calculation, which can be found 
in the portion of the previous paper following Eq. (3.5). 
If A,,?* or Ayo in (2.10) contains a term g? 
(u—M*), we need not repeat the calculation of the 
corresponding term in A,, for we can find the result 
from the calculation already performed by making use 
of the symmetry of the problem. On interchanging the 


Fic. 2. Fourth-order dia- 
gram for the reaction A+B 
»>C+D 
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two particles in one of the states, initial, intermediate 
or final, we make two of the interchanges ( <> 4, t;, © U;-, 
lio Uio. We should therefore expect the results to be 
unchanged by such a transformation. Hence, if both 
t;, and fj. in (2.10) are replaced by u;. and u;, we should 
get the same contribution to A,; as before. If only one 
of these replacements is made, / will also have to be 
replaced by u; the term therefore contributes to Aj». 
instead of Aj3,’ and the contribution is obtained by 
substituting « for ¢ in (2.11). 

By doing analogous unitarity calculations for A» and 
A;, one can obtain expressions for A»; and Ay, and 
Aj; and A9;, respectively. Each of the spectral functions 
can therefore be calculated from either of two unitarity 
equations. These calculations must give the same result, 
otherwise there is an inconsistency in the theory or in 
the assumptions made. We observe that (2.11) is a 
symmetric function of s and /, so that the same result 
would indeed be obtained whether one used the unitarity 
equation for A, or for A;. It is necessary to use only two 
of the three unitarity equations in calculating the 
spectral functions; the third is, in this case, completely 
redundant. 

It has previously been pointed out by Grisaru® that 
the imaginary part of a transition amplitude, calculated 
by conventional methods, is equal to the corresponding 
absorptive part calculated by unitarity. As we shall 
make further use of the unitarity condition in the next 
section, we have written it out explicitly. 


Relation between the Two Methods 


We shall now examine again the connection between 
the present and the conventional method of calculating 
perturbation terms, and we shall see how we can relate 
a term calculated in this way to a particular Feynman 
diagram. By doing so, we shall gain more insight into 
such questions as to whether a unitarity calculation of a 
particular term in A, say, will yield a spectral function 
Aj; or Ayo, and what asymptotes the region in which it 
is nonzero will approach as either variable becomes 
infinite. In the simple case we have been considering, 
the answer to this last question is more or less obvious, 
as the only threshold is at 4M?, but in more complicated 
cases where there may be several thresholds the answer 
is not quite so clear. 

We saw in the previous paper that the most conven- 
ient way, for our purposes, of evaluating the conven- 
tional integral corresponding to Fig. 2 was to prove 
that it satisfied a dispersion relation (2.6a). It was then 
shown that the imaginary part in the physical region 
for the reaction I could be calculated by regarding G 
and # as a real intermediate state for the reaction I, 


7 Either of the dispersion integrals in (2.7a) could be written 

in terms of ¢ and u, but, once a function has been brought to this 

form, we can decide whether the spectral function is A}; or A12 

from the range of values of the primed variable. Aj; involves 

positive ¢’ (and negative «’), Ai2 positive w’ (and negative ?’). 
8M. T. Grisaru, Phys. Rev. 111, 1719 (1958). 
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and performing an integration over the variables 
necessary to specify that intermediate state. In the 
calculation from dispersion relations and unitarity it 
was found that the absorptive part could be written 
as an integral over variables corresponding to a real 
intermediate state through which the scattering could 
take place. The integrand was found to be the same as 
in the first method of calculation—it consisted of two 
factors, representing the second-order contribution for 
the scattering from the initial to the intermediate, and 
the intermediate to the final state respectively. In 
Fig. 2, the left and right half of the diagram represent 
these two factors. 

However, we can equally well do the evaluation of 
the Feynman-Dyson integral for Fig. 2 by writing it 
as a dispersion relation (2.6c). We then have to calculate 
the imaginary part in the physical region for the 
reaction III, and we find that we must now regard the 
particles £ and F as a real intermediate state. Alterna- 
tively, if we are doing the calculation from dispersion 
relations and unitarity, we can calculate the absorptive 
part A; for the third reaction. The first factor of the 
equation corresponding to (2.8) will then be the 
transition amplitude for the process A+C— E+F, 
and the second the amplitude for the process E+F 
— B+D. We observe that these terms can be represented 
by the bottom and top halves of Fig. 2, connected by 
the real lines E and F. Thus, as before, the present and 
conventional methods of calculation correspond very 
closely, and give the same result for 413". 

From the two methods of evaluating the Feynman- 
Dyson integral for Fig. 2, we observe that it gives rise 
to a contribution to A; with threshold at 4M? and a 
contribution to A; with threshold at 44/*. There is no 
corresponding way of dividing up Fig. 2 so as to 
represent the reaction II going through a real inter- 
mediate state, and the calculation will give no contribu- 
tion to Ao. The term in the scattering amplitude 
corresponding to Fig. 2 therefore yields a spectral 
function which provides contributions to A; and A; 
but not A»; according to (2.7), this spectral function 
must be A;;. Also, the thresholds for A,“ and A; 
are s=4M? and (=4M?, so that we should expect the 
region in which A;;“ is non-zero to approach these 
lines asymptotically. 

It is thus a simple matter to identify the Feynman 
diagram corresponding to a term of the transition 
amplitude calculated by analyticity and unitarity, 
and, having identified the diagram, to determine which 
of the spectral functions Aj;, A23, and Ay. it yields and 
to find the asymptotes of the region in which the 
spectral function is nonzero. The Feynman diagram is 
obtained from the diagrams for A,,* and Aj which 
have been used in (2.8) by joining the lines representing 
the intermediate state. The diagram can then be 
divided up in the way it was constructed, and perhaps 
in alternative ways, to represent one of the reactions 
going through a real intermediate state. Corresponding 
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to each of these methods of division there will be a 
contribution to the absorptive part A;, A2, or A; of the 
reaction concerned, beginning at a threshold given by 
the lowest mass of the intermediate state. One can 
thereby determine which of the spectral functions A}, 
A 3, or Aj is involved, as well as the asymptotes of the 
region in which it is nonzero. 

The analysis of this section can also be carried out 
if the masses are unrestricted. Suppose that M, and 
M2, M; and M,, Ms and Mg are the masses of the 
particles in the initial, final and intermediate states, 
and M, and Mg the masses in the denominators of 
(2.10) (Fig. 2). As in the conventional calculation, it is 
then easiest to work in terms of cosines of angles 
rather than in terms of momentum transfers. (2.10) is 
written in the form 


(geqi(Zie—Zie’) |, 
A io = g°/[ qigo(Zio— Zio ) J. 


A,?*=" 
(2.13) 


Qe, gi, and g, are the center-of-mass momenta of the 
initial, intermediate and final states, given by (2.4a). 
zie and 2,’ are the cosines of the angles associated with 
squares of the momentum transfer MV,’ and M,’ between 
the initial and intermediate, and the intermediate and 
final, states. They are given by Eq. (2.3a). Following 
the procedure of the equal-mass case, we now substitute 
(2.13) into (2.8), express z,;, by (2.9), and perform the 
integral over z;, and ¢. The result can be expressed as 
a dispersion integral in z (corresponding to a dispersion 
integral in 4), with weight function given by 


(3> 2) 


A 3 = —1/{8q qoqgiwl (2,5: Zio) }}, 
0, (s<2;) 


where 


g 8. |. Dg / / 


° o 
Hane M10 9 


+ (Bie'?—1)4(z,97—1)}. 


The point 2, is the value of z for which k=0. Details 
are given in the calculation following Eq. (3.19) of the 
previous paper. Though it is usually most convenient 
to use these formulas as they stand with the 2’s given 
in terms of our usual variables by (2.3) and (2.4), we 
could write the formulas directly in terms of our usual 
variables when they would take the form of Eqs. (3.28) 
and (3.29) of the previous paper. 

If one or both the energy denominators in the 
perturbation terms (2.10) had involved u instead of /, 
the same formulae would hold, provided we constructed 
the graph corresponding to Fig. 2 according to our 
prescription, and always defined < in the manner 
corresponding to the case just treated. As in the equal- 
mass case, the spectral function would be Ay. if one 
energy denominator of (2.10) involved /, the other u. 
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Calculation with Subtraction Terms 


Let us now deal with the complications which arise 
when not all the particles are scalar, so that there are 
subtraction terms in the dispersion relations. As an 
example, we may consider pion-pion scattering through 
virtual nucleons. The process would be represented by 
Fig. 2, in which the external lines are pions and the 
internal lines nucleons. We again begin with the 
unitarity equation for A, which will be of the type 
(2.8) but is more complicated owing to the spin of the 
intermediate state. We shall simply outline the calcula- 
tion to show how the subtraction terms may be found, 
leaving out the details of the algebra. The evaluation of 
the integral for A,“: proceeds in a manner analogous 
to the calculation without spin and gives a similar 
result. In this case, however, A;“:) remains finite as 
t tends to infinity, so that it satisfies a dispersion 
relation of the form 


Ai) (s,t,u1) =A 14 (5,t0,01) 


t—to 
+—f dt’ 
Ww 


where both A,@:?(s,to,u1) and A,;“(s,t) are known, as 
we have an explicit expression for A;“:"(s,t,u1). By 
comparing this equation with (2.7d) we observe that 
A,3 and f;(s) in the representation (2.5) are now 
determined, the latter just being equal to A,“:" (s,f0,m1). 

If, instead of using the unitarity equation for Ai, we 
had used the equation for A;, we would have found that 


A 13° (s,t’) 


(2.17) 


(U—t)('—0) 


A ““  (s,t,11) =A 3) (s0,t,u1) 


S— So Aj3(s’,0) 
eo 
7 (s’—50)(s’—s) 
The value of A;3 in (2.17) and (2.18) must be the 
same in a consistent theory. By comparing (2.18) 
with (2.7f), however, we see that we have in addition 
now determined f;“ (/), which is equal to A 3?) (so,t,u). 
The information provided by the unitarity equation 
for A; is therefore not quite redundant, as /;(¢) cannot 
be determined without it. In general, therefore, we 
can use either of two unitarity conditions to determine 
the spectral functions A13, A423, and Aj2, but we must 
use the unitarity condition for the relevant reaction to 
determine f(s), fo(#) and f;(t). In the particular case 
represented by Fig. 2, there is no real intermediate 
state for the reaction II, and fy is zero. 

The over-all subtraction term \ in (2.5) cannot be 
determined from the unitarity equation, but it is an 
extra constant which has to be inserted into the theory. 
Such a constant is also present in the conventional 
theory, where it corresponds to the renormalized value 
of \ in the A¢* term. We have already observed that the 
only instance in which there is an over-all subtraction 
term is in scattering of particles of spin zero, and this 


(2.18) 
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Fic. 3. Construction 
of reducible diagrams 
according to our pre- 
scription. 





corresponds to the result of renormalization theory 
that a Ad‘ interaction is unrenormalizable in all other 
instances. 

Before proceeding further we may return to the case 
where the perturbation expressions (2.10), which are 
to be inserted into the unitarity equation (2.8) for A1, 
contain a term g*/(s— MM?) or a constant term. If either 
or both of the terms inserted as A,; and Aj, into (2.8) 
are of this form, the integral will be independent of z. 
This may be seen immediately by expanding 4,,; and 
Aj, in spherical harmonics, or it may be verified by 
direct calculation. The contributions to A; of this type, 
which are easily evaluated, will thus be functions of s 
alone, and will represent additions to the function f,(s) 
in the single dispersion integrals. They do not affect 
the spectral functions 4;; and Aj». By looking at the 
Feynman diagrams which correspond to these terms 
according to our prescription, such as Fig. 3, we notice 
that they are the reducible graphs of the usual formal- 
ism. In the present method of calculation, however, 
the absorptive part is found and inserted intoa dis persion 
integral, so that any single or double poles which may 
occur at s=M? are omitted. We therefore obtain our 
results directly in renormalized form. 

In cases where some but not all of the dispersion 
relations have subtraction terms, it may be unnecessary 
to use both unitarity conditions for the irreducible 
diagrams. With pion-nucleon scattering, for instance, 
the appropriate double dispersion relation is given by 
Eq. (2.17) of the previous paper. The reducible diagrams 
will contribute to the functions a), a2, a3, b;, and bd», 
and the unitarity conditions for the three reactions must 
be used to evaluate them. The irreducible diagrams, 
however, will only contribute to a3. It is therefore 
sufficient to use the unitarity equation for A; to 
determine 413 and Ag;, since a; is thereby determined 
as well. If the unitarity equations for A; or A» had 
been used, a3 would have remained undetermined. 


Avoidance of Unproved Hypotheses 


It is now necessary to re-examine our calculation of 
the fourth-order terms in order to show how it may be 
carried through by using only analyticity properties of 
the scattering amplitude which have been shown 
rigorously to be true. We shall begin with the case 
where all the masses are equal, and shall again neglect 
subtraction terms for simplicity. 

The calculation of A; and A,“™, and the demon- 
stration that they could be brought into the form 
(2.7a) and (2.7b) with known spectral functions, made 
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use of the unitarity condition only. They are therefore 
justified, provided we restrict ourselves to the physical 
region in the unprimed variables. These expressions 
for A; and A.“ can now be substituted into the 
ordinary dispersion relation (2.6a), which has been 
proved rigorously if ¢ is space-like and sufficiently 
small. (less than 2M?); on doing so, we arrive at 
the representation (2.5) (without subtraction terms). 
In order to justify this substitution, we have to show 
that A, and A,“ are given by (2.7a) and (2.7b) for 
all space-like values of ¢ less than 2M?, which include 
points in the unphysical region. However, it has been 
proved rigorously* that these absorptive parts are 
analytic functions of ¢ for fixed s or « if —2M*<t<0, 
so that, if they are given by (2.7a) and (2.7b) in the 
physical part of this region, they are given by the same 
equations in the entire region. 

Our calculation of the fourth-order scattering 
amplitude from analyticity and unitarity, and _ its 
expression in the form (2.5), are thus fully justified if 
—2M?<i<0. We now make use of another rigorously 
proved result, that the scattering amplitude is an 
analytic function of the momentum transfer of one of 
the reactions, with the energy kept fixed, in the physical 
region for that reaction.‘ By doing so we can analytically 
continue our expressions into the entire physical 
regions for the reactions I and II. Finally, by repeating 
the proof with the dispersion relation (2.6a) replaced 
by (2.6b) or (2.6c), we can include the physical region 
for the reaction III as well. 

The above proof is not applicable without modifica- 
tion for arbitrary masses, as the dispersion relations 
(2.6) have not been rigorously established unless they 
satisfy certain inequalities. In order to extend the 
result to the general case it is necessary to invoke 
more analytic properties of the Green’s function. The 
variables M,, Ms, M3, and M, in Fig. 2, which corre- 
spond to the squares of the external momenta, are not 
now restricted to have their physical values. For certain 
real (i.e., noncomplex) ranges of values of these masses 
the dispersion relations (2.6) are rigorously true, so 
that, within these ranges, the proof can be carried 
through as before. 

The procedure is then to continue analytically in 
the masses with the aid of the theorem that the Green’s 
function is the boundary value of an analytic function.*® 
More precisely, there exists a region, in the six-dimen- 
sional complex space of the variables M,, M2, M3, M,, 
s and ¢, for which the Green’s function is an analytic 
function of all six variables, and this region approaches 
arbitrarily close to any point which can be constructed 
from real momenta 1, p2, ps, and py. It follows that, 
if A® is given by a certain expression for a range of 
values of the masses, and if the expression is an analytic 

9 Bogoliubov, Medvedev, and Polivanov, ‘Problems in the 
Theory of Dispersion Relations” (unpublished). 

10 Bremermann, Oehme, and Taylor, Phys. Rev. 109, 2178 
(1958). 
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function of the masses over a wider range, A“ will 
be given by this expression over the wider range as 
well. Now, as long as we keep out of a region for which 
the mass inequalities which give the anomalous thresh- 
olds occur, the calculated expression for A“ will be an 
analytic function of its variables unless s and ¢ have 
their threshold values or unless we are on the curve c 
of Fig. 1. The expression can accordingly be continued 
analytically in the masses until they have their physical 
values, as these singularities are easily avoided. 

We have thus proved that the fourth-order perturba- 
tion terms can be reproduced from the rigorously 
proved analytic properties and the unitarity equations 
alone. As these terms are found to have the double 
dispersion representation, this provides an alternative 
proof of the representation in fourth-order perturbation 
theory. 


3. CALCULATION OF HIGHER TERMS FROM 
ANALYTICITY AND UNITARITY 


In the previous section the fourth-order perturbation 
terms were obtained from the second-order terms using 
unitarity and analyticity properties. In higher orders 
there would be further terms on the right of (2.8), 
corresponding to production processes, and we would 
require to know the analytic properties of the transition 
amplitudes for these processes in order to perform the 
calculation. In this paper we shall neglect production 
processes, and shall suppose that A, is given in all 
orders by (2.8); the calculation of any term in the 
transition amplitude from lower-order terms then 
becomes a straightforward generalization of the fourth- 
order calculation. We thereby construct a subset of 
the perturbation series. As before, we do not have to 
introduce any unproved analytic properties into the 
calculation, provided that the lower-order terms satisfy 
dispersion relations, and the higher-order terms are 
then found to satisfy the double dispersion representa- 
tion. We accordingly have a proof by induction that 
all the perturbation terms of the subset have this 
structure. The approximation (2.8), (which in the case 
of pion-nucleon scattering is the one-meson approxima- 
tion of Low" and Chew and Low") is being used in 
attempts to calculate scattering amplitudes from 
analyticity and unitarity without perturbation theory. 
The present section therefore proves that, within the 
framework of this approximation, the individual 
perturbation terms satisfy the representation (2.5). 

We shall also show that the prescriptions given in 
the last section for determining which of the spectral 
functions Aj3, Ao3, and Aj». receive contributions from 
a particular term, and for finding the asymptotes of 
the regions in which they are nonzero, continue to 
hold in higher orders. 

The calculation is similar to that given in reference 2; 

uF, E. Low, Phys. Rev. 97, 1392 (1955). 

2G. F. Chew and F. E. Low, Phys. Rev. 101, 1570 (1956). 
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since, however, the right-hand side of (2.9) is now 
known from lower-order perturbation calculations, the 
result can be obtained explicitly in the form of an 
integral involving the lower-order absorptive parts. 
We begin by expressing the transition amplitudes on 
the right of (2.9) as dispersion relations (2.6c), with 
the energy s constant, so that the equation for an nth 
order term adopts the form 


A \-{5,t(2) 1} 


q 1 Ag *(5,t),tie) 
fen -{ du;,.———— 
329 ie Wie —Uie(Z.e) 
f As(O*(S,tie',t1) ] 
dt; .. ——————__ } 
lie’ —tie(Zie) 


1 A," (5,t1,thio ) 
x feu man 


ler Uio — Wio( Zio) 


1 A;'" ")(S,tio U1) 
aa fase , SA 
rs tio’ —lio(Zio) 


7) 


As in the lowest-order calculations, the two brackets 
will also contain terms depending on s alone. Again, 
the contributions to 4," from these terms are easily 
to work out and affect only the function /;(s) in disper- 
sion integrals. They correspond as before to the 
reducible graphs of the usual formalism. 

If the orders of integration over n; and the primed 
variables are changed in (3.1), the expression becomes 
the sum of four integrals such as 


ff etatdtat ses, As ")(S tio’ st) 
4-55 


327 u 
1 
x f vn a 4822) 
[ tie’ —tie(Sie) tio’ —tio(Zio) | 


The integral over n; has exactly the same form as the 
corresponding integral in the fourth-order calculation. 
We may, therefore, repeat the procedure leading to 
Eq. (2.11) with the modification that the M*’s in the 
denominators have to be replaced by ¢;.’ and ¢,,’.The 
function x is now replaced by 


qg 


x (S,tbie’ tio’) =4s[ (s—4M?) (P+ tie+ lie? 


— tte’ —2tt io’ — biel ic!) —Allic' lio], (3.3) 


so that the integral takes the form 


g 1 
, f Dirsernentepeninin 
329w [tie’ —lie( Sie) tio’ —lio( Zio) ] 
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where 


I'(5,t,liestio) = 1/Ly (S,l,licstio) |}, 
y>0, (> (lieitliot)?, s> 442, 


otherwise. 


(3.5) 
=() 


Equation (3.3) can then be substituted into (3.2), and 
the orders of integration over (’ and the other two 
variables interchanged. On doing so, we find that the 
integral has the form of the first term of the dispersion 
relation (2.7a), with A1;(s,/) equal to the expression 


1 


-—f dl iol (Syl ticslio) A 3 *(S,tie,11) 
81° 


XA3""— (Sitio,tt1). (3.6) 


The primes on the dummy variables /;, and ¢;, have 
been suppressed. 

The other three terms of (3.1) can be found from the 
symmetry of the problem under two of the interchanges 
1 U, tip Uje, Lio Uio, aS In the fourth-order calcula- 
tion. The final result for the evaluation of (3.1) is 
therefore 
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A436" (5,1) = —— -f dt lt io (S,t,tiestio) 
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1 
- f dl dU jol (Sh biesMio) 
8r° 


XK A3* (Sli) As" (S,h1,i0). (3.7b) 

The calculation just performed can also be carried 
out in the general mass case, and the result will again 
be obtained by integrating the fourth-order result 
over the absorptive parts in (3.1). As in Sec. 2, it is now 
more convenient to work in terms of the variable 2, the 
cosine of the angle of scattering, than in terms of the 
momentum transfers ¢ and wu. Since ¢ and « depend 
linearly on 2, the dispersion integrals in (3.1) can be 
replaced by dispersion integrals in z’.!’ The procedure 
is then exactly the same as in the equal-mass case. 
The right-hand side of the equation corresponding to 


13When a change of variables is made from a momentum 
transfer ¢ or u to a cosine z of an angle, we always take the angle 
between the particles corresponding to the momentum transfer 
t or u. With this convention, the variables z’ in the dispersion 
integrals, like the variables ¢’ and u’, are always positive. 
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(3.4) will be a dispersion integral in s, and the weight 
function is now exactly the same as in the fourth-order 
term, Eq. (2.14) (except that the kinematical factor 
is now g;/8w). The equations corresponding to (3.7) 


will therefore be 


A 13°" {s,t(s)} 


1 4 ; 
=—-— d2; AZ ioK (2,2;¢,2:0) 
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(3.8b) 
where 


K (2,2i0,2i0) = 1/LR(2,8i¢,2i0) |}, 
2>21,5>(Mst+Me)? (3.9) 


=() otherwise. 


The general mass case could also be handled in terms 
of the momentum transfers. The kernel T in (3.7) is 
now replaced by an expression of the form (3.28) of 
the previous paper, with the momentum transfers /,, 
and /;, replacing the masses M;’ and M,’. For the 
integrals over u;, or u;,, the variables x must be defined 
with Fig. 3 of the previous paper replaced by the 
appropriate graph, drawn according to our usual 
prescription by taking the relevant second-order terms 
in (5.1) as the perturbation terms. 

Owing to the close correspondence between the 
fourth- and higher-order calculations, it is easy to 
generalize the formulas quoted in Sec. 2 for the region 
in which the spectral functions are nonzero. We can 
draw a graph such as Fig. 4 for one of the higher-order 
terms. The shaded portion, instead of representing 
terms 1/(/;-—M;) and 1/(¢,,—M3*), will now represent 
dispersion integrals in / beginning at some lowest mass 
M;, and M¢,, which will be the sum of the masses of 
the particles being exchanged. We have just seen that 
the transition amplitude corresponding to this diagram 





Fic. 4. Diagram for a 
higher-order term. 
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is obtained by integrating the fourth-order transition 
amplitude over the masses corresponding to the 
momentum transferred across the shaded portion. Since 
the region in which the spectral function is nonzero 
decreases as the intermediate masses increase, it will 
be obtained by putting M;=M7,, Ms=Ms3 x in the 
fourth-order formulas (2.16). We observe that it 
approaches the asymptote /= (M7,+Ms5,)* as s becomes 
infinite, which is to be expected, as M7,+Ms5, is the 
mass of the lowest intermediate state of the reaction 
ITI represented by Fig. 4. 

As in the fourth-order case, we should expect to be 
able to calculate the contribution to Ai; by using the 
unitarity equation for A;. From Fig. 4, however, we 
notice that the intermediate state of the reaction III 
is now not a two-particle state, so that we cannot check 
this explicitly without going beyond the approximation 
(2.8). The fact that the contribution to A; begins at 
the correct threshold indicates the consistency of the 
unitarity requirements. 

Since the terms in A;; which we obtain by applying 
the approximation (2.8) to A; and A; are not the same, 
we cannot consistently apply (2.8) to all three reactions. 
We might apply the approximation to the reactions I 
and II alone, as we have seen that the unitarity equation 
for any one of the reactions is redundant except for 
determining subtraction terms, in which case it could 
be applied to the S-waves of the reaction III. The 
spectral functions A,; and A»; are then uniquely 
obtained, but we shall still have two different approxi- 
mations for Aj. The difficulty, of course, is due to the 
fact that a graph which represents the reaction I 
going through a intermediate state 
represents the reaction IT going through a multiparticle 
intermediate state and vice-versa. We would therefore 
have to include in the series for A 2 all terms obtained 
from the unitarity conditions for both the reactions I 
and IT. By doing so we include some of the contributions 
of the multiparticle intermediate states to the unitarity 
equations for A; and A». If desired, the unitarity 
condition for the reaction III could also be used, and 


two-particle 


Fic. 5. Diagrams with 
the lowest intermediate 
states for various pro- 

“\ cesses. 
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the resulting contributions iv A,; and A2; added to 
those already obtained. 

The demonstration given in the last section, that the 
perturbation terms can be constructed by using only 
proved analyticity properties of the transition ampli- 
tude, can be extended to general order. We need not 
repeat the reasoning, but should refer to the difficulty 
just mentioned, which did not occur in the fourth-order 
case. The procedure, as before, is to calculate A; and 
A» by unitarity and to insert them into the dispersion 
relation (2.6a). The expression then obtained is 


A 43" (s',t’) 
ds'dt/—_—__—_ 
(s’—s)(t'—2) 


{imi == 


1 Ajo") (5’ 0’) 
~ f i 
™ (s’—s){u’—u(s’,t)} 


A 23° ni) (t’ 0’) 


1 A425" ii (5’ 14’) 
+ f ds'du’——__—_.. 
Tr {s’—s(t,u’)} (u’ —u) 


The form u(s’,t) indicates that « is to be expressed as 
a function of s’ and ¢ by (2.2), and similarly for s(¢,w’). 
Ayo») and Ajo) are the contributions to A” 
obtained from the unitarity condition for the reactions 
IT and II. 

The equation (3.10), which has thus far only been 
obtained rigorously if ¢ is sufficiently small, has now to 
be extended into the physical regions for the first 
two reactions. The second and fourth terms, however, 
cannot be continued beyond the (algebraically) 
smallest value of ¢ for which A12(s,#) is nonzero. In the 
fourth-order case A ,2:?) was equal to A122, and the 
sum of the two terms could be continued into the 
entire physical region. Now we have to add to A, 
and A» the terms 


1 A jo" id (5,10’) 1 A 4o(" i) (5" 10) 
=f aw —— and — | ds’/——————_. 
u'—U ra s’—5 


T 


(3.10) 


If these values of Ai” and A,” are inserted into 
(2.6a) and the result added to (3.10), the two energy 
denominators u’—1u(s’,t) and s’—s(t,u’) become simply 
u’—u and s’—s, so that the complete expression has 
the required properties. It should be emphasized that 
these additions to A; and A», which must come from 
higher intermediate states, were imposed by the 
requirement that the scattering amplitude have its 
rigorously proved analytic properties, and not by 
postulating the double dispersion representation. 

In the sixth and higher orders, one Feynman graph 
will correspond to the sum of several terms calculated 
by out formalism. If, for instance, Fig. 4 consists of 
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n/2 lines being exchanged between the particles, there 
will be (n/2)—1 terms depending on how many lines 
are assigned to each half. There will in addition be 
terms corresponding to such a diagram which are not 
included in our unitarity approximation. 

It has been verified in sixth order that, if the right- 
hand side of (2.8) is supplemented by a term for 
three-particle intermediate states (which is the most 
we can get in this order), the result still satisfies the 
double dispersion representation. Thus any perturbation 
terms which violate it must occur only in eighth or 
higher orders, and not at all if the unitarity approxima- 
tion (2.8) is used. 


4. REGIONS IN WHICH THE SPECTRAL FUNCTIONS 
ARE NONZERO IN PRACTICAL CASES 


In this section we shall apply to some simple cases 
the formulas given in the last two sections for the 
region in which the spectral functions are non-zero. 
We have seen that, for graphs of the form of Fig. 4, the 
bounding curves are obtained by inserting the lowest 
masses of the multiparticle intermediate states into 
(2.16). The bounding curves for the whole perturbation 
series will be obtained by choosing those graphs with 
the lowest possible intermediate states. It may be that 
terms not included in our unitarity approximation give 
spectral functions which extend out of the regions so cal- 
culated, but, as the intermediate states in such terms 
are necessarily not the lowest, this is unlikely. The as- 
ymptotes certainly lie further away from the physical 
region than the asymptotes of the curves we calculate. 


Pion-Pion Scattering 


The pion-pion amplitude is symmetric under inter- 
change of any pair of the variables s, ¢, and « (for 
neutral pions). All three spectral functions are therefore 
equal to one another, and each is a symmetric function 
of its two arguments. Let us consider for definiteness 
A,;(s,t). The graphs with the lowest intermediate 
states are Figs. 5(a) and (b). In Fig. 5(a), Mzz,=Msr 
=2yu, and the remaining masses are equal to uw. The 
cosines 2;, and z,, of the angles between the intermediate 
and the initial or final states corresponding to a momen- 
tum transfer 2u are 


o 
“i 


e = io = 1+ (2u)?/ (29°), 


while the cosine z of the angle between the initial and 
the final state corresponding to a momentum transfer 
tis 

g=1+1/(2g°). 


In this case, g is simply expressed in terms of s by the 
relation 


On putting these formulas into (2.16), we find that the 
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bounding curve is given by 
st—4y? (45+ 2) =0 


(Asymptotes: s=4yu? and ¢= 16’). (4.1a) 


The bounding curve corresponding to Fig. 5(b) is 
obtained by interchanging s and ¢, and has the equation 


sl—4y?(s+41 =0 


(Asymptotes: s= 16u? and /= 4’). (4.1b) 


The area in which Aj; is nonzero thus has the form of 
the shaded area in Fig. 6. 


Pion-Nucleon Scattering 


We take p; and p; to represent the external pion 
lines, po and 4 the external nucleon lines. Thus the 
reactions I and II are pion-nucleon scattering, while 
the reaction III is the pair-annihilation reaction. 

The amplitude is a symmetric function of s and u 
(for neutral pions and scalar nucleons), so that 41; 
and A»; are equal, and Aj. is a symmetric function of 
s and u. The graphs giving contributions to Aj; are 
those for which both reactions I and III can take place 
through real intermediate states. Figure 5(c) and (d) 
give the diagrams with the lowest intermediate states. 
The relations between the 2’s, g and s and ¢ will be a 
little more complicated than in the last case, and are 
obtained by putting the appropriate mass values in 
(2.3a) and (2.4a). On substituting in (2.16), we find, 
for the bounding curves from Figs. 5(c) and (d), 
respectively, 


(¢— 164?) [s— (M+u)* JLs— (M—y)* ]—64u's=0 


(Asymptotes: s=(M-+uyu)? and /=16y?).  (4.2a) 


(¢—4y?)[s— (M+ 2u)*JLs— (M—2u)?] 
— 16u'(s+3M?—3y?) =0 


(Asymptotes: s= (M+ 2y)? and t=4y").  (4.2b) 


The graphs contributing to Aj. will represent the 
reactions I and II taking place through real intermediate 
states. Those with the lowest intermediate states are 
Figs. 5(e) and (f), and, on calculating as before, the 





Fic. 6. Region in 
which A}; is nonzero 
for pion-pion scat- 
tering. 
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bounding curves are found to be 
[u— (M+ 2)? [u— (M — 2p)? [s—(M+yn)*] 
X[s— (M—p)*? ]—16suM2y?=0 


(M+y)? and u=(M+2y)?].  (4.3a) 


[ Asymptotes: s 


[u—(M+y)? ][u—(M—yp)* [s— (M+ 2y)?] 
X[s— (M—2y)? ]—16suM??=0 
[ Asymptotes: s= (M+ 2y)? and u= (M+y)? ]. 


(4.3b) 


The regions in which the spectral functions are nonzero 
are again similar in form to the shaded area of Fig. 6. 


Nucleon-Nucleon Scattering 


The reaction I is nucleon-nucleon scattering, whereas 
the reactions II and III are both nucleon-antinucleon 
scattering. The scattering amplitude is symmetric in / 
and u. Aj» is thus equal to Aj3, and A»; is a symmetric 
function of its arguments. 

The graphs contributing to Aj; will again represent 
the reactions I and III going through real intermediate 
states. In this case, the lowest intermediate states 
occur in a single graph, Fig. 5(g). The bounding curve 
is given by the equation 


((—4y*) (s—4M*) —4y4 Q 


(Asymptotes: s=4M? and /=4y’). (4.4) 


The graphs contributing to A»; will represent the 
reactions IT and III going through real intermediate 
states. The lowest intermediate states occur in Figs. 5(h) 
and (i), and the bounding curves are 


((—4y?) (u—4M?) —4y'=0 
4u? and u=4M?"). 


(Asymptotes: / (4.5a) 


((—4M?*) (u—4y*) —4yu4=0 


(Asymptotes: /=4M? and u=4y’). (4.5b) 


5. CONCLUDING REMARKS 


The development in the foregoing section shows that 
an appreciable class of perturbation terms satisfies 
the double dispersion representation. It may be that 
some of the perturbation terms not considered in this 
paper have additional singularities and do not satisfy 
the representation. Even if this were the case, it would 
not affect the calculations that have been proposed on 
the basis of the representation, as such calculations 
make use of the unitarity approximation (2.8), and 
the terms of the perturbation series generated by this 
approximation have been shown to satisfy the represen- 
tation. If there are other singularities in the perturba- 
tion terms not treated, they should be able to be 
classified and incorporated in any calculations which 
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may be proposed to improve on the approximation 
(2.8). In fact, the nature of the singularities would 
probably become evident in the course of the calculation. 

A more serious limitation on the validity of the 
double dispersion representation is that it has thus 
far not been proved except in perturbation theory. 
As we have remarked, it is unlikely that it can be 
proved from causality along without introducing 
unitarity. In fact, for the general mass case, counter- 
examples have been found which satisfy all the causality 
requirements and yet do not satisfy even forward 
dispersion relations. On the other hand, the unitarity 
condition gives equations connecting an_ infinite 
number of Green’s functions, and it would appear to be 
very difficult to obtain rigorous results with its aid. 
The natural thing to do would be to approximate the 
unitarity condition by an equation such as (2.8), and 
to use the same approximation as is used in the applica- 
tion of the theory to calculations. In other words, we 
probably do not need all the analyticity properties of 
(2.5) in order to perform the calculations from unitarity, 
but it may be sufficient to use only the rigorously 
established properties—as is the case in perturbation 
theory. The difficulty with this approach is that the 
unitarity conditions for the three reactions overdeter- 
mine the scattering amplitude, and contradict one 
another if approximations are made. However, it may 
be possible to formulate our approximations in a way 
which overcomes this—for example, by taking into 
account only singularities which begin sufficiently 
near the physical region. 

It is therefore probably not a formidable problem 
to calculate the scattering amplitude using only 
unitarity properties, in a certain approximation, as well 
as proved analyticity properties, and to show that the 
result satisfies (2.5). Whether this representation holds 
independently of any approximations is a question 
which would be very hard to answer, and we would not 
like to make speculations about this point here. Though 
of interest, this is not really relevant in connection with 
the application of the representation as it does not 
seem possible to apply it except in conjunction with a 
unitarity approximation. Even if one attempted to 
find what happened in the limit of including more and 
more processes in the unitarity condition, the question 
of interest would be whether the approximations 
converged, and not whether the limit had any further 
singularities in the complex plane. 
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The possibility of measuring the electromagnetic form factor of the pion by extrapolation of the cross 
section for e~+p — n+a*+e~ has been investigated. The method is based on the existence of a pole in the 
pion-electroproduction scattering amplitude as a function of the invariant momentum-transfer of the 
nucleon. The residue of this pole is the pion form factor multiplied by a known coefficient. Since the pole 
lies slightly outside the physical region of the invariant momentum transfer, an extrapolation of the experi 
mental data is required. An approximate calculation of the pion electroproduction cross section has been 
made in order to estimate the experimental accuracy necessary for a significant extrapolation. Accuracy is 
required which is an order of magnitude better than that achieved at present in similar experiments. 


I. INTRODUCTION 


N recent years much attention has been devoted to 

the problem of the electromagnetic structure of the 
nucleon. Considerable experimental information on this 
subject has been provided by the experiments carried 
out by Hofstadter and his collaborators on the scatter- 
ing of electrons by protons and deuterons.' Additional 
information was derived from the measurement of the 
cross section for electroproduction of pions (e~+) — e~ 
+n-+nt) by Panofsky and Allton.? 

Among the many theoretical attempts which have 
been made to treat the nucleon-structure problem, the 
most successful have been based on the method of 
spectral representations.*~> In this method, however, 
one encounters the difficulty of requiring knowledge of 
the electromagnetic structure of the pion. It is qualita- 
tively evident that the structure of the pion must 
contribute to the structure of the nucleon. If the spatial 
extension of the nucleon’s charge and magnetic moment 
are visualized as due to the pion cloud of the nucleon, 
then the spatial extension of the pion will contribute to 
that of the nucleon. It has recently been shown® that 
if the pion structure is assumed to be sufficiently ex- 
tended, the disagreement® between the spectral-repre- 
sentation theory and the nucleon structure may be 
removed. 

Unfortunately, no experiments have been performed 
which probe the pion structure. The scattering of pions 
on electrons yields no information at available energies, 
because the relatively massive incident pion cannot 

* This work was performed under the auspices of the U. S. 
Atomic Energy Commission. 

t Present address: Institute for Advanced Study, Princeton, 
New Jersey. 
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transfer sufficient momentum to make anything but 
the outermost parts of the pion electromagnetic field 
effective. On the other hand, scattering of electrons on 
pions is not feasible because, of course, no way has 
been found for making targets from pions. The purpose 
of this paper is to describe a method of using the pion 
cloud of the proton as a “pion target.” 

The procedure to be described is an application of a 
general method suggested recently by Chew and Low.’ 
In this case, their method enables one to measure the 
electromagnetic form factor of the pion by extrapolation 
of the cross section for e~+p— n+n++e-. The basic 
principle involved is the possibility of analytic con- 
tinuation of the electroproduction scattering amplitude 
as a function of the square of the four-momentum trans- 
fer of the nucleon, A’. If the initial nucleon has four- 
momentum #, and the final nucleon, p’, then we have 
A?’=—(p—p’)*?. It is conjectured that at A?=—y’, 
where yw is the mass of the pion, the scattering 
amplitude has an isolated pole whose residue is 
just the electromagnetic form factor of the pion 
multiplied by a known coefficient. It can easily be 
shown that negative values of A? are not physically 
attainable, so that an extrapolation of the measured 
cross section is necessary in order to reach the pole at 

*=—y". The distance of extrapolation is, however, 
small compared to the physical range of A®. The pro- 
cedure for determining the electromagnetic form factor 
of the pion is, then, to extrapolate the cross section, 
with its singularities suitably removed, to the point 
A’=—y’. This procedure will be described in detail in 
Sec. IT. 

Experimentally, it will be necessary to measure the 
electroproduction cross section as a function of at least 
two variables, A? and X. If s is the incident electron 
four-momentum, and s’ is the final, then we have 
\=—(s—s’)*. One must know A because the pion form 
factor is a function of this variable. In practice, one 
could determine these variables by measuring the 
energy and direction of the final electron, plus the 
direction of the final meson. No such measurements 
made. In their recent experiment, 


have yet been 


7G. F. Chew and F. E. Low, Phys. Rev. 113, 1640 (1959). 


1763 





1764 WILLIAM 


Ate Ea +2M yt ye At 2ue kot 3u? | aertes 
== 7 =“ 
Zi qiiki f Z2iqitki — 


==> if ne an ee 
—) (—=—-$-) 


2iq k 2igiik | 


Branch 
point 








Fic. 1. Poles and branch points in the cos@ plane. The scale of 
the figure is —— to the values E=9.66, \=10. The right 
hand pole is the one at A?= —,?. 


Panofsky and Allton*® measured the electron variables 
but did not detect the meson directly. 

The analytic properties of the electroproduction 
scattering amplitude, which are the basis of the method 
outlined above, have not been proved rigorously. They 
are, however, a very reasonable extension of properties 
which have been proved for other scattering problems. 
A plausibility argument will be given in Sec. II A. Such 
analytic properties of scattering amplitudes have been 
the object of much study. Their application in the form 
of spectral representations has been successful in corre- 
lating many experimental data in pion physics. Appli- 
cations very similar to that proposed herein have 
already been made for the purpose of measuring the 
pion-nucleon coupling constant. Extrapolations of both 
the nucleon-nucleon*’ and pion-photoproduction”™ cross 
sections as functions of invariant momentum transfer 
yielded values of the coupling constant in reasonable 
agreement with values obtained by other means. 

In principle, then, the analytic properties of the 
electroproduction scattering amplitude described above 
tell us that one can determine the pion form factor by 
an extrapolation procedure. In order to assess the 
practical difficulty involved in performing an extrapola- 
tion of a given set of experimental data, one must 
estimate the behavior of the electroproduction cross 
section as a function of A*. If the electromagnetic 
interaction is treated in lowest order of perturbation 
theory, it is apparent that the electroproduction and 
photoproduction matrix elements are closely related. 
The dispersion-theoretical analysis of photoproduction 
by Chew, Goldberger, Low, and Nambu" (hereafter 
called CGLN) has been extended to electroproduction 
by Fubini, Nambu, and Wataghin™ (hereafter FNW). 
The calculation in Sec. III is performed by the use of a 
modification of their theory: The Born terms in the 
matrix element are written in relativistic form and, 
correspondingly, certain recoil-correction terms are 
dropped. 

In Sec. IV the results of this calculation are inter- 
preted as implying that for a significant extrapolation 
experiments of great accuracy will be necessary, accu- 


§ Geoffrey F. Chew, Phys. Rev. 112, 1380 (1959). 

* P. Cziffra and M. J. Moravesik, Phys. Rev. (to be published). 

Taylor, Moravesik, and Uretsky, Phys. Rev. 113, 689 (1959), 
quoted in the following as TMU. 

1 Chew, Goldberger, Low, and Nambu, Phys. Rev. 106, 1337 
and 1345 (1957); quoted in the following as CGLN. See also 


Geofirey F. Chew, Encyclopedia of Physics (Springer-Verlag, 
Berlin, to be published), second edition, Vol. 43. 

 Fubini, Nambu, and Wataghin, Phys. Rev. 111, 329 (1958), 
quoted in the following as FNW 
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racy an order of magnitude better than that achieved in 
the electroproduction experiment of Panofsky and 
Allton.? 

II. EXTRAPOLATION PROCEDURE 


A. Location of Singularities of 
Scattering Amplitude 


Let us consider in detail the analytic properties of 
the pion-electroproduction scattering amplitude on 
which the proposed extrapolation depends. As remarked 
in the introduction, no rigorous proof of these properties 
has been given. However, such properties have often 
been conjectured; for instance, in the two-dimensional 
spectral representation proposed by Mandelstam™ and 
verified to fourth order in perturbation theory. Motiva- 
tion for conjecturing them comes from two sources: 
analogy with proved properties of simpler scattering 
amplitudes and analysis of perturbation theory. Argu- 
ments of both types have been given by Chew® for 
nucleon-nucleon scattering, and by Taylor, Moravcsik, 
and Uretsky” (hereafter TMU) for photoproduction. 
The latter case can be extended very easily to electro- 
production. To lowest order in the fine-structure con- 
stant, electroproduction is just photoproduction by a 
virtual photon. The only differences are that the 
“photon” has a nonzero, imaginary mass (k?=—)) in 
electroproduction, and that the matrix element contains 
longitudinal as well as transverse terms (i.e., k- 20). 
The existence of transverse terms will not affect the 
analytic properties, and the “photon” mass will only 
cause a shift in the position of the singularities. 

The recipe used in the papers referred to above is the 
following: to get one part of the spectrum consider 
the intermediate states which could be reached if p 
and p’ were incoming particles, with g and k outgoing 
(k=s—s’). The lowest of these is the discrete single 
pion state, which gives a pole at A?= —y? (on which the 
proposed extrapolation is based). The next state is that 
of two pions, which gives rise to a branch point at 
A?= — 4p. 

The other half of the spectrum, the crossed spectrum, 
is found in a similar way by considering the states 
which can be reached if p and g are incident. This leads 
to a pole at (p—gq)*=M?*, where M is the nucleon mass, 
and a branch point at (p—q)?=(M+4u)*. Now since 
p+k=p'+q, we find that 


(p—9)?=A?—E?+2M?+p?—2. 


Here E*=(p’+q)’; ie., E is the total energy of the 
final nucleon and pion in their barycentric system (the 
system in which p’+q=0). Using (2.1), one finds that 
the crossed spectrum gives rise to a pole at 


A?= F?— M?—y?+), 
and a branch point at 
‘ A*= E?— M?+2My-+i. 
18S, Mandelstam, Phys. Rev. 112, 1344 (1959). 


(2.1) 
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Fic. 2. The class of diagrams giv- 
ing rise to the pole at A?= —,?. 


The spectrum of singularities in the A? plane can be 
re-expressed in terms of cos#, where @ is the angle 
between q and k in the p’+q=0 system. With all 
symbols referring to this system, we have 


A?=)—p?+ 2woko— 2! q] |k! cos, (2.2) 
where 
w= (1 q|?+u?)}. 


Then in the cos@ plane the analyticity region is the cut 
plane, with poles and branch points as shown in Fig. 1. 
The quantity E, is the energy of the final nucleon. 
Actually, the existence of such a large region of ana- 
lyticity is not necessary for the proposed extrapolation. 
The method requires only that we have analyticity in 
some region containing the physical region |cos6@| <1 
and including the pole at cos@= (A+ 2wk»)/2|q!|k! as 
an isolated singularity. 


B. The Extrapolation Formula 


If one accepts the viewpoint that perturbation theory 
can yield information on analytic behavior, then the 
existence of the pole at A?= —y* can be demonstrated 
and its residue computed. It is evident that the class 
of diagrams shown in Fig. 2 gives rise to the pole. It is 
easy to show that no other type of diagram can contain 
this pole. The contribution of this diagram™ to the 
cross section gives, in covariant form, 
da 4g’a®A?F,?(A) 

—=-— —_—§[4(q-s)(q-s’)—p?r ] 
dP )*n?I(A?+ yp")? 
. +other terms, (2.3) 
where [=[(p-s)?—_M?m? ]!= Ms _, and where dP is the 
phase space factor: 
d'p'd*s'd5q 
dP=- —$!(p-+s—p’—s’—q). 


8wollo€s 


(2.4) 


The symbols ¢, €2 are defined as the initial and final 
electron energies. One finds that 


m|q|*dAdiE*dQ, 


ir — 
16M ;we| (d/d| q|)q-(p+s—s’) | 


where dQ, refers to the outgoing meson. Since dP as 
defined by Eq. (2.4) is a Lorentz invariant, this ex- 
pression must be valid in any coordinate system. In the 


4 The coupling constants used are g?=g,2/4r, f=gu/2M, and 
f?=0.08. 


FORM 


FACTOR OF PION 


lab system, we have 


r|q|2dddE*dQ, 


* 16Ms1£|q| (M+ho)—w2| k| cosd) 


» 


In Eq. (2.3) two quantities in the numerator, g and 
F,(A), are not the most general expressions corre- 
sponding to the diagram of Fig. 2. They have been 
given the value appropriate to A?= —,? in anticipation 
of the extrapolation to that point. The pion form factor, 
F,(A), is defined by considering the pion-photon vertex 
with both pions on the mass shell. If we write the con- 
tribution of this vertex as j,(q1,g2)e", the most general 
form of 7 consistent with Lorentz invariance is 


I (q1,q2) = —eF,(A)q _ eG (A) qo, 


where A= — (¢2—q:)”. The continuity equation imposes 
the further requirement (q.—q:):j/=90, giving F,(A) 
=G,(A), or 

J (1,92) = — (qitge)eFs(A). 


This definition of the pion form factor is normalized so 
that F,(0)=1. 

Equation (2.3) reveals an additional singularity 
which must be removed before an extrapolation can be 
performed. The factor in‘ brackets, which results from 
taking the trace of matrix factors, has the form, as 
a function of cos@: P(cosé)+sin@P2(cosé), where P; 
and P, are polynomials. Now since sind= (1—cos*@)}, 
sin@ has a branch point at cos@= 1, preventing extrapola- 
tion. We cannot get rid of this singularity by division 
by the factor in brackets, since it can vanish in the 
region of extrapolation and does vanish in the case to 
be discussed in Sec. III. We can, however, eliminate 
this undesirable sin@ by defining a “symmetrized” cross 
section. Define o(@)=do/dP. The quantity o(@) does, 
of course, depend upon variables other than @. Then 
define the symmetrized quantity 

(a(cos0)),=0(0@)+a(—8). 
In Eq. (3.19) an equivalent definition of this sym- 
metrization will be given. The quantity (o(cos@)),, will, 
of course, be free from the branch point at cos@=1, and 
we can at last write the extrapolation formula: 


(2.7) 


4p%ay?F ,2(d) 


(A*+u')*(0),| a—9= — 
w’Ms 


 (4(q-5)(q°s’)—p?A)s| at? (2.8) 

The above discussion has shown that if one knew the 
value of the differential cross section do/dP over some 
portion of the physical region, one could obtain the 
value of F(A) by analytic continuation of the function 
(A*?+?)(a(cos@)),. The relation between the value of 
this function at A?=— ?, and the pion form factor is 
given by Eq. (2.8). This is an idealization, of course, 
since in practice one can know the function only to 
within a certain error and at a finite number of points. 
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Qne practical procedure that can be employed is to 
plot the experimental value of (A*+ y")*(o), and fit a 
polynomial in cos@ to these points by the method of 
least squares.*:'® The residue is then given by the value 
of this polynomial at the pole. Some consideration will 
be given in the next section to the error involved in 
this method of extrapolation. 


C. Kinematical Considerations 


In order to perform the proposed extrapolation, it is 
necessary to know the cross section as a function of 
both the extrapolation variable A? and the variable \ on 
which the form factor F,(A) depends. Since an \-par- 
ticle (incoming plus outgoing) scattering problem is a 
function of 3.V—10 variables (neglecting spins), electro- 
production is a function of five variables. A convenient 
choice for the other three variables is £, the total energy 
of the pion and nucleon in the p’+q=0 system; 7, the 
laboratory kinetic energy of the incident electron; 
and @, defined in the p’+q=0 system by 


cosd = (kX s)-(kXq)/|kxXs  kxq). 


(2.9) 


In principle these three variables could be integrated 
out and the cross section measured as a function of A? 
and A only. In practice, it may be most convenient to 
determine all five variables; for instance, by knowing 
the incident electron energy and measuring the distri- 
bution of mesons as a function of direction, in coinci- 
dence with final electrons of given direction and energy. 
Then in performing the extrapolation all variables 
except A? must be held fixed. The question then arises: 
what values of E, 7, and @ are most favorable to the 
extrapolation procedure ? 

To answer this question we must be able to estimate 
the error associated with extrapolation. Let us assume 
that the extrapolation will be done by fitting a poly- 


Kph (Mev) 
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Fic. 3. The distance of extrapolation as a function of E for two 
values of A. Also shown is kpn, the equivalent laboratory energy in 
photoproduction (E?=M?+2M:k,))). 


16 P. Cziffra and M. J. Moravesik, University of California 
Radiation Laboratory Report UCRL-8523, October 17, 1958 
(unpublished). 
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nomial in cos@ to the function (A?+y*)%o), by the 
method of least squares. Then we can use a well-known 
formula to calculate the error.’ 
Let a» be the position of the pole as a function of 
cos@. Then define 
x=xXo— cosé. 


In the p’+q=0 system, we find 
A?+py?=2|q| | k] x. 
We wish to extrapolate the function 


f (x)= (2! q| | k])?x*(0), (2.10) 


to the point x=0, the position of the pole of (@),. The 
error in the least-squares polynomial at the point «=0 
is given by 

(2.11) 


1 
“) 


Aao= pl (h 00 ] 
where 
N x,™t n 
ie " y 


9 
i=1 


oi 


(244 


linn= 


and where p, which depends upon the goodness of fit, 
is ~1.!° The summation in Eq. (2.11’) extends over 
the points at which f(«) is known, and ¢; is the experi- 
mentally determined standard deviation of f(x) at a;. 
In order to use Eq. (2.11) to determine the dependence 
of Aap on the experimental parameters, we must make 
some assumption about the behavior of the ¢,’s. Let us 
assume that there is a constant standard deviation € in 
the cross section (¢), for all values of x. This seems to 
be the most reasonable assumption to make in the 
absence of detailed knowledge of the behavior of (¢),. 
This assumption implies that the standard deviation ¢; 
of f(x) is 


Then if p=1, Eq. (2.11) becomes 


Aao= &(2!q |k )* (H N09 |, 


where 


N 
Hnn=> xymtn 1 


i=l 


Note that the error Aap is proportional to the absolute 
error in the measured cross section, and to the factor 
[(H~ Joo]! which depends only upon the distance of 
extrapolation and the distribution of points in the 
physical region. 

It is qualitatively evident that the error will rise 
with the distance of extrapolation. Let us calculate the 
distance of extrapolation as a function of E and X. 
Equation (2.2) shows that the pole occurs at 


cosd= (A+ 2weko)/2| q| | k| =xo. (2.13) 


Holding \ fixed, one finds that all quantities on the 
right-hand side are known functions of E. One can easily 
show that in the p’+q=0 system the following rela- 
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Fic. 4. The dependence of the extrapolation error Aao/é on E. 


tions hold: 
2w.= E— (M?—y")/E, 


2ko= E— (M?+))/E, 
E,= E-w2= (|q| *+M)!, 
E,= E—ko= (|k|?+M?)!. 


The distance of extrapolation as a function of £, for 
two values of \ is shown in Fig. 3. Unless otherwise 
specified, pion mass units are used throughout this 
paper. 

In order to evaluate the dependence of Ado on the 
distance of extrapolation, we must calculate (H™~')oo. 
If we assume that we can fit f(#) with a polynomial of 
the fourth order (S and P waves only), then H,,,, is a 
five-by-five matrix. The inversion of this matrix was 
performed by a machine calculation for various dis- 
tances of extrapolation for the arbitrary case of eleven 
points x; spread evenly over the physical region. The 
results are summarized in Fig. 4, where Aao/é is shown 
as a function of E for \=10. If & does not vary much 
with E, higher values of E are clearly favored by this 
consideration. 

A second consideration is the size of the residue at 
the pole. For a given Ado, the size of the residue deter- 
mines the percent error. From Eq. (2.8) one sees that 
for a given value of \ and T,, the residue is propor- 
tional to 

(4(q:s)(q-s’) —pA)s. 


In Fig. 5 this factor is plotted against E for two extreme 
values of ¢. If one chooses ¢=0 or z, the size of the 
residue decreases with E. This decrease is, however, 
greatly outweighed by the more rapid decrease of Aao/é 
with E shown in Fig. 4. The conclusion indicated is that 
unless experimental conditions create very strong vari- 
ations with £ in the absolute accuracy attainable, high 
values of £ are desirable. 

On the other hand, if E is too high, one will be forced 
to use a polynomial of higher than fourth order to 
fit f(x); ie, D waves will become important. The 
machine calculation of (Z7~')oo showed that the error 
increases markedly with the order of the polynomial. 
The error was calculated in the same manner as for 
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Fig. 4 for the point E=9.66 for fifth- and sixth-order 
polynomials. For the fourth-order polynomial, Fig. 4 
shows Aao/&= 134. For the fifth-order, we found Ado/ é 
= 345, and for the sixth-order, Aao/£= 952. 

By machine calculation of the error for several specific 
distributions of points «;, some qualitative conclusions 
were drawn concerning the most favorable distribution. 
First, it seems desirable to have measurements spread 
over as wide a region as possible, preferably the entire 
range of cosé. Second, it seems desirable to concentrate 
most of the points close to cos#=1. For example, at 
E=9.66 we saw that for eleven evenly spaced points 
Aao/é= 134. For nine points at cos@= 1, 0.98, 0.94, 0.87, 
0.71, 0.26, 0, —0.71, and —1, we found Aao/é=95. 
Third, it is of course desirable to have as many points 
as possible. For 6, 11, 14, and 21 evenly spaced points 
(E=9.66, fourth-order polynomial) ; Aao/£=423, 134, 
100, and 75, respectively. 

It is beyond the scope of this work to provide a 
definitive answer to the experimental question of the 
optimum values of the parameters for the purpose of 
the extrapolation. An attempt has been made in this 
section to discuss the most important factors. In 
Sec. III D a calculation will be made of the electro- 
production cross section in order to translate Eq. (2.12) 
into an estimate of the maximum percent error com- 
patible with a significant extrapolation. It is desirable, 
of course, to carry out this calculation at the most 
favorable value of the parameters. In order to consider 
a more familiar parameter, let us introduce the equiva- 
lent photoproduction energy ky», defined by 


kyn= (E?—M*)/2M. (2.14) 


The quantity &,» is the energy a photon must have in 
the laboratory in order to produce a pion-nucleon final 
state having total energy £ in the barycentric system. 
On the basis of the considerations discussed in this 
section, a reasonable guess for the most promising value 
of ky» is ~500 Mev. The corresponding value of 
E= 9.66 will be used in the calculation in Sec. III D. 
As a final kinematical consideration let us determine 
the possible range of A for a given £ and 7,. In order 
to derive this range, note the relationship of A, Z, and A?” 
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to laboratory quantities, denoted by the subscript L: 
A= 2T .€21,(1—cosy), (2.15) 
E*= M*—\+2M (T1—«21), 


A*?=2M(E21—M). 


(2.16) 
(2.17) 


Here y is the laboratory angle between the initial and 
final electron directions. In Eq. (2.15) the mass of the 
electron has been neglected compared to its momentum. 
From Eqs. (2.15) and (2.16) we find 

| (2.18) 


For a given A, the maximum value of E? is obtained for 
backward scattering, Y=. Conversely, ¥y=7 also gives 
the maximum value of \ permitted for a given E and 7, 
(see Fig. 6). By increasing 7; one can obtain higher 
values of A. 


M 


p—M+2MT La 1+ —— 
T 1(1—cosy) 


III. ESTIMATE OF ELECTROPRODUCTION 
CROSS SECTION 


A. General 


In order to assess the difficulty of carrying out the 
proposed extrapolation, one must estimate the electro- 
production cross section as a function of cos@, with the 
parameters A, E, 7, and ¢ held fixed. A general treat- 
ment of the electroproduction problem has been given 
by Dalitz and Yennie (hereafter DY).'® The most 
recent calculation, based on the photoproduction theory 
of Chew, Goldberger, Low, and Nambu,!! was made by 
IF NW. In this section a treatment will be given which 
relies heavily on the aforementioned papers, but which 
treats somewhat differently the corrections due to the 
finite mass of the nucleon. 

Define the T matrix 


m*M? , 
Si “bio p+ ats'—p-s)( — ) te (3.1) 


wok Foeyes 


The T matrix element can be expressed in terms of 
the current j associated with the transition from nucleon 


16 R. H. Dalitz and D. R. Yennie, Phys. Rev. 105, 1598 (1957), 
quoted in the following as DY. 
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to final pion-nucleon state : 
T= [eg,/ (2m)?! ]j-«. 


This form applies to both electroproduction and photo- 
production. In the latter case, € is proportional to the 
polarization vector; in the former, ¢ has to lowest 
order in e the value 


(3.2) 


€, = ent(s’)y,u(s)/d. (3.3) 


As pointed out in DY, the time components of 7 and € 

can be expressed in terms of their space components by 

means of the continuity equation. This is useful in 

adapting photoproduction results to electroproduction. 
The cross section can be written 


E\? | 
) a, (3.4) 
M 


where the dimensionless quantity ® has the value 


a’g” ( 
dP a m’Ms 1X 


do 


(2M)? 


r 


M\? 
(—) DX jut jr(2s4s’— suk’ — s*ke— 3rg). (3.5) 
E 


We must now develop an expression for the current j. 


B. Dispersion Theory of Photoproduction 
and Electroproduction 


The most successful calculation of photoproduction, 
given in CGLN, is based on the method of spectral 
representations. This calculation was extended to elec- 
troproduction by FNW. At the values of the experi- 
mental parameters suggested in Sec. II C, namely 
T,=700 Mev and ky»=500 Mev, the approximate 
solutions of the dispersion equations given in CGLN 
cannot be expected to be very accurate. The solutions 
are based on neglecting terms of higher than first order 
in p/M, w/M, and |k|/M. When kyn=500 Mev and 
\=10, |k| /M=0.56. An indication of the accuracy of 
the CGLN formulas can be obtained by comparison 
with photoproduction data. Although photoproduction 
of neutral pions from protons is in excellent agreement 
with the CGLN theory up to 450-Mev lab energy,” 
positive-pion production shows poor agreement above 
300 Mev. A summary of the comparison of theory and 
experiment in positive-pion production is given in Fig. 8 
of the paper by Lazarus, Panofsky, and Tangherlini.!* 
The lack of quantitative success of the photoproduction 
theory at high energies need not discourage us much 
here. The theory gives a semiquantitative fit, which 
should be quite adequate for the purpose of estimating 
the difficulty of the proposed extrapolation. 

Let us now consider those aspects of the dispersion 
theory of electroproduction which are most important 
for our calculation, referring the reader to FNW for a 


‘7 McDonald, Peterson, and Corson, Phys. Rev. 107, 577 (1957). 
18 Lazarus, Panofsky, and Tangherlini, Phys. Rev. 113, 1330 
(1959). 





ELECTROMAGNETIC FORM 


more detailed account. The dominant terms in the 
amplitude for the production of positive mesons are the 
renormalized electron-charge Born approximation jg 
and the resonant magnetic-dipole amplitude jay. The jz 
term is 


F,(A)(2q—k)’ 
jp’ =i(p')) t¥s— 
A 


F(X) 


¥ (p-g+M | 


0) — is] (3.6) 


The first term, arising from the diagram of Fig. 2, 
contains the pole on which the extrapolation is based. 
Note the presence of the pion form factor, which was 
set equal to unity in FNW. The proton form factor 
F,?(A) is quite well known from the electron-proton 
scattering experiments by Hofstadter and collaborators. 
For F,"(A), the assumption F,"(A)=0 will be made. 
At present experimental results indicate that no large 
error is made by this assumption at the value A= 10y’." 
The resonant magnetic dipole term is 


D 

7 fiL2axk—ia(e-k) +io(q-k)]|%), (3.7) 
BY(A) 

= ———-e*8 sind. 


6f?|q)* 


(3.0) 


Additional resonant terms arising from the meson 
current will be neglected here, along with corrections to 
the Born-approximation values of the S-wave and small 
P-wave amplitudes. In this approximation the complete 
expression found by FNW for the current 7 consists 
of ju, of the static limit of jz, and of recoil corrections 
of order u/M. 

An alternative procedure, which will be employed 
here for including recoil corrections, is to use the com- 
plete covariant expression jg instead of its static limit; 
i.e., Set 

j=jatJw. (3.8) 
Except for the small part of w* that arises from the 
anomalous moment, this expression includes all the 
u/M terms in FNW. In addition, by avoiding the 
expansion of 7g in powers of 1/M, it includes some of 
the higher-order corrections which have been dropped 
in FNW. One might hope that these corrections which 
have been included are the more important ones. The 
magnetic dipole amplitude jy already includes recoil 


19 Wolfgang K. H. Panofsky, 1958 Annual International Confer 
ence on High-Energy Physics at CERN, edited by B. Ferretti 
(CERN Scientific Information Service, Geneva, 1958). See 
especially p. 18, 
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corrections to some extent when one uses experimental 
values from pion-nucleon scattering for the phase shift. 
In fact, the CGLN formula for photoproduction of 
neutral pions, which is dominated by jy, has been 
shown to agree well with the experimental data’ up to 
450-Mev photon laboratory energy. 

In order to gain further confidence in the use of the 
amplitude jy at energies well above resonance and at 
values of \ up to 10, which is to be used here, let us 
examine how the specific form of jx arises in the CGLN 
theory. In their notation M1, is essentially our jy. 
They found [CGLN Eq. (13.1) ] a simple proportion- 
ality accurate to order u/M, of Mi, and the corre- 
sponding pion-nucleon scattering amplitude fi,. If we 
wish to use this result at energies well above the (3, 3) 
resonance energy, the important question is: How 
large a deviation from Eq. (13.1) of CGLN occurs when 
higher-order terms in u/M become appreciable? A par- 
tial answer to this question can be obtained by con- 
sidering the Born-term contributions to My, and fi,. 
By a straightforward but lengthy projection, one ob- 
tains, neglecting D waves and higher waves and setting 
u=l, 

2M (E— M)g*|\q\°V-? 
== , 

3E(2Esw2— 1)? 


(3.9) 


where V;2=[(M+£),2)/2M }}. For electroproduction, 
which reduces easily to photoproduction when A=0, 
the nucleon-current part of the electric Born terms gives 
Mi4,.2 ef 
——=4—Fy,, (3.10) 
q| | k| 3 
where 


M?ViV 2 E-—M 
Fy, = | = 
E (2Eow.— 1)? 


1 
4 | (3.10’) 
(FA, +M)(2E\wo—1) 


The anomalous-moment Born term gives 


: V,. .B ui VeMViV2 
Mi, == 
lal {k} 3E(2Ews—1) 


ko Rw 
x{ 14 + | (3.11) 
2(2£;+M) 2(2E\w2—1) 


Recalling that uw” (A) =u’ (A) +e" (A)/2M, we find that 
the simple proportionality, 
1 1 
— (Mj, 2 H+My, 2 )= M,,8 
iq) | k! 1q!|k| 
u(X) 


) d 
2f\q\? 


fi . 
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is exact to order u4/M with r=1. Calculating the exact r 
from Eqs. (3.9), (3.10), (3.11), and (3.12), one finds for 
photoproduction at 260- and 400-Mev laboratory 
energies that r=0.990 and 1.007, respectively, whereas 
for electroproduction at ky»=500 Mev and A=10, 
0.932. Thus in photoproduction the simple propor- 
tionality of the Born terms is retained well above the 
(3, 3) resonance energy ; even for electroproduction, up 
to at least the stated values of k,» and X, the value of r 
can be set equal to unity with sufficient accuracy for 
our purposes. The expression (3.7) for jx will therefore 
be used as it stands. 

One further comment should be made about Eq. (3.8) 
which will be used for j, namely, that the inclusion of 
both jg and jy results in counting one term twice. We 
have seen above in Eq. (3.10) that jy contributes to 
the amplitude M,, .%°*’. This contribution is included 
in jaw and should therefore be subtracted out of jz. 
Equation (3.10) shows that the quantity that should be 
subtracted from j is 


r 


rie 
- Fy (f| (2qx& k—i(e-k)q+i(q-k)e@}! i). 
6M? 


e 


Numerically this term proves to be no more important 
in the case to be considered than many terms already 
neglected and will therefore be neglected also. 


C. Calculation of the Photoproduction 
Cross Section 


The formula (3.8) for j developed in the previous 
section avoids expanding the electric Born terms in 
powers of 1/M and may therefore include nucleon- 
recoil effects more accurately than the CGLN formula. 
Since definite disagreement has been observed!® in 
positive-pion production between the CGLN formula 
and experiment,” a calculation has been made at a 
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Fic. 7. Photoproduction at 400 Mev. Solid line: prediction of 
modified CGLN formula, Eq. (3.8). Dashed line: prediction of 
CGLN formula. Experimental points are those of Walker et al.” 

* Walker, Teasdale, Peterson, and Vette, Phys. Rev. 99, 210 
(1955). 
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laboratory energy of 400 Mev to see if Formula (3.8) 
leads to better agreement. Unfortunately this does not 
seem to be the case. As shown in Fig. 7, the calculation 
led to an angular distribution almost identical to that 
predicted by the CGLN formula. Nevertheless the 
formula developed in this paper may be significantly 
more accurate in the electroproduction case, where 
recoil corrections can be more important. 


D. Calculation of the Electroproduction 
Cross Section 


Having developed and investigated a formula for j, 
we are ready to proceed to a calculation of the electro- 
production cross section. Since the machinery for the 
calculation has been set up in Sec. III A, we need only 
insert Formula (3.8) for j into Eq. (3.5). Separating 
into the part coming from jz, from jy, and from the 
cross term, we write 


&=$3;4+4O),4+- Pc. (3.13) 


For the magnetic-dipole term ®y,, one finds 


Py= 'D *{5/q)?|k|?—3(q-k)? 
+ (4/A)[3(q- kX s)?+|q)?(sxXk)?]}, (3.14) 
where D is defined by Eq. (3.7’). 
The Born-term part of © can be evaluated covariantly 
by substituting Eq. (3.6) into (3.5). The result is: 


A2 FF, 
- —[4(q-s)(q-s’)—d ] 
X (4?+1)? 


Pp = 


+—*_| — rata’) + (p-8) (02422) 
(E2—M?)2| 


A? 
en (p-s)(p-s’)+2(p-s)(p’-s) 


4 
+2(6-8')(B 8) +1209" B)(b-)(0°8) 


—(p-k)(p-s)(p’-s")— (pk) (p-s’) (p's) ] 
2FyF, 


\(E2—M2)(A2-+ 


SLA oA)M(2) 
. —(p-k)h(p’)]. (3.15) 


In this formula the assumption has been made that 
Fy"(A)=0, and F,?(A)=F2?(\)=F2"(\)=F (A), and 
the abbreviation 

h(p) = (2g—k) pp (2s4s’— shh” — s*ke— 3g") (3.15’) 


has been introduced. 
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Finally, the cross term is 


E®¢ 2Fy Z 2s-k—X 
orem SE ee oem “(a k+-C:+—— “c.) 
2M ViV2 ReD w* nN 0° 
+ 
2M 
a a i 
+— —4 2|q|?|k]|? sin’?¢+-— 
V2(A2-+1) | d 
(28-k—2)(2q—k)-k 
x|2020- k) -s——— | 


F ty 


"| | q*| 53 —cos') 
2Mw*V\?V 2" 


4 
sa liane s)?+Ciq-k+C2q:s ] 


2(2s-k—|k|2) C. 
stiches decom (F,—Fy)— 
Ako? V;? 


(3.16) 


The following symbols were introduced : 
w*=E—M, 
C:=(sXq)-(sxXk), 
C.= (kXq)-(kxXs). 


(3.16) 


The last term in Eq. (3.16) is the contribution of a term 
that, following FNW, was added to jx to restore gauge 
invariance. 

A numerical calculation of these formulas as a func- 
tion of @ has been made for the case discussed in 
Sec. IT C, namely 


kyn=500 Mev, or E=9.657, 
T .=5.039~ 700 Mev. 


Two values of the angle @ [defined in Eq. (2.9) ] were 





Fic. 8. The dimen- 
sionless functions 
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+BF,Fy+CFy’, for 
= 180°. 











8 (deg) 


FACTOR 





Fic. 9. The func- 
tions A (@), B(6), and 
C(6@) for ¢=0°. 
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used: ¢6=0° and ¢=180°. From electron-nucleon scat- 
tering experiments the value of Fy was chosen to be 
Fy (10) =0.62.! 

In order to carry out the calculation it was necessary 
to estimate the value of the phase shift 63; at E= 9.66, 
which corresponds to pion-nucleon scattering at 350 
Mev. One cannot use the Chew-Low effective-range 
formula at such a high energy. At 3 7 Mev, Chiu and 
Lomon?! find a significant deviation. In order to esti- 
mate the phase shift at 350 Mev, the points of Chiu 
and Lomon and one point by Willis”? have been joined 
by a smooth curve and the value 63;= 145° read off at 
350 Mev. This crude estimate should be quite adequate 
for our purposes. 

To make the dependence on the unknown form factor 
F, explicit, let us write 


b= A (6,6) F 7+ B06) FF nt+C(0,6)Fy*. (3.17) 


The calculated dependence of A, B, and C on @ is 
shown in Fig. 8 for ¢=180° and in Fig. 9 for ¢=0°. 
The physical meaning of the two values of ¢ is illus- 
trated in Fig. 10. Comparison of the two cases reveals 
the fact that A depends very strongly on ¢. The origin 
of this strong dependence is the term in brackets in the 


Fic. 10. Illustration of the two 
cases 6=180° and ¢=0° in terms of 
momenta in the system in which 
p’+q=0. 


21H. Y. Chiu and E. L. Lomon, Ann. Phys. 6, 50 (1959), 

* Reported by Oreste Piccioni, in 1958 Annual International 
Conference on High-Energy Physics at CERN, edited by B. Ferretti 
(CERN Scientific Information Service, Geneva, 1958), p. 67. 
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Fic. 11. A plot of the dimensionless function , proportional to 


the electroproduction cross section, for E=9.66, \=10, T,=700 
Mev, ¢=7, and various values of the pion form factor. 


following expression for A: 


2 9 


M 2 2 
1-4(—) ———[4(q-s)(q-s’)—d]. (3.18) 
E/ \(A*?+1)? 


The term A is of course the pole term which has been 
discussed, but the extreme @ dependence noted above 
will be smoothed by the extrapolation procedure sug- 
gested in Sec. II B. Recall that it is necessary to sym- 
metrize the cross section as prescribed by Eq. (2.7) in 
order to eliminate a branch point. The symmetrized ® 
can equivalently be defined as 


, (0,6) = (0,6) +9(6, +7). 


This symmetrized ®, is then the average of the ¢=7 
case of Fig. 8 and the ¢=0 case of Fig. 9. 

Before going onto the consideration of ®, and its 
extrapolation, let us note the dependence of the electro- 
production cross section on the parameter F,(10), with 
F y(10)=0.62. Several cases are plotted in Fig. 11. One 
interesting feature of this graph is that for any curve 
F,(10)=8, where 80.85, there is another very similar 
curve corresponding to a different value of F,(10). 
This is illustrated in Fig. 11 by the curves for F,(10) 
=F y(10)=0.62 and for F,=0.20. 

Finally, Fig. 12 shows the calculated behavior of the 
extrapolation function (A?+1)*, at the end of the 
physical region. 


(3.19) 


FRAZER 





(A? +1)? @ 








Cos 8 


Fic. 12. The calculated behavior of the extrapolation func- 
tion (A?+1)%b, at the end of the physical region for the case 
discussed. 


IV. CONCLUSIONS 


We are now able to apply the error estimates de- 
veloped in Sec. II C. For example, recall that for one 
case in which the cross section was measured at nine 
specific points, the error was found to be Aao~ 1008. 
If Aap is the error in (A*+1)*#, at the point A?=—1, 
then é is the error (assumed constant) in ®, at each 
measured point. From Fig. 12 we see that if Aao>2 the 
extrapolation yields no useful information. This requires 
a constant absolute error in ®, less than 0.02. Figure 11 
shows that if F,=0.62, this means a percent error 
ranging from 2% at 6=0 to 6% at 6=90° and 20% at 
6= 180°. Even this estimated upper limit of accuracy is 


considerably better than that achieved by Panofsky and 
Allton? in an electroproduction experiment in which 
only £ and \ (not A*) were determined. Thus it appears 
that the proposed extrapolation, although possible in 
principle, will require experiments of great accuracy. 
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A method of testing the hypothesis that there is global symmetry of the pion-baryon interaction is 
proposed: Upon analyzing low-energy K~—p scattering data, one finds a variety of scattering length solu- 
tions which are compatible with the elastic scattering, charge exchange scattering, and total charged hyperon 
production. Our suggestion involves the use of the experimental =+/Z> ratios (a) to test the global sym 
metry hypothesis (or any other quantitative description of the pion-hyperon interaction), (b) to reduce 
the ambiguity in the K—N scattering length solutions, (c) hence, to predict the 2° cross section. We need 
to know the x—Y phase shifts, in addition to the K—N scattering lengths in order to predict the 2+, => 
cross sections. If there is global symmetry of pion-baryon interactions, then we know the x— Y phases in the 
absence of a K—N reaction channel. We demonstrate how the actual x—Y phase shifts can be obtained, in 
a non perturbative manner, from idealized x— Y phases (i.e., in the absence of K—N reactions) and the K—N 
scattering lengths. Earlier proposals making use of the dependence of the hyperon production ratios on the 
a —Y phase shifts are also examined in terms of this result. Certain fits to the present rough data with scat- 
tering lengths of negative real part are shown to be incompatible with global symmetry. The proposed 
analysis involves the assumption that the K is an isotopic doublet. 


I. INTRODUCTION Wyld? that it may be useful to describe (1) and (2) 
along with the sum of (3) and (4) in the scattering 
length (and effective range) approximation. They have 
given convenient expressions for this purpose. On the 
other hand, Amati and Vitale* noted the possibility of 
relating the scattering in the final pion-hyperon states 
with the cross sections (3)-(6). We suggested‘ that it 
seems feasible to combine these ideas and study the 
m—Y interaction from an empirical analysis of the 
K~—p reactions. 

The point is that the problem becomes even more 
ambiguous if one just introduces the r— Y phase shifts 
as free parameters to be fitted to the data (i.e., the 
~+/Z- ratios). A whole variety of scattering length 
solutions, a, for the K—N system already exist which 
fit (1), (2), and (3)+ (4). What we do is to assume a 
model for the interaction. This model (e.g., global sym- 
metry) would give us a definite prediction for the 
“separated” #—Y phase shifts, 5o(7V), i.e., the phase 
shifts in the absence of a K—N reaction channel.! It is 
the purpose of this paper to discuss the 2 methods* 
which have been proposed for determining the actual 
m—Y phase shifts, 6(rY), in the presence of a K-N 
reaction channel from 6o(7Y) (and the scattering 
lengths, a), and suggest a much better approximation! : 
o(K-): K-+p—K-+9, (1) The first suggestion* was to approximate the phase of 
o(K°): ‘ — R+n, (2) the reaction channel by 4o(7V). In B, we approximated 

the pion-hyperon K matrix element, c, by tand(rV). 
o(2*): attr, (3) In A, we derived an expression for c which involves 


o(2- , Tate, (4) 2 Jackson, Ravenhall, and Wyld, Nuovo cimento 9, 834 (1958). 

o(2?): ita", (5) iM gone a . —— Maen Thee al Sdn Con- 

a(A): Ate, O iaivomersk 

It has been pointed out by Jackson, Ravenball, and —seid'Silt eee, Sint gy aeang 
re intermediate state. Thus in the Chew-Low one-meson approxi- 


T is the purpose of this paper to suggest a quanti- 
tative test for various theories of the pion-hyperon 
interaction. Any theory that gives a definite prediction 
for pion-hyperon scattering in the absence of a K—N 
reaction channel may be tested by analyzing K~—p 
reactions. Our proposal is based on the results of a 
paper! in which we considered the general problem of 
several open coupled two-particle channels near the 
threshold for one of them. In A, we derived relations 
(e.g., in the case of two open channels) which enable us 
to determine the actual phase shift in the second 
channel, from knowledge of the actual phase shift in 
the first channel and the phase shift in the second 
channel if it were decoupled from the first, in a nonper- 
turbative manner. These relations should be quantita- 
tively useful for interactions of short, well-defined range. 
We want to consider, in this paper, low-energy 
(100 Mev) K~- incident on protons; thus we need 
only deal with incident s waves. We will assume that 
K-—K® form an isotopic spin J=} (equal parity) 
doublet. There are five open two-body channels which 
are coupled to the incident channel. We denote the six 
cross sections by 


* Work supported in part by the National Science Foundation. mation (for both K and 7m), we would calculate the “separated”’ 
1M. Ross and G. Shaw, Ann. Phys. (N. Y.) (to be published). phase omitting all reference to K interactions except that the 
We shall refer to this paper as A. constants include certain renormalization effects due to K’s. 
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bo(V) and a. We will refer to these approximations as 
I, II, and III, respectively. 

An analysis of the data would amount to fitting (1), 
(2), (3), and (4) as a function of energy (over the s-wave 
region) simply by finding the appropriate a’s and using 
the 6o0(7Y) from the model. 

In Sec. II we present the necessary relations, between 
the scattering lengths for the K—N system and c, 
which one would use for an analysis of the data. In 
order to discuss the approximations I, II, III in Sec. 
III, it is useful to present, here, a brief description of 
the experimental situation. 

At present cross sections for processes (1)—(4) have 
been experimentally determined® at low energy (< 100 
Mev). The cross sections (5) and (6) for production of 
the neutral hyperons have not, as yet, been measured. 
However, it is important to note that the at rest 
capture of K~ by protons’ produces very few A’s. The 
at rest data, which probably include some p state 
capture in addition to s state capture, give the branching 
ratios 

2~/2t/2°/A=4/2/2/ 3}. (7) 


There are several interesting features of the in-flight 
data: There is a K°—K~ mass difference® which leads 
to a threshold laboratory energy of 8 Mev for charge 
exchange scattering. Since the cusp effect on the 2+/2- 
ratio,‘ above the threshold for (2), will not change any 
general arguments which we shall make, we shall ignore 
the K-—K® mass difference in this paper. We refer the 
reader to Jackson and Wyld® for a (straightforward) 
method for including this mass difference, as well as 
Coulomb effects. 

In addition, the charge exchange cross section is 
small over the entire energy range. A qualitative aspect 
is the rapid variation of the 2+/Z~ ratio. However, the 
data are too preliminary to consider this large energy 
dependence as established. 


II. RELATIONS BETWEEN THE CROSS SECTIONS 
AND x—Y SCATTERING 


The scattering operator T and the reaction operator 
K are given by the usual relations: 


S=1+T=(1—K)(14+3iK)-. (8) 


The Hermiticity of K insures the unitarity of S. Also, 
the matrix elements of T and K are defined as 


T= —2nid(E;—E;)T;;, 
namie K;;= 2765(E,— E;)K;;', 


® Nordin, Rosenfeld, Solmitz, Tripp, and Watson, Bull. Am. 
Phys. Soc. 4, 24 (1959); A. H. Rosenfeld, Bull. Am. Phys. Soc. 
3, Ser. II, 363 (1958); Ascoli, Hill, and Yoon, Nuovo cimento 9, 
813 (1958); R. S. White (private communication). 

7L. Alvarez et al., University of California Radiation Labora- 
tory Report UCRL-3775, 1957 (unpublished). 

® Rosenfeld, Solmitz, and Tripp, Phys. Rev. Letters 2, 110 
(1959) ; F. Crawford et al., Phys. Rev. Letters 2, 112 (1959). 

*J. D. Jackson and H. W. Wyld, Phys. Rev. Letters 2, 355 
(1959) ; Nuovo cimento (to be published). 
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where £ is the total energy in the center-of-mass 
system. In a given isotopic spin state 7, we expand 7” 
and K’ in terms of eigenvalues of orbital angular mo- 
mentum / and total angular momentum J to obtain 


Tas! (IJ) — Kas’ (lJ) =2i Oy Kay! (lJ) pT yp’ (lJ), (10) 


where the Greek subscripts refer to channels. When y 
is a two-particle channel (which is the case we will be 
interested in), then 


py=k/dk,/dE,, (11) 


where k, is the momentum in the center-of-mass 
system. (We use units 4=c=1.) If we make the trans- 
formation 

T.p'= — pa 'Tasps* (12) 
along with a similar one for Kas’, we get the matrix 
equation 

T=(1—iK)"'K, (13) 

where there is a submatrix along the diagonal for each 
set of quantum numbers /, J, J, etc. The diagonal 
element of this 7 matrix has the usual form 


T aa(lJ) = {exp[2i6aa(lJ) ]—1}/2i, (14) 


where 6 is the phase shift. The total cross section from 
a to B, for the case where a consists of one particle with 
spin } and one with spin zero, is, for a particular isotopic 
spin state, 


oap=4rka? > 1,4(J +4) | Tas(lJ) |?. (15) 


We shall be dealing, in this paper, only with an 
initial K-—p state J=0. The following modification 
must be made if the intrinsic parity of 8, P(@), is not 
equal to that of a: If P(K—N) is the same as P(r—Y), 
the relevant —Y phase shifts are the s; ones. If 
P(K—N) is minus P(r—Y), then the 6(r—Y) that 
enter, enter are the p; ones. 

In order to describe the K—N processes, we need the 
T matrix in both isotopic spin zero and one states. For 
example, we write,’ for 7=0, 


( T(K) 
Tea 
| 7°(Z)| exp(i¢") 


| 7°(~)| exp(i¢") 
). (16) 
T° (1) 


where the first row refers to K—V and the second to 
a— . Thus," 7°(xr) is the amplitude for r+ — 2+ 
in the J=0 state. Note that @ is a real number. We then 


” The T matrix for a given J and J, is symmetric if time reversal 
holds. 

1 All numerical superscripts (with the exception of those on the 
hyperons or mesons which refer to charge) denote the isotopic 
spin state J. The subscript 0 [with the exception of that on the 
effective range (31)] refers to the “separated” quantity, i.e., the 
quantity which would exist if the coupling between channels 
vanished. 
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have 


(17) 


Kk > — 
o( nari T'(R)+T(R) |, 
Ke 


o(2*+)=4rkX$ | (1/v2)|7(2)| 
X exp(ip') F (1/v3) | T°(Z)| exp(i¢") 2°, 
o (2°) =4rkX§ | T°(2) |’, 
o (A) =4rkX3|T(A)|*, 


(18) 
(19) 
(20) 


where k is the momentum of the incident A~ in the 
center-of-mass system. 
Let us consider how many independent quantities 


we need to describe the A—.V processes. For J=0, we 


have 
P?m @& 
“(9 
oe of 


where d°, e°, and c° are real, since K is symmetric as well 
as Hermitian. Thus three real parameters determine 
the T matrix (16). Let us introduce the complex scat- 
tering length a’ for the incident channel in the isotopic 
spin state J: 


k coté! (KN) = —1/a!(k) = —[A1(k) —iB(k) P, 


(21) 


(22) 


where 5(KN) is the complex phase shift in the K—N 
channel and A and B are real (note that B is a positive 
quantity). We choose to express all the scattering 
amplitudes in terms of A, B, and c. Then, from (13) 
using (14), (16), (21), and (22), we obtain (after some 
algebra) for the two-channel problem: 


" 1 “ 
T°(K)= -1(— +t) ; 
a°(k) 


T(z) = exp (i) (RB)? | 1+-7ka®| : 
ALL + (2°)2] 
¢=tan |e- a gly =| 
1+k(B+0A4°) 
T(x) =[coté°(d) —7 |", 
ikBL1+ (o)?] 
tand°(xZ) = + ' 
1-+ik(A°— cB) 
d°=k(B°— A), 
&={kBL1+ (0)? }}}. 


(23) 


(24) 


where 


(25) 


where 


(27) 


(28) 
(29) 
For energies below the K—WN threshold, we continue 
(23)-(27) according to the relation 

k—ilk|. (30) 
We may exhibit the energy dependence of a(k) in the 
effective-range approximation; i.e., to order k? we have 


1, a(k)= (1, a)—4rok?, (31) 
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where ro is the effective range. For the K~—? reactions 
it is very likely that we may ignore ro for k<0.6 f— 
(1 f=1 fermi=10-" cm); we shall discuss this in Sec. 
IIT. In general, we also have 


c(k)=c+c,k?+---. 


However, for the energy region in which we are inter- 
ested, it is a good approximation to neglect the energy 
dependence of c. Thus for small k, we consider A°, B® 
and ¢® in (23)—(29) to be constants. (Therefore d° « k 
and ¢° « k}.) 

Now, as noted previously, the first approximation, I, 
for ¢° is given by 


tang?= tan 609 (42) =co", (32) 


where 60°(72) is the ‘‘separated” pion-sigma J=0 
phase shift, ie., that calculated when there is no 
coupling to the K—N channel.’ The second approxi- 
mation, IT, is 
RAT 1+ (co)? ] 
tang? = coo —-——— ‘ 
1+k( B+ c,°A°) 


(33) 


In A, we derived the following expression for c°: 
B°+-co°(R—A°) 
( R- d 1°) = cB 


0- 


(34) 


CA 


where R is a constant approximately equal to the range 
of the diagonal interaction in the K—N channel. (See 
A for an exact definition of the constant R—it is 
referred to as Z in A.) The relation (34) for c® should 
be accurate for interactions of short, well-defined range. 
This is the basis for approximation IIT; knowing c,°, A®°, 
B® (and R), we determine c,° from (34) and use this ¢® 
in (25) to calculate ¢°. Thus to completely determine 
the T matrix in the 7=0 state, we need to know the 
two observable quantities A° and B® and the theoretical 
phase shift 6o°(w2). 

The J=1 case is more complicated. The real K 
matrix can be represented as 


d} e! g) 
K'= i. a 
g! hi PJ 


The first row corresponds to the K—N channel, the 
second to r—, and the third to r—A. Here, we need 
to know six parameters to determine 7", e.g., co!, ho! 
and jo! (the pion A! matrix elements in the absence 
of any coupling to the K—N channel), the complex 
scattering length a! (=A!—7B'), and the ratio 

T'(2)|?, (35) 
which is a constant for small momentum k. We then 
could find relations, similar to (34), from which we could 
calculate the actual K! matrix elements c', /', and /', 
and hence determine 7". 
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If we consider the situation where ¢o', io!, and jo! are 
negligible as essentially they would be with global sym- 
metry (both the s; and p; pion-nucleon phases shifts are 
small), then we obtain! the simple results: 


(RB)! 
(2) =exp(i¢') (36) 


\1+ika"| (1+a)! 


kA'{1+[c'(1+a) }*} 
tang! = c!(1+a) -————— (37) 
1+4(B'+c\(1+a). 7 


where 


and our approximation ca! to c is given by 


B 
(38) 


~ (1+a)(R—A') 


Thus for the /=1 system, we need to know the three 
observables A', B', and a. It might be implied from the 
at-rest data, (7), that a<1. 


III. ANALYSIS OF DATA AND DISCUSSION 


In this section, we will present a rough analysis of 
the K~— data, to illustrate our proposal, and discuss 
some of the approximations made. 

Our first step is to analyze the data o(K-), o(K°), 
and o(=*+)+o(=-). We shall assume that the A pro- 
duction is negligible, or more specifically thdt a=0. 
Then, the two phase shifts 6’(A.V) are sufficient to 
portray this data. In fact, within the fairly large errors 
of the experimental points, we find that a scattering 
length approximation [a/(k) =a" ] is sufficient to deter- 
mine the energy dependence of the A—.V s-state scat- 
treing phase shifts: 

k coté!(KN)=—1/a’=—(A!—iB!), [=0,1. 

We find a variety of solutions which work: The A’s 
must both be the same sign [to keep o(K®) small], 
either positive or negative. The absorption can be 
divided in any way between the /=0 and 1 states. We 
illustrate the families of solutions to in-flight data with 
a few cases. We can have all J=1 absorption’ +; 

a'=+1.20—0.871, =+0.58; (39) 
roughly equal J=0 and J=1 absorption, as discussed 
by Dalitz and Tuan," for example, 

a'=+0.40—0.411, 


a°= +1.88—0.821, (40) 


and 


a'=+1.62—0.391, a°=+0.20—0.787; (41) 


or all J=0 absorption’: 


a'=+0.88, a9=+0.53—1.19i. (42) 


12 All lengths will be measured in fermis (f), and all momenta 
in f. 1 fermi=10~" cm. 

1% R. H. Dalitz and S. F. Tuan, 
(1959); Ann. Phys. (to be published). 
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On the basis of the K~—p capture at rest and K-—d 
capture at rest, regardless of the complexity of the 
analysis of this data, we can exclude the extreme solu- 
tion (39). We must consider, however, all cases with 
B> B'. 

Now we come to the problem of fitting the o(2*)/o(2~) 
ratio. This ratio appears to change from ~} at k=0.3 
to ~2 at k=0.4 to ~1 at k=0.6 and then stays ~1. 
Since the data are preliminary, let us content ourselves 
with exploring the implications of o(2*+)/o(2~) being 
approximately equal to one at k=0.6. Whether a 
solution leads to a rapidly changing ratio as a function 
of energy will also be discussed. 

To separate the charged hyperons, we need to know 
something about pion-hyperon scattering. Due to our 
knowledge of the pion-nucleon system, the most con- 
venient assumption to make is that there is global 
symmetry in the pion-baryon interactions. Then we 
have the pion K-matrix elements, in the absence of the 
K—N reaction channels given by! 

co = 3 K!+ 2K! ], 

co= Ki}, 

ho =4v2.K!— K?], 

jo =3(2Ki+ K'] 
where K/ are the pion-nucleon K-matrix elements. Our 
results turn out to be essentially independent of the 
parity P(K—J) since both the s; and #; pion-nucleon 


phase shifts are small. Thus for illustrative purposes, 
let the A particles be pseudoscalar so that 


=tand*! (2), 


(43) 


where 6(7V) are the s-state r—.V phase shifts. Evalu- 
ated at the appropriate pion momentum, which is 
essentially a constant over the range in K~ energy we 
are working with, we get 

1=—0.14, K!=0.22, 
with which we obtain 


co=0.22, 
hj = —0.17, 


Co =(), 10, 


jo'= 0.02. (4) 


As we note in Sec. II, for the 7=0 system, we need the 
three quantities A°, B®, and c° (and R) to determine 
the 7° matrix, whereas for 7=1, we need six quantities: 
A!, B', co, ho, jo', and a. Since co, o', and jo! are so 
small, we may use Eqs. (36)—(38). ry, we will 
assume that jo! and Ao! are negligible and co! is small but 
not negligible. Then with a=0, the 3X3 T! matrix has 
been effectively reduced to a system of two coupled 
channels. In the following discussion we shall assume 
that the 7=0 and 1 systems are described by similar 
two-channel T matrices, i.e., we ignore the A channel. 


4 Senate. Miller, Murray, Schwartz, and Bull. Am, 
Phys. Soc. Ser. IT, 3, 363 (1958). 


15M. Ross, Phys. Rev, 112, 986 (1958). 


Taft, 
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Let us consider the various methods for determining 
the =+/Z~- ratio. We can discuss the extreme type of 
solution (42) before going further. The ratio is equal 
to one, independent of the phases; it can show no 
variation with energy. Now, we consider the more 
complex cases (40) and (41) at the various levels of 
approximation discussed in Sec. IT: 


I: tan¢d’=cy', ie., the phase of the production T 
matrix element is taken as the “separated” r—Z scat- 
tering phase. Referring to (25) we see that I embodies 
two approximations: 

(i) RA(1+c?)/(1+kB+kAc)<1, 

(ii) C= Co. 
The first approximation, (i), is good at zero energy. It 
becomes very poor for the observed scattering lengths 
at about k 20.3 (i.e., for lab lenergies of 10 Mev and 


above). Approximation (ii) can be studied using (34). 
We see that c= cp if 


R-—A>B (and coB). (45) 


In certain cases this relation is roughly satisfied. If the 
range R is about the pion Compton wavelength, 1.4, 
it will be satisfactory for small or negative A. If A <0, 
it will usually be satisfied. However, we probably 
should consider R~0.5, near the K Compton wave- 
length.' It is clear that (45) will not be satisfied for 


many solutions we want to consider, 

IT: c!=co’, i.e., the diagonal K matrix element in the 
a— channel is taken from the “separated” xr—Z 
scattering. This approximation, which involves only 
approximation (ii) of I (it is equivalent to I at zero 
energy), formed the basis of the analysis B. Both 
procedures I and II lead to a 2~/2* ratio about 10 or 
greater unless we adopt the extreme type solution (42). 

III: c! is given by ca’, Eq. (34). Let us consider the 
four solutions (40) and (41), and assume that the 
K-—N interaction is characterized by a range equal to 
the K Compton wavelength. Using (34) and the “sepa- 
rated” #—Z phases, co’, from (44), we calculated the 
>-/=+ ratio. The results are given in Table I for an 
energy corresponding to k=0.6. We see that: (1) The 
negative A solutions are not compatible with weak 
“separated” —Y scattering, as predicted by global 
symmetry; (2) The positive A solutions may be com- 
patible with the present data and small do(rV). The 
great role of the energy-dependent second term on the 
right-hand side of (25) is evidenced by the difference 
between tang and c. Add to this the sensitivity of the 
~+/=- ratio to the phases, ¢, and we see that there will 
be very rapid variation of }+/~ as a function of energy. 
It should be hard to fit a particular energy dependence 
and we may hope that only few satisfactory solutions 
could be found. Such fitting must await more de- 
tailed data. 

An important feature of any detailed analysis is the 
effective range, ro, in the expansion (31) for a(k). We 
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TaBLE I. The ratio ~/E* calculated at k=0.6 (using approxi- 
mation ITI) with R=0.5, co°=0.22 and co! =0.10 for the scattering 
length solutions (40) and (41). 


o cos (¢! —¢°) 


247 


1.24 0.60 


1.19 


0.17 0.90 


1.33 


2.24 0.61 


1.13 


0.93 0.98 


have shown, in A, that a necessary condition for 
|ro'|>>R is that Im (1/a’) (measured in f-'), times 
some kinematical factors which are ~ 1 for this problem, 
is much greater than 1. The data indicate that the 
Im (1/a’)<1. Thus we feel that ro’ should prove to 
be small. 

We can summarize our proposal by stating that a 
given theory of r—Y interactions may be tested by 
evaluating the r— Y scattering omitting K—WN reaction 
channels, and then combining the results of this 
“separated” scattering theory with empirical observa- 
tion of A—N processes via (34) in order to predict the 
hyperon production ratio 2+/2~. In the example shown 
above, the global symmetry model was considered. We 
concluded only that the scattering length solutions with 
negative A of Dalitz and Tuan" are not compatible'® 
with present data. Data on the energy dependence of 
~+/2~ and an understanding of the capture at rest 
data should allow us to draw some strong conclusions 
with this type of analysis. It is important to note that 
we need not be restricted to an analysis which neglects 
A production: (35)—(38) indicated how the generaliza- 
tion is made to a three channel 7' matrix when the 
“separated” +—Y phase shifts, 6o'(rV), are small. To 
embody a theory in which the 6o!(rY) are large, similar 
(but more involved) relations may be derived (see A)}. 

In conclusion, we want to emphasize the importance 
of relation (34): Dalitz and Tuan" have noted that a 
perturbation treatment of the effect of the K-baryon 
coupling on r—Y scattering is a very poor approxima- 
tion. We agree with this statement, and point out that 
(34) enables us to calculate r—Y scattering from the 
“separated”” *—Y phase shifts, 6o(7Y), in a nonper- 
turbative manner. Thus one can theoretically investi- 
gate r—Y scattering in the absence of a K—N reaction 
channel without assuming that the coupling to this 
reaction channel is small. 
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Ferromagnetic and Antiferromagnetic Curie Tem- 
peratures, H. A. Brown AND J. M. LUTTINGER 
[ Phys. Rev. 100, 685 (1955) ]. A calculational error 
in the published value of k7./J for the body- 
centered cubic lattice with spin 1 found by the 
Bethe-Peierls-Weiss method has been corrected. 
The corrected values are k7’./J=8.66 and 8.96 for 
the ferromagnetic and antiferromagnetic lattices, 
respectively, for which the values given in the 
above reference are 8.72 and 9.03. This correction 
still is far from Weiss’s earlier result! and the recent 
remarks of Van Vleck? as to the doubtfulness of 
that result still apply. 

Thanks are given to J. S. Smart, who found the 
error and to D. F. Morgan, who calculated the 
correct result. Since the error involved using 
incorrect values of w(S,) [see Eq. (19) of the 
above |, only this particular case is affected. 

1P. R. Weiss, Phys. Rev. 74, 1493 (1948). 

2 J. H. Van Vleck, J. phys. radium 20, 124 (1959). 


Effective Mass of Electrons in Gallium Arsenide, 
L. C. Barcus, A. PERLMUTTER, AND J. CALLAWAY 
[Phys. Rev. 111, 167 (1958) ]. A recent redetermi- 
nation of the carrier concentration in the sample of 
GaAs used in the measurement of the index of 
refraction gave N=1.05X10'® cm™* (instead of 
6.9X 10"? cm~* previously reported). Consequently, 
the computed effective-mass ratio is m*/m=0.065 
(instead of 0.043). We are indebted to Dr. Emil 
Arnold of Sylvania Laboratories for making this 
measurement. 


Scattering of Protons from Helium and Level 
Parameters in Li’, Puitirp D. MILLER AND G. C. 
Pui.uirs [ Phys. Rev. 112, 2043 (1958) ]. The values 
of k used for the dispersion-theory analysis of the 
phase shifts are incorrect by a constant factor due 
to a numerical error. The figures and conclusions 
are correct, however, if all radii and reduced 
widths are multiplied by 1.177. This requires 6,” in 
Table VI to be multiplied by 1.39. The corrected 
Table VI now reads: 

TaBLeE VI. Parameters for the first two states of Li® as 


given by the scattering of protons from He‘, using an inter- 
action radius of 3.1 107!4 cm. 


Exci- 
tation 
energy 
(Mev) 


(Ey )e.m 


(Mev) 


Exes 


( ) lat 
(Mev) 


Spin Parity yp? 
(h) 7 (Mev-cm) 
2.6 3.9 


0 4 
i 10.8 16.9 


| 

8.6 4 35.3 

The authors are grateful to Dr. R. J. N. Phillips 
for pointing out this error. 
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General Relativity and Particle Dynamics, L. H. 
Tuomas [Phys. Rev. 112, 2129 (1958) ]. In the 
first sentence of the second paragraph of the 
abstract the words ‘the product of two matri- 
ces . . ."’ should be replaced by “the product of a 
matrix and the Hermitian conjugate of a 
second...” 

In the third sentence of the fourth-from-last 
paragraph of the paper, the words ‘. . . of the 
matrix product of two of them . ’ should be 
replaced by “... of the matrix product of 
one of them with the Hermitian conjugate of 
another .. .”’ 

The second sentence in the third-from-last 
paragraph should be changed to read: “It is an 
infinitesimal unitary transformation of the larger 
space if A, and B, are Hermitian, and it further 
maintains the Hermitian nature of F if it has the 
form 


F’= F+.e>, (A,FB,—B,FA,), (4.2) 


where A, and B, are Hermitian, while any trans- 
formation which does this can be reduced to this 
form.” 

The next-to-last and last paragraph should be 
replaced by the following: 

“Any infinitesimal transformation of classical 
mechanics can be put in the form, with A, and B, 
dynamical variables, 

F’=F+e)> ,.B,(A,,F), (4.4) 


where (A,F) means the Poisson bracket, and the 
condition it has a unit multiplier is 


> ers (A be) =(). 


We may regard the form, with A, and B, Hermitian 
matrices, 


(4.5) 


2r € 
F’ = F+—— D> {(B,A,+A,B,)F+2A,FB, 


th 4r.s 


—F(B,A,+A,B,)—2B,FA,}, (4.6) 


subject to 


>... (A,B,—B,A,) =0, (4.7) 


as a quantization of the classical transformation: 
it is of the form (4.2) with (4.3). Moreover it can 
be written 


2r € 
F’=F+ > {B,(A,F—FA,) 
Diag 


th 2 


+(A,F—FA,)B,}, (4.8) 


and, as in the classical case, any dynamical variable 
commuting with each of A, is invariant for this 
transformation. 

The Einstein theory of Sec. 3 can be at once 
quantized if the operators for the homogeneous 
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Lorentz group are supposed to be in canonical 
form, while the operators for space and time 
displacement are of the more general kind, forming 
a four-vector for the homogeneous Lorentz group. 
The conditions for a Riemannian space are then 
naturally taken to be the result of quantizing (3.12) 
in the above manner, which can be done in the 
above way. The classical limit will then be a theory 
of motion of a dynamical system in a Riemannian 
space determined by the system. 

These operators are expected to have enough of 
the properties of the infinitesimal operators of 
classical theory to allow us to extend that theory. 
Even if the difficulty that our observers and test 
particles are defined by g numbers rather than c 
numbers makes it impossible to set up a Riemann 
space except on the classical limit, that would 
perhaps be sufficient.'*”’ 


Metastability of 2s States of Hydrogenic Atoms, 
J. SHarrro AnD G. Breit [Phys. Rev. 113, 179 
(1959) ]. Addendum: After the publication of the 
above-mentioned paper, our attention has been 
called to related work by Spitzer and Greenstein,! in 
which the probability of double photon emission to 2s 
states of hydrogenic atoms has been calculated for 
the case Z=1. There is excellent agreement in the 
values obtained for the transition probability in 
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the two papers, the value of Spitzer and Greenstein 
being 8.277 sec~! and that in the above-mentioned 
reference, 8.226+0.001 sec~'. The latter publi- 
cation goes somewhat beyond the earlier one in this 
connection in the consideration of the Z dependence, 
of the accuracy of the result, and the discussion of 
the distinction between the double emission and 
cascade processes especially in connection with the 
Lamb shift. 


1L. Spitzer and J. L. Greenstein, Astrophys. J. 114, 407 


(1951). 


Effect of Impurities of Angular Correlation of 
Positron Annihilation Radiation, R. L. pEZAFRA 
[Phys. Rev. 113, 1547 (1959) ]. The title as it now 
reads is incorrect. It should read: “Effect of 
Impurities on Angular Correlation of Positron 
Annihilation Radiation.” 


Fixed Angle Dispersion Relations for Nucleon- 
Nucleon Scattering, M. Cini, S. Fupini, ANp A. 
STANGHELLINI [Phys. Rev. 114, 1633 (1959) ]. In 
Table III the residues B and C of the poles for the 
amplitude 7yo+7% are incorrect. These items on 
the fourth line should be replaced by: 


TatTiw —v2f?M(1—c?)! 2v2f?*M(1—c*)!. 
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